makeasnek@lemmy.ml to AI@lemmy.ml · English · 10 months ago
LLM ASICs on USB sticks?
cross-posted to: [email protected]
Source: nostr https://snort.social/nevent1qqsg9c49el0uvn262eq8j3ukqx5jvxzrgcvajcxp23dgru3acfsjqdgzyprqcf0xst760qet2tglytfay2e3wmvh9asdehpjztkceyh0s5r9cqcyqqqqqqgt7uh3n Paper: https://arxiv.org/abs/2406.02528
Mike1576218@lemmy.ml · 10 months ago
llama2 gguf with 2-bit quantisation only needs ~5 GB VRAM; 8 bits need >9 GB. Anything in between is possible. There are even 1.5-bit and 1-bit options (not gguf, AFAIK). Generally, fewer bits means worse results, though.
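To put rough numbers on the bits-to-VRAM relationship described in that comment, here is a minimal sketch (not from the thread) that estimates only the weight-storage term for a hypothetical 7B-parameter model; actual memory use is higher once the KV cache, quantization block overhead, and runtime buffers are added, which is why the figures quoted above exceed these raw estimates.

```python
# Rough VRAM estimate for storing quantized weights at different bit widths.
# Assumes a hypothetical 7B-parameter model and counts ONLY the weight tensor;
# KV cache and runtime overhead are deliberately ignored.

def weight_vram_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate GB needed just to hold the quantized weights."""
    return n_params * bits_per_weight / 8 / 1e9  # bits -> bytes -> GB

if __name__ == "__main__":
    for bits in (1, 1.5, 2, 4, 8, 16):
        print(f"{bits:>4} bits/weight -> ~{weight_vram_gb(7e9, bits):.2f} GB (weights only)")
```

The sketch illustrates the linear trade-off the commenter points to: halving the bits per weight roughly halves the memory footprint of the weights, at the cost of output quality.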