AI Companions @lemmy.world pavnilschanda @lemmy.world 6mo ago

[Resource] Llama3 70B Successfully Deployed on a Single 4GB GPU

huggingface.co Run the strongest open-source LLM model: Llama3 70B with just a single 4GB GPU!

A Blog post by Gavin Li on Hugging Face

The open-source language model Llama3 has been released, and it has been confirmed that it can be run locally on a single GPU with only 4GB of VRAM using the AirLLM framework. Llama3's performance is comparable to GPT-4 and Claude3 Opus, and its success is attributed to its massive increase in training data and technical improvements in training methods. The model's architecture remains unchanged, but its training data has increased from 2T to 15T, with a focus on quality filtering and deduplication. The development of Llama3 highlights the importance of data quality and the role of open-source culture in AI development, and raises questions about the future of open-source models versus closed-source ones in the field of AI.

Summarized by Llama 3 70B Instruct

You're viewing a single thread.

4 comments

Only works on apple silicon. Am I reading that right?
- No, they just mention that only Apple silicon is supported if you're using MacOS