!fosai
@lemmy.world
Meta has released and open-sourced Llama 3.1 in three sizes: 8B, 70B, and 405B.
This new Llama iteration and update brings state-of-the-art performance to open-source ecosystems.
If you've had a chance to use Llama 3.1 in any of its variants - let us know how you like it and what you're using it for in the comments below!
For this release, we evaluated performance on over 150 benchmark datasets that span a wide range of languages. In addition, we performed extensive human evaluations comparing Llama 3.1 with competing models in real-world scenarios. Our experimental evaluation suggests that our flagship model is competitive with leading foundation models, including GPT-4, GPT-4o, and Claude 3.5 Sonnet, across a range of tasks. Additionally, our smaller models are competitive with closed and open models that have a similar number of parameters.
As our largest model yet, training Llama 3.1 405B on over 15 trillion tokens was a major challenge. To enable training runs at this scale and achieve our results in a reasonable amount of time, we significantly optimized our full training stack and pushed model training to over 16,000 H100 GPUs, making the 405B the first Llama model trained at this scale.
See also: the paper, The Llama 3 Herd of Models:
8B
Meta-Llama-3.1-8B
Meta-Llama-3.1-8B-Instruct
Llama-Guard-3-8B
Llama-Guard-3-8B-INT8
70B
Meta-Llama-3.1-70B
Meta-Llama-3.1-70B-Instruct
405B
Meta-Llama-3.1-405B-FP8
Meta-Llama-3.1-405B-Instruct-FP8
Meta-Llama-3.1-405B
Meta-Llama-3.1-405B-Instruct
You can download the models directly from Meta or one of our download partners: Hugging Face or Kaggle.
Alternatively, you can work with ecosystem partners to access the models through the services they provide. This approach can be especially useful if you want to work with the Llama 3.1 405B model.
Note: Llama 3.1 405B requires significant storage and computational resources, occupying approximately 750GB of disk storage space and requiring two nodes in MP16 (16-way model parallelism) for inference.
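To see why the 405B needs two nodes, a back-of-envelope estimate helps. This is a rough sketch only: the 750GB figure above also covers the original BF16 checkpoint and auxiliary files, not just one set of weights, and the per-GPU number ignores activations and KV cache.

```python
def weight_size_gb(params_billions: float, bytes_per_param: float) -> float:
    """Approximate weight storage in GB (using 1 GB = 1e9 bytes)."""
    return params_billions * bytes_per_param

bf16 = weight_size_gb(405, 2.0)  # ~810 GB for the BF16 checkpoint
fp8 = weight_size_gb(405, 1.0)   # ~405 GB for the FP8 weights

# MP16 = 16-way model parallelism, e.g. two nodes of eight 80 GB GPUs:
per_gpu = fp8 / 16               # ~25 GB of weights per GPU, leaving
                                 # headroom for activations and KV cache
print(f"BF16: {bf16} GB, FP8: {fp8} GB, per GPU (MP16): {per_gpu} GB")
```

Even quantized to FP8, the weights alone exceed any single GPU's memory, which is why inference is sharded across 16 devices.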
Learn more at:
Linux
Windows
Mac
Cloud
How-to Fine-tune Llama 3.1 models
Quantizing Llama 3.1 models
Prompting Llama 3.1 models
Llama 3.1 recipes
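On the prompting side, Llama 3.1 instruct models expect a specific chat template built from special tokens. Below is a minimal sketch; the special tokens match the published Llama 3/3.1 format, but `format_prompt` itself is a hypothetical helper — in practice the tokenizer's `apply_chat_template` does this for you.

```python
def format_prompt(system: str, user: str) -> str:
    """Build a single-turn Llama 3.1 instruct prompt by hand.
    The model's reply is expected after the final assistant header."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

prompt = format_prompt(
    "You are a helpful assistant.",
    "Summarize Llama 3.1 in one sentence.",
)
print(prompt)
```

Each turn is delimited by `<|start_header_id|>role<|end_header_id|>` and closed with `<|eot_id|>`; generation stops when the model emits its own `<|eot_id|>`.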
Rowan Cheung - Mark Zuckerberg on Llama 3.1, Open Source, AI Agents, Safety, and more
Matthew Berman - BREAKING: LLaMA 405b is here! Open-source is now FRONTIER!
Wes Roth - Zuckerberg goes SCORCHED EARTH.... Llama 3.1 BREAKS the "AGI Industry"
1littlecoder - How to DOWNLOAD Llama 3.1 LLMs
Bloomberg - Inside Mark Zuckerberg's AI Era | The Circuit
https://qwenlm.github.io/blog/qwen2.5/
Introduction: In the past three months since Qwen2's release, numerous developers have built new models on the Qwen2 language models, providing us with valuable feedback. During this period, we have focused on creating smarter and more knowledgeable language models. Today, we are excited to introduce the latest addition to the Qwen family: Qwen2.5. We are announcing what might be the largest open-source release in history!
Does Llama3 use any other model for generating images? Or is it something that llama3 model can do by itself?
Can Llama3 generate images with ollama?
https://huggingface.co/CohereForAI/c4ai-command-r-08-2024
https://miniza.pages.dev/opensource/israeli-voice-recognition-startup-unveils-model-faster-than-openai
Hi everybody, a huge part of my job is talking to colleagues and clients on the phone, and at the end of those calls I have to write a summary of what happened, plus any key points I need to follow up on.
I figured it would be an excellent task for an LLM.
It would need to intercept the phone call and transcribe the dialogue.
Then afterwards I would want to summarize it.
I'm not talking about Teams meetings or anything like that, I'm talking about a traditional phone call from one mobile phone to another.
I understand that that could be two different pieces of software, and that would be fine, but I am wondering if there is any such tool out there, or a tool in the making?
If you have any leads, I'd love to hear them.
Thank you so much
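The transcribe-then-summarize pipeline described above usually needs one extra step: long calls rarely fit in a model's context window in one pass. Here is a minimal map-reduce-style sketch; the ~4-characters-per-token estimate and both helper names are assumptions, and the actual summarizer is left as a stub for whatever transcription and LLM tools you end up choosing.

```python
def chunk_transcript(text: str, max_tokens: int = 2000,
                     chars_per_token: int = 4) -> list[str]:
    """Split a transcript into chunks that roughly fit a context window,
    preferring sentence boundaries. Token count is a crude estimate."""
    max_chars = max_tokens * chars_per_token
    chunks, current = [], ""
    for sentence in text.replace("\n", " ").split(". "):
        candidate = (current + ". " + sentence) if current else sentence
        if len(candidate) > max_chars and current:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks

def summarize_call(transcript: str, summarize) -> str:
    """Map-reduce summary: summarize each chunk, then summarize the
    partial summaries. `summarize` is whatever LLM call you wire in."""
    partials = [summarize(chunk) for chunk in chunk_transcript(transcript)]
    if len(partials) == 1:
        return partials[0]
    return summarize("\n".join(partials))
```

The same skeleton works whether the transcription comes from an on-device recorder app or a speech-to-text model, since it only ever sees plain text.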
https://www.youtube.com/watch?v=zjkBMFhNj_g
https://mistral.ai/news/mistral-large-2407/
Today, we are announcing Mistral Large 2, the new generation of our flagship model. Compared to its predecessor, Mistral Large 2 is significantly more capable in code generation, mathematics, and reasoning. It also provides much stronger multilingual support and advanced function-calling capabilities.
https://old.reddit.com/r/StableDiffusion/comments/1do5gvz/the_open_model_initiative_invoke_comfy_org/
https://www.nature.com/articles/d41586-024-02012-5
Many of the large language models that power chatbots claim to be open, but restrict access to code and training data.