AI updates
2024-12-22 17:05:53 Pacific

Meta's Llama 3.3 70B: Efficient Open-Source Large Language Model - 12d
Read more: siliconangle.com

Meta has unveiled Llama 3.3 70B, a new open-source large language model (LLM) boasting significant improvements in efficiency and cost-effectiveness. This 70-billion-parameter model achieves output quality comparable to its larger predecessor, Llama 3.1 405B, at a fraction of the infrastructure cost. Meta reports that Llama 3.3 70B is nearly five times more cost-efficient to run, thanks to an optimized Transformer architecture and improved attention mechanisms that lower inference costs. The model was trained on 15 trillion tokens of public web data plus over 25 million synthetic examples, using Nvidia H100-80GB GPUs.

The model underwent further refinement through supervised fine-tuning and reinforcement learning from human feedback (RLHF), improving its performance and alignment with user preferences. Benchmark comparisons show Llama 3.3 70B trailing Llama 3.1 405B by less than 2% in several tests, surpassing it in others, and generally outperforming OpenAI's GPT-4o. Meta highlights the drastic cost savings: processing and generating a million tokens costs just 10 cents and 40 cents respectively, compared with $1 and $1.80 for Llama 3.1 405B. The model weights for Llama 3.3 70B are publicly available on Hugging Face, making advanced AI capabilities accessible to a broader range of developers and researchers.
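The savings implied by those per-million-token prices can be sketched with a quick calculation. This is an illustration using only the figures quoted above; real costs depend on the hosting provider, and the `inference_cost` helper is hypothetical, not part of any Meta or Hugging Face API.

```python
# Rough cost comparison using the per-million-token prices quoted above
# (input: $0.10 vs $1.00; output: $0.40 vs $1.80). Illustrative only.

def inference_cost(input_tokens: int, output_tokens: int,
                   in_price: float, out_price: float) -> float:
    """Cost in dollars, given per-million-token prices."""
    return (input_tokens / 1e6) * in_price + (output_tokens / 1e6) * out_price

# A workload of 1M input tokens and 1M output tokens:
llama_33_70b = inference_cost(1_000_000, 1_000_000, 0.10, 0.40)
llama_31_405b = inference_cost(1_000_000, 1_000_000, 1.00, 1.80)

print(f"Llama 3.3 70B:  ${llama_33_70b:.2f}")
print(f"Llama 3.1 405B: ${llama_31_405b:.2f}")
print(f"Ratio: {llama_31_405b / llama_33_70b:.1f}x")
```

For this input/output mix the ratio works out to 5.6x, broadly consistent with the "nearly five times more cost-efficient" figure Meta cites; the exact multiple shifts with the balance of input versus output tokens.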