Meta’s latest AI model is free for all

Beneath the hood

Getting LLaMA 2 able to launch required plenty of tweaking to make the mannequin safer and fewer more likely to spew poisonous falsehoods than its predecessor, Al-Dahle says.

Meta has loads of previous gaffes to be taught from. Its language mannequin for science, Galactica, was taken offline after solely three days, and its earlier LlaMA mannequin, which was meant just for analysis functions, was leaked on-line, sparking criticism from politicians who questioned whether or not Meta was taking correct account of the dangers related to AI language fashions, equivalent to disinformation and harassment.

To mitigate the danger of repeating these errors, Meta utilized a mixture of completely different machine studying methods geared toward bettering helpfulness and security.

Meta’s strategy to coaching LLaMA 2 had extra steps than normal for generative AI fashions, says Sasha Luccioni, a researcher at AI startup Hugging Face.

The mannequin was educated on 40% extra information than its predecessor. Al-Dahle says there have been two sources of coaching information: information that was scraped on-line, and an information set fine-tuned and tweaked in accordance with suggestions from human annotators to behave in a extra fascinating means. The corporate says it didn’t use Meta person information in LLaMA 2, and excluded information from websites it knew had numerous private data.

Regardless of that, LLaMA 2 nonetheless spews offensive, dangerous, and in any other case problematic language, identical to rival fashions. Meta says it didn’t take away poisonous information from the information set, as a result of leaving it in may assist LLaMA 2 detect hate speech higher, and eradicating it might threat by chance filtering out some demographic teams.

However, Meta’s dedication to openness is thrilling, says Luccioni, as a result of it permits researchers like herself to review AI fashions’ biases, ethics, and effectivity correctly.

The truth that LLaMA 2 is an open-source mannequin can even enable exterior researchers and builders to probe it for safety flaws, which is able to make it safer than proprietary fashions, Al-Dahle says.

Liang agrees. “I am very excited to strive issues out and I feel will probably be helpful for the group,” he says.

Source link

Meta’s latest AI model is free for all

Eliminating Vector Quantization: Diffusion-Based Autoregressive AI Models for Image Generation

Voyage Multilingual 2 Embedding Evaluation | by Lars Wiik | Jun, 2024

Eric Evans receives Department of Defense Medal for Distinguished Public Service | MIT News

5 Surprising Benefits of Palletizing Ergonomics on Employee Mental Health

Bot inspired by baby turtles can swim under the sand

Recommended For You

Eliminating Vector Quantization: Diffusion-Based Autoregressive AI Models for Image Generation

Voyage Multilingual 2 Embedding Evaluation | by Lars Wiik | Jun, 2024

Eric Evans receives Department of Defense Medal for Distinguished Public Service | MIT News

Imperva optimizes SQL generation from natural language using Amazon Bedrock

AI in Manufacturing: Overcoming Data and Talent Barriers

Bot inspired by baby turtles can swim under the sand

Scientists develop stretchable robotic fabrics that enable error-prone robotic modules to march in formation

Parsec Receives Strategic Investment from BVP Forge to Accelerate Manufacturing Operations Management for Enterprises

Leave a Reply Cancel reply

A technique for more effective multipurpose robots | MIT News

Helping robots grasp the unpredictable | MIT News

The Current State of AI! (My Personal News Recap)

Robotics investments reach $418M in November 2023

2024 World Battery & Energy Storage Industry Expo (WBE)

MIT faculty, instructors, students experiment with generative AI in teaching and learning | MIT News

What is AI – Artificial Intelligence in Telugu | Future of AI | TeluguBadi

Zion Solutions Group Joins Forces with Locus Robotics to Supercharge Warehouse Productivity

A method to enable safe mobile robot navigation in dynamic environments

Robot Talk Episode 90 – Robotically Augmented People

Eliminating Vector Quantization: Diffusion-Based Autoregressive AI Models for Image Generation

RBR50 Spotlight: Slip Robotics minimizes trailer loading times with simple approach

Voyage Multilingual 2 Embedding Evaluation | by Lars Wiik | Jun, 2024

CATEGORIES

SITE MAP

Welcome Back!

Retrieve your password

Meta’s latest AI model is free for all

You might also like

Beneath the hood

5 Surprising Benefits of Palletizing Ergonomics on Employee Mental Health

Bot inspired by baby turtles can swim under the sand

Recommended For You

Leave a Reply Cancel reply

CATEGORIES

SITE MAP

Welcome Back!

Retrieve your password