Introducing improvements to the fine-tuning API and expanding our custom models program

Assisted Fine-Tuning

At DevDay last November, we announced a Custom Model program designed to train and optimize models for a specific domain, in partnership with a dedicated group of OpenAI researchers. Since then, we have met with dozens of customers to assess their custom model needs and evolved our program to further maximize performance.

Today, we are formally announcing our assisted fine-tuning offering as part of the Custom Model program. Assisted fine-tuning is a collaborative effort with our technical teams to leverage techniques beyond the fine-tuning API, such as additional hyperparameters and various parameter-efficient fine-tuning (PEFT) methods at a larger scale. It is particularly helpful for organizations that need support setting up efficient training data pipelines, evaluation systems, and bespoke parameters and methods to maximize model performance for their use case or task.

For example, SK Telecom, a telecommunications operator serving over 30 million subscribers in South Korea, wanted to customize a model to be an expert in the telecommunications domain, with an initial focus on customer service. They worked with OpenAI to fine-tune GPT-4 to improve its performance in telecom-related conversations in the Korean language. Over the course of multiple weeks, SKT and OpenAI drove meaningful performance improvement in telecom customer service tasks: a 35% increase in conversation summarization quality, a 33% increase in intent recognition accuracy, and an increase in satisfaction scores from 3.6 to 4.5 (out of 5) when comparing the fine-tuned model to GPT-4.

Custom-Trained Model

In some cases, organizations need to train a purpose-built model from scratch that understands their business, industry, or domain. Fully custom-trained models imbue new knowledge from a specific domain by modifying key steps of the model training process using novel mid-training and post-training techniques. Organizations that see success with a fully custom-trained model often have large quantities of proprietary data (millions of examples or billions of tokens) that they want to use to teach the model new knowledge or complex, unique behaviors for highly specific use cases.

For example, Harvey, an AI-native legal tool for attorneys, partnered with OpenAI to create a custom-trained large language model for case law. While foundation models were strong at reasoning, they lacked the extensive knowledge of legal case history and other knowledge required for legal work. After testing out prompt engineering, RAG, and fine-tuning, Harvey worked with our team to add the depth of context needed to the model, the equivalent of 10 billion tokens' worth of data. Our team modified every step of the model training process, from domain-specific mid-training to customizing post-training processes and incorporating expert attorney feedback. The resulting model achieved an 83% increase in factual responses, and attorneys preferred the customized model's outputs 97% of the time over GPT-4.
