Flash 1.5, Gemma 2 and Project Astra

1.5 Flash excels at summarization, chat applications, image and video captioning, data extraction from long documents and tables, and more. This is because it’s been trained by 1.5 Pro through a process called “distillation,” where the most essential knowledge and skills from a larger model are transferred to a smaller, more efficient model.
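To make the idea of distillation concrete, here is a minimal sketch of the classic soft-target formulation, in which a small "student" model is penalized for diverging from the softened output distribution of a larger "teacher." This is an illustration only: the post does not describe how 1.5 Flash was actually distilled, and the function names, the temperature value and the example logits below are all assumptions made for the sketch.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a higher temperature flattens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL divergence from the teacher's softened distribution to the student's.

    Hypothetical helper for illustration; not the method used for Gemini.
    """
    teacher = softmax(teacher_logits, temperature)
    student = softmax(student_logits, temperature)
    return sum(
        p * math.log(p / q) for p, q in zip(teacher, student) if p > 0
    )


# Made-up logits over three classes.
teacher_logits = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]
far_student = [0.5, 1.0, 4.0]

print(distillation_loss(teacher_logits, close_student))
print(distillation_loss(teacher_logits, far_student))
```

In training, a loss like this would be minimized with respect to the student's parameters, so a student whose predictions track the teacher's is penalized less than one whose predictions diverge.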

Read more about 1.5 Flash on the Gemini technology page, and learn about 1.5 Flash’s availability and pricing. We’ll share more details in an updated Gemini 1.5 technical report soon.

Significantly improving 1.5 Pro

Over the last few months, we’ve significantly improved 1.5 Pro, our best model for general performance across a wide range of tasks.

Beyond extending its context window to 2 million tokens, we’ve enhanced its code generation, logical reasoning and planning, multi-turn conversation, and audio and image understanding through data and algorithmic advances. We see strong improvements on public and internal benchmarks for each of these tasks.

1.5 Pro can now follow increasingly complex and nuanced instructions, including ones that specify product-level behavior involving role, format and style. We’ve improved control over the model’s responses for specific use cases, like crafting the persona and response style of a chat agent or automating workflows through multiple function calls. And we’ve enabled users to steer model behavior by setting system instructions.

We added audio understanding in the Gemini API and Google AI Studio, so 1.5 Pro can now reason across image and audio for videos uploaded in Google AI Studio. And we’re now integrating 1.5 Pro into Google products, including Gemini Advanced and Workspace apps.

Read more about 1.5 Pro on the Gemini technology page. More details are coming soon in our updated Gemini 1.5 technical report.

Gemini Nano understands multimodal inputs

Gemini Nano is expanding beyond text-only inputs to include images as well. Starting with Pixel, applications using Gemini Nano with Multimodality will be able to understand the world the way people do: not just through text, but also through sight, sound and spoken language.

Learn extra about Gemini 1.0 Nano on Android.
