Google DeepMind at NeurIPS 2023

Analysis

Printed

8 December 2023

In direction of extra multimodal, sturdy, and normal AI programs

Subsequent week marks the beginning of the thirty seventh annual convention on Neural Data Processing Techniques (NeurIPS),the most important synthetic intelligence (AI) convention on this planet. NeurIPS 2023 will likely be going down December 10-16 in New Orleans, USA.

Groups from throughout Google DeepMind are presenting greater than 180 papers on the most important convention and workshops.

We’ll be showcasing demos of our innovative AI fashions for world climate forecasting, supplies discovery, and watermarking AI-generated content material. There may even be a possibility to listen to from the group behind Gemini, our largest and most succesful AI mannequin.

Right here’s a take a look at a few of our analysis highlights:

Multimodality: language, video, motion

UniSim is a common simulator of real-world interactions.

Generative AI fashions can create work, compose music, and write tales. However nevertheless succesful these fashions could also be in a single medium, most wrestle to switch these abilities to a different. We delve into how generative skills might assist to study throughout modalities. In a highlight presentation, we present that diffusion fashions can be utilized to categorise photographs with no further coaching required. Diffusion fashions like Imagen classify photographs in a extra human-like means than different fashions, counting on shapes relatively than textures. What’s extra, we present how simply predicting captions from photographs can enhance computer-vision studying. Our method surpassed present strategies on imaginative and prescient and language duties, and confirmed extra potential to scale.

Extra multimodal fashions might give technique to extra helpful digital and robotic assistants to assist individuals of their on a regular basis lives. In a highlight poster, we create brokers that might work together with the digital world like people do — by means of screenshots, and keyboard and mouse actions. Individually, we present that by leveraging video era, together with subtitles and closed captioning, fashions can switch data by predicting video plans for actual robotic actions.

One of many subsequent milestones might be to generate lifelike expertise in response to actions carried out by people, robots, and different kinds of interactive brokers. We’ll be showcasing a demo of UniSim, our common simulator of real-world interactions. One of these expertise might have functions throughout industries from video video games and movie, to coaching brokers for the true world.

Constructing protected and comprehensible AI

An artist’s illustration of synthetic intelligence (AI). This picture depicts AI security analysis. It was created by artist Khyati Trehan as a part of the Visualising AI venture launched by Google DeepMind.

Giant Language Fashions can generate spectacular solutions, however are liable to “hallucinations”, textual content that appears right however is made up. Our researchers elevate the query of whether or not a technique to discover a truth saved location (localization) can allow modifying the very fact. Surprisingly, they discovered that localization of a truth and modifying the situation doesn’t edit the very fact, hinting on the complexity of understanding and controlling saved info in LLMs. With Tracr, we suggest a novel means of evaluating interpretability strategies by translating human-readable packages into transformer fashions. We’ve open sourced a model of Tracr to assist function a ground-truth for evaluating interpretability strategies.

When creating and deploying giant fashions, privateness must be embedded at each step of the way in which. For coaching, our groups are finding out find out how to measure if language fashions are memorizing information – to be able to defend non-public and delicate materials. In parallel, our researchers display find out how to consider privacy-preserving coaching with a way that’s environment friendly sufficient for real-world use. In one other oral presentation, our scientists examine the restrictions of coaching by means of “scholar” and “instructor” fashions which have completely different ranges of entry and vulnerability if attacked.

Emergent skills

An artist’s illustration of synthetic intelligence (AI). This picture imagines Synthetic Normal Intelligence (AGI). It was created by Novoto Studio as a part of the Visualising AI venture launched by Google DeepMind.

As giant fashions grow to be extra succesful, our analysis is pushing the boundaries of recent skills to develop extra normal AI programs.

Whereas language fashions are used for normal duties, they lack the mandatory exploratory and contextual understanding to resolve extra complicated issues. We introduce the Tree of Ideas, a brand new framework for language mannequin inference to assist fashions discover and motive over a variety of potential options. By organizing the reasoning and planning as a tree as an alternative of the generally used flat chain-of-thoughts, we display {that a} language mannequin is ready to clear up complicated duties like “recreation 24” far more precisely.

To assist individuals clear up issues and discover what they’re searching for, AI fashions must course of billions of distinctive values effectively. With Characteristic Multiplexing, one single illustration house is used for a lot of completely different options, permitting giant embedding fashions (LEMs) to scale to merchandise for billions of customers.

Lastly, with DoReMi we present how utilizing AI to automate the combination of coaching information sorts can considerably pace up language mannequin coaching and enhance efficiency on new and unseen duties.

Fostering a worldwide AI neighborhood

We’re proud to sponsor NeurIPS, and help workshops led by LatinX in AI, QueerInAI, and Ladies In ML, serving to foster analysis collaborations and creating a various AI and machine studying neighborhood. This yr, NeurIPS can have a artistic monitor that includes our Visualising AI venture, which commissions artists to create extra numerous and accessible representations of AI.

In the event you’re attending NeurIPS, come by our sales space to study extra about our cutting-edge analysis and meet our groups internet hosting workshops and presenting throughout the convention.

Study extra

Source link

Google DeepMind at NeurIPS 2023

Eliminating Vector Quantization: Diffusion-Based Autoregressive AI Models for Image Generation

Voyage Multilingual 2 Embedding Evaluation | by Lars Wiik | Jun, 2024

Eric Evans receives Department of Defense Medal for Distinguished Public Service | MIT News

ISO and ASTM define standard for additive manufacturing in construction

UltraFastBERT: Exponentially Faster Language Modeling

Recommended For You

Eliminating Vector Quantization: Diffusion-Based Autoregressive AI Models for Image Generation

Voyage Multilingual 2 Embedding Evaluation | by Lars Wiik | Jun, 2024

Eric Evans receives Department of Defense Medal for Distinguished Public Service | MIT News

Imperva optimizes SQL generation from natural language using Amazon Bedrock

AI in Manufacturing: Overcoming Data and Talent Barriers

UltraFastBERT: Exponentially Faster Language Modeling

Revolutionizing Healthcare: Exploring the Impact and Future of Large Language Models in Medicine

Creative Robot Tool Use with Large Language Models – Machine Learning Blog | ML@CMU

Leave a Reply Cancel reply

A technique for more effective multipurpose robots | MIT News

Helping robots grasp the unpredictable | MIT News

The Current State of AI! (My Personal News Recap)

Robotics investments reach $418M in November 2023

2024 World Battery & Energy Storage Industry Expo (WBE)

MIT faculty, instructors, students experiment with generative AI in teaching and learning | MIT News

What is AI – Artificial Intelligence in Telugu | Future of AI | TeluguBadi

A method to enable safe mobile robot navigation in dynamic environments

Robot Talk Episode 90 – Robotically Augmented People

Eliminating Vector Quantization: Diffusion-Based Autoregressive AI Models for Image Generation

RBR50 Spotlight: Slip Robotics minimizes trailer loading times with simple approach

Voyage Multilingual 2 Embedding Evaluation | by Lars Wiik | Jun, 2024

Coval upgrades its CVGC Carbon Vacuum Gripper with an even more versatile second generation

CATEGORIES

SITE MAP

Welcome Back!

Retrieve your password

Google DeepMind at NeurIPS 2023

You might also like

Multimodality: language, video, motion

Constructing protected and comprehensible AI

Emergent skills

Fostering a worldwide AI neighborhood

Study extra

ISO and ASTM define standard for additive manufacturing in construction

UltraFastBERT: Exponentially Faster Language Modeling

Recommended For You

Leave a Reply Cancel reply

CATEGORIES

SITE MAP

Welcome Back!

Retrieve your password