Researchers have developed a technique that enables artificial intelligence (AI) programs to better map three-dimensional spaces using two-dimensional images captured by multiple cameras. Because the technique works effectively with limited computational resources, it holds promise for improving the navigation of autonomous vehicles.
“Most autonomous vehicles use powerful AI programs called vision transformers to take 2D images from multiple cameras and create a representation of the 3D space around the vehicle,” says Tianfu Wu, corresponding author of a paper on the work and an associate professor of electrical and computer engineering at North Carolina State University. “However, while each of these AI programs takes a different approach, there is still substantial room for improvement.
“Our technique, called Multi-View Attentive Contextualization (MvACon), is a plug-and-play supplement that can be used in conjunction with these existing vision transformer AIs to improve their ability to map 3D spaces,” Wu says. “The vision transformers aren’t getting any additional data from their cameras; they’re just able to make better use of the data.”
MvACon works by modifying an approach called Patch-to-Cluster attention (PaCa), which Wu and his collaborators introduced last year. PaCa allows transformer AIs to more efficiently and effectively identify objects in an image.
“The key advance here is applying what we demonstrated with PaCa to the challenge of mapping 3D space using multiple cameras,” Wu says.
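To give a rough sense of the idea behind Patch-to-Cluster attention, here is a minimal NumPy sketch. It is an illustration only, not the authors' implementation: instead of every image patch attending to every other patch (quadratic in the number of patches), patches attend to a small set of cluster tokens, which is much cheaper. The random projection standing in for a learned clustering module, and the specific shapes, are assumptions for the example.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def patch_to_cluster_attention(patches, num_clusters=8, seed=0):
    """Illustrative sketch of patch-to-cluster attention.

    Rather than N x N patch-to-patch attention, each of the N patches
    attends to M << N cluster tokens, reducing the attention cost from
    O(N^2) to O(N * M). The cluster assignment here is a fixed random
    projection; in the real method it is learned end to end.
    """
    n, d = patches.shape
    rng = np.random.default_rng(seed)
    # Stand-in for a learned clustering module: score each patch
    # against each cluster, then normalize over patches so every
    # cluster token is a weighted average of patch features.
    w_cluster = rng.standard_normal((d, num_clusters)) / np.sqrt(d)
    assign = softmax(patches @ w_cluster, axis=0)      # (n, m)
    clusters = assign.T @ patches                      # (m, d) cluster tokens
    # Attention: queries are patches, keys/values are cluster tokens.
    attn = softmax(patches @ clusters.T / np.sqrt(d))  # (n, m)
    return attn @ clusters                             # (n, d) contextualized patches

# Example: 196 patch tokens (a 14 x 14 grid) with 32-dim features.
tokens = np.random.default_rng(1).standard_normal((196, 32))
out = patch_to_cluster_attention(tokens, num_clusters=8)
print(out.shape)  # (196, 32)
```

The output has the same shape as the input, which is what makes this kind of module easy to drop into an existing transformer: it replaces a standard attention step without changing the surrounding architecture.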
To test the performance of MvACon, the researchers used it in conjunction with three leading vision transformers: BEVFormer, the BEVFormer DFA3D variant, and PETR. In each case, the vision transformers were collecting 2D images from six different cameras. In all three cases, MvACon significantly improved the performance of each vision transformer.
“Performance was particularly improved when it came to locating objects, as well as the speed and orientation of those objects,” says Wu. “And the increase in computational demand of adding MvACon to the vision transformers was almost negligible.
“Our next steps include testing MvACon against additional benchmark datasets, as well as testing it against actual video input from autonomous vehicles. If MvACon continues to outperform existing vision transformers, we’re optimistic that it will be adopted for widespread use.”
The paper, “Multi-View Attentive Contextualization for Multi-View 3D Object Detection,” will be presented June 20 at the IEEE/CVF Conference on Computer Vision and Pattern Recognition, being held in Seattle, Wash. First author of the paper is Xianpeng Liu, a recent Ph.D. graduate of NC State. The paper was co-authored by Ce Zheng and Chen Chen of the University of Central Florida; Ming Qian and Nan Xue of the Ant Group; and Zhebin Zhang and Chen Li of the OPPO U.S. Research Center.
The work was done with support from the National Science Foundation, under grants 1909644, 2024688 and 2013451; the U.S. Army Research Office, under grants W911NF1810295 and W911NF2210010; and a research gift fund from Innopeak Technology, Inc.