Long prompts present a significant challenge for practical LLM-based systems that must operate with low latency and limited resources. We study prompt compression for zero-shot dialogue systems that learn to use unseen APIs directly in-context from their documentation, which may occupy hundreds of prompt tokens per API. We start from a recently introduced approach (Mu et al., 2023) that learns to compress the prompt into a few “gist token” activations during finetuning. However, this simple idea is ineffective for compressing API documentation, resulting in low accuracy compared to the baseline using an uncompressed prompt. In this work, we introduce two major improvements. First, we specialize gist tokens for different hierarchies within an API: we use one Gist_arg token to compress an argument and one Gist_value token to compress an acceptable value of a categorical argument. We then dynamically reveal Gist_value tokens only when they are needed. Second, we add a reconstruction loss to predict the API documentation from the gist tokens. On multiple API-calling tasks, our proposed system retains the simplicity, efficiency, and large compression factor (20x on SGD) of the gist token approach while achieving significantly better accuracy.
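To make the two improvements concrete, the following is a minimal sketch of the hierarchical gist-token layout and the auxiliary reconstruction objective. It is an illustration under stated assumptions, not the paper's actual implementation: the `<GIST_ARG:...>`/`<GIST_VALUE:...>` token strings, the `Argument` type, and the loss weight are all hypothetical.

```python
# Sketch only: hierarchical gist tokens with dynamic revealing, plus a
# weighted reconstruction loss. All names here are hypothetical.
from dataclasses import dataclass, field

@dataclass
class Argument:
    name: str
    doc: str                                          # documentation text to compress
    values: list[str] = field(default_factory=list)   # categorical values, if any

def build_gisted_prompt(api_args: list[Argument], active_arg: str | None) -> list[str]:
    """One Gist_arg token per argument; the Gist_value tokens of a
    categorical argument are revealed only when that argument is active."""
    tokens: list[str] = []
    for arg in api_args:
        tokens.append(f"<GIST_ARG:{arg.name}>")        # compresses arg.doc
        if arg.name == active_arg:                     # dynamic revealing
            tokens += [f"<GIST_VALUE:{arg.name}={v}>" for v in arg.values]
    return tokens

def total_loss(task_loss: float, reconstruction_loss: float, weight: float = 1.0) -> float:
    # Auxiliary objective: reconstruct the API documentation from the
    # gist-token activations (the weighting is an assumed hyperparameter).
    return task_loss + weight * reconstruction_loss
```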