Social Skill Training With LLMs
Self-reflection and action planning? What's next, AI therapy sessions?
Welcome to another edition of AI Fridays, an essential weekly breakdown of the most compelling AI news and breakthroughs! This edition is brought to you by HackerPulse in collaboration with AIModels.fyi.
Mixture-of-Depths in Transformer-Based Language Models
Unveiling the Double-Edged Sword of AI
"Think-and-Execute" for Improving the Algorithmic Reasoning Capabilities of LLMs
Efficient Streaming Language Models with Attention Sinks
Social Skill Training with LLMs
Mixture-of-Depths Technique in Transformer-Based Language Models (Read Paper)
The "Mixture-of-Depths" technique significantly increases the efficiency of transformer-based language models. By dynamically adjusting the depth of processing based on input complexity, this approach optimizes computational resources, improving speed without sacrificing accuracy. Analogous to assigning different readers to books of varying complexity, this method ensures that each input receives tailored processing, maximizing efficiency without compromising performance.
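The routing idea can be sketched in a few lines: a learned router scores each token, and only the top-k tokens within a fixed compute budget pass through the layer's block, while the rest ride the residual stream unchanged. This is a toy illustration, not the paper's implementation; the weights, activation, and `capacity` value below are all made up for the sketch:

```python
import numpy as np

def mod_layer(x, W, router_w, capacity=0.5):
    """Toy Mixture-of-Depths layer. x: (tokens, dim) activations;
    W: (dim, dim) stand-in block weights; router_w: (dim,) router
    weights; capacity: fraction of tokens the block may process."""
    scores = x @ router_w                  # one routing score per token
    k = max(1, int(capacity * len(x)))     # compute budget: k tokens
    chosen = np.argsort(scores)[-k:]       # top-k tokens get full compute
    out = x.copy()                         # skipped tokens: identity/residual
    out[chosen] = x[chosen] + np.tanh(x[chosen] @ W)  # processed tokens
    return out, chosen

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))
out, chosen = mod_layer(x, rng.normal(size=(4, 4)), rng.normal(size=4))
```

With `capacity=0.5`, half the tokens are transformed and half pass through untouched, which is where the compute savings come from.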
AI & the Problem of Knowledge Collapse (Read Paper)
Despite AI's ability to process vast amounts of data and generate insights, there is a risk that reliance on AI-generated content could harm public understanding, hindering innovation, creativity, and cultural richness. Presenting a model of how discounted access to AI-assisted processes shapes public beliefs, the researchers uncover a troubling finding: even a modest discount can drive beliefs to diverge significantly from the truth.
"Think-and-Execute" Improves Algorithmic Reasoning in LLMs (Read Paper)
The new "Think-and-Execute" framework aims to enhance the algorithmic reasoning capabilities of large language models (LLMs). Algorithmic reasoning, which involves breaking complex patterns down into logical steps, remains a challenge for LLMs despite their proficiency in other tasks. Previous methods attempted to use programming languages like Python but struggled to generate executable code efficiently. Think-and-Execute decomposes the reasoning process into two steps: discovering the task-level logic as pseudocode, then tailoring that pseudocode to specific instances.
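The two-phase shape of the framework can be sketched as a pair of prompts: one call per task to produce pseudocode, then one call per instance to simulate it. The prompt wording and the `stub_llm` below are placeholders for illustration, not the paper's actual prompts or model:

```python
def think(llm, task_description):
    # Phase 1 (once per task): discover task-level logic as pseudocode.
    return llm(f"Write pseudocode that solves: {task_description}")

def execute(llm, pseudocode, instance):
    # Phase 2 (per instance): simulate the pseudocode on one input.
    return llm(f"Execute this pseudocode step by step.\n"
               f"Pseudocode:\n{pseudocode}\nInput: {instance}")

def stub_llm(prompt):
    # Canned stand-in so the sketch runs; a real system calls a model API.
    if prompt.startswith("Write pseudocode"):
        return "for each char c in s: if c is a vowel: count += 1"
    return "3"  # pretend result of simulating the pseudocode on "banana"

plan = think(stub_llm, "count the vowels in a string")
answer = execute(stub_llm, plan, "banana")
```

The key design point is that phase 1 runs once per task, so its cost is amortized across every instance the pseudocode is later applied to.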
Efficient Streaming Language Models With Attention Sinks (Read Paper)
StreamingLLM is a framework that addresses critical challenges in deploying Large Language Models (LLMs) in streaming applications such as multi-round dialogue. Traditional LLMs struggle on two fronts: caching the Key and Value (KV) states of all previous tokens during decoding exhausts memory, and the models cannot generalize to texts longer than their training sequence length. StreamingLLM enables LLMs to process effectively unbounded sequence lengths without fine-tuning. By leveraging the attention sink phenomenon and a specialized placeholder token, StreamingLLM achieves stable and efficient language modeling on text sequences up to 4 million tokens long.
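The cache policy behind this can be sketched simply: always keep the KV states of the first few tokens (the attention sinks) plus a sliding window of the most recent tokens, and evict everything in between. The class name and sizes below are illustrative, and plain integers stand in for KV tensors:

```python
from collections import deque

class SinkKVCache:
    """Toy StreamingLLM-style KV cache: the first `n_sinks` entries are
    pinned forever; the rest live in a sliding window of size `window`."""
    def __init__(self, n_sinks=4, window=8):
        self.n_sinks = n_sinks
        self.sinks = []                    # pinned attention-sink entries
        self.recent = deque(maxlen=window) # auto-evicts the oldest entry

    def add(self, kv):
        if len(self.sinks) < self.n_sinks:
            self.sinks.append(kv)          # first few tokens become sinks
        else:
            self.recent.append(kv)         # window drops oldest when full

    def view(self):
        # What attention actually sees: sinks + recent window.
        return self.sinks + list(self.recent)

cache = SinkKVCache(n_sinks=2, window=3)
for t in range(10):
    cache.add(t)  # token ids stand in for (key, value) tensors
# cache.view() -> [0, 1, 7, 8, 9]
```

Because the cache size is bounded regardless of how many tokens stream in, memory stays constant even over millions of tokens.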
Social Skill Training With LLMs (Read Paper)
LLMs have the potential to revolutionize social skill training by creating interactive simulations. The authors propose building training systems around the APAM framework (Awareness, Perspective-Taking, Adaptability, and Metacognition) while tapping into the capabilities of LLMs. By using LLMs to create immersive simulations and virtual characters capable of rich social interaction, the framework aims to make social skill training more accessible and engaging. With applications spanning education, mental health, and workplace training, this line of work could transform how social skills are developed.
Now, that's a wrap. Until our paths cross anew!