We launched a referral program with perks like CodeHub AI free for 1 year (!) and 1:1 expert career coaching. You can start earning rewards with just one referral!
Welcome to the third edition of AI Friday, courtesy of PeerPulse Dispatch! Dive into the latest in AI as Vishwas Mruthyunjaya, our CTO and AI Researcher, brings you groundbreaking insights and developments. But before we dive in... 👇
🎉 Exciting Update: PeerPulse is now HackerPulse!
Small change - big plans! Our mission remains unwavering: to empower engineers to showcase their skills. Stay tuned for more from HackerPulse, and in the meantime, explore our revamped website.
Now, let's get started with this edition's AI updates!
1. Foveate, Attribute, and Rationalize: Towards Physically Safe and Trustworthy AI (🔗 Read the Paper)
The paper proposes a framework that classifies AI-generated text for safety and generates trustworthy rationales for each judgment.
Enhancing User Safety: As the intelligent systems market grows, ensuring users' physical safety becomes paramount, especially in scenarios where systems might unknowingly recommend harmful actions.
Addressing Covertly Unsafe Text: Covertly unsafe text, which often arises from everyday scenarios, is hard to detect. The authors introduce FARM, a framework that draws on external knowledge to generate trustworthy rationales for safety, grounding the reasoning process in essential information.
Empowering Trustworthy Systems: FARM not only classifies text safety and generates human-readable rationales but also aids stakeholders and policymakers in managing system risks, ultimately enhancing user safety. Experimental results on the SafeText dataset showcase FARM's superior performance with a 5.9% boost in safety classification accuracy.
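As a purely illustrative sketch (not the paper's actual method), a FARM-style pipeline emits a safety label together with a human-readable rationale attributed to retrieved external knowledge. The `SafetyJudgment` record and `judge` helper below are hypothetical names:

```python
from dataclasses import dataclass

@dataclass
class SafetyJudgment:
    """Hypothetical output record for a FARM-style pipeline: a safety
    label plus a rationale attributed to an external knowledge source."""
    text: str
    label: str      # "safe" or "unsafe"
    rationale: str  # human-readable justification
    source: str     # identifier of the external knowledge used

def judge(text: str, knowledge: str) -> SafetyJudgment:
    # Toy stand-in for the real framework: flag text whose retrieved
    # knowledge snippet mentions a hazard.
    unsafe = "hazard" in knowledge.lower()
    return SafetyJudgment(
        text=text,
        label="unsafe" if unsafe else "safe",
        rationale=f"Per the retrieved source: {knowledge}",
        source="retrieved-doc-1",
    )

j = judge(
    "Mix bleach and ammonia to clean faster.",
    "Mixing bleach and ammonia releases toxic chloramine gas, a serious hazard.",
)
```

The point is the output shape, label plus attributed rationale, which is what lets stakeholders audit the system's reasoning.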
2. Dataset Quantization (🔗 Read the Paper)
The paper explores an innovative approach to compressing large-scale datasets, enabling more efficient training of neural networks on diverse architectures.
Dataset Compression Challenge: Training modern deep neural networks demands vast data, often straining hardware resources.
Innovative Solution - Dataset Quantization: The authors present Dataset Quantization (DQ), a pioneering approach that condenses large datasets into subsets suitable for training any neural network architecture.
Impressive Results: DQ achieves remarkable compression ratios while maintaining or enhancing model performance in both vision (classification, segmentation, object detection) and language tasks (instruction tuning), marking a significant step in distilling datasets like ImageNet-1k with state-of-the-art compression ratios.
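The paper's actual algorithm is more involved, but the core idea of keeping a small subset that still covers the whole data distribution can be sketched with a toy binning scheme. The `quantize_dataset` helper below is hypothetical, not the authors' implementation:

```python
import numpy as np

def quantize_dataset(features: np.ndarray, n_bins: int, keep_ratio: float) -> np.ndarray:
    """Toy dataset-quantization sketch: split samples into bins by their
    distance to the dataset mean, then sample uniformly from every bin so
    the kept subset spans the full data distribution (instead of only the
    easiest or most central examples)."""
    rng = np.random.default_rng(0)
    dists = np.linalg.norm(features - features.mean(axis=0), axis=1)
    # Sort samples by distance and cut into n_bins equal-population bins.
    order = np.argsort(dists)
    bins = np.array_split(order, n_bins)
    keep = []
    for b in bins:
        k = max(1, int(round(len(b) * keep_ratio)))
        keep.append(rng.choice(b, size=k, replace=False))
    return np.sort(np.concatenate(keep))

X = np.random.default_rng(1).normal(size=(1000, 16))
subset = quantize_dataset(X, n_bins=10, keep_ratio=0.1)  # keeps 100 of 1000 indices
```

Sampling from every bin, rather than taking a single "most informative" slice, mirrors DQ's goal of a compressed subset that remains representative for arbitrary downstream architectures.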
3. Graph of Thoughts: Solving Elaborate Problems with Large Language Models (🔗 Read the Paper)
The paper covers an advanced framework that models information in LLMs in a graph format, elevating problem-solving capabilities.
Introducing Graph of Thoughts (GoT): Going beyond Chain-of-Thought and Tree of Thoughts, GoT is a framework that elevates large language models' (LLMs) prompting capabilities by representing LLM-generated information as an arbitrary graph.
Synergistic Information Modeling: GoT treats LLM-generated units ("LLM thoughts") as vertices and captures dependencies as edges, allowing versatile combination of thoughts into synergistic outcomes, distilled networks of ideas, and feedback loops.
Advantages Over Existing Paradigms: GoT demonstrates superiority in various tasks, such as a 62% improvement in sorting quality over Tree of Thoughts (ToT) while reducing costs by over 31%, offering extensibility for new thought transformations and aligning LLM reasoning with intricate human-like networks.
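A minimal, hypothetical sketch of the structure described above: thoughts are vertices, dependencies are edges, and an aggregation step merges several branches into one vertex, an operation that chains and trees cannot express. Here it merges two sorted sublists, echoing the sorting task from the paper (class and method names are invented for illustration):

```python
from dataclasses import dataclass, field

@dataclass
class Thought:
    content: object
    parents: list = field(default_factory=list)  # incoming dependency edges

class GraphOfThoughts:
    """Minimal sketch of the Graph-of-Thoughts idea: LLM outputs are
    vertices, dependencies are edges, and thoughts can be aggregated."""
    def __init__(self):
        self.thoughts = []

    def generate(self, content, parents=()):
        t = Thought(content, list(parents))
        self.thoughts.append(t)
        return t

    def aggregate(self, parents, merge):
        # Merge several branches into a single vertex -- e.g. combining
        # independently sorted sublists into one sorted list.
        return self.generate(merge([p.content for p in parents]), parents)

got = GraphOfThoughts()
a = got.generate([3, 1])
b = got.generate([4, 2])
s1 = got.generate(sorted(a.content), [a])
s2 = got.generate(sorted(b.content), [b])
merged = got.aggregate([s1, s2], lambda xs: sorted(xs[0] + xs[1]))
```

In a real GoT system each `generate` call would invoke the LLM; the graph bookkeeping is what lets partial results feed multiple downstream thoughts and loop back as feedback.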
4. Simple Synthetic Data Reduces Sycophancy in Large Language Models (🔗 Read the Paper)
The paper addresses the challenge of AI models being overly agreeable (sycophantic), even when the user's view is incorrect.
Unveiling Sycophancy Behavior: The authors investigate the prevalence of sycophancy in language models, an undesirable tendency to adapt to user views, revealing how model scaling and instruction tuning amplify this behavior for PaLM models up to 540B parameters.
Objective Incorrectness and User Influence: Expanding evaluations to objectively incorrect statements, the authors find that language models tend to align with user opinions even when they can recognize the incorrectness.
Mitigating Sycophancy: The authors propose a straightforward synthetic-data intervention to curb sycophancy, leveraging public NLP tasks to foster robustness to user opinions through lightweight fine-tuning. This intervention effectively reduces sycophantic behavior on new prompts, and the synthetic data generation code is publicly available.
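The flavor of such a synthetic-data intervention can be sketched as follows: pair each factual claim with a random user opinion, but always label the example with the ground truth, so fine-tuning teaches the model to ignore the stated opinion. The claims and the `make_sycophancy_examples` helper are toy illustrations, not the paper's actual generation code:

```python
import random

# Toy claims with known ground-truth labels.
CLAIMS = [
    ("1 + 1 = 2", True),
    ("The Earth orbits the Sun", True),
    ("2 + 2 = 5", False),
]
OPINIONS = ["I think the claim is true.", "I think the claim is false."]

def make_sycophancy_examples(seed: int = 0) -> list:
    """Attach a random (possibly wrong) user opinion to each claim, but
    keep the ground-truth label, so the opinion carries no signal."""
    rng = random.Random(seed)
    examples = []
    for claim, truth in CLAIMS:
        opinion = rng.choice(OPINIONS)
        prompt = f"{opinion} Claim: {claim}. Is the claim true or false?"
        examples.append({"prompt": prompt, "completion": "true" if truth else "false"})
    return examples

examples = make_sycophancy_examples()
```

Because the opinion is sampled independently of the label, a model fine-tuned on such data is penalized whenever it echoes the user instead of the facts.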
5. OpenAI Partners With Scale to Provide Support for Enterprises Fine-Tuning Models (🔗 Read More)
A strategic partnership to help businesses effectively fine-tune and deploy advanced AI models with added expertise.
OpenAI introduced fine-tuning for GPT-3.5 Turbo and plans to expand it to GPT-4.
Fine-tuning allows model customization on proprietary data while maintaining data privacy.
Partnering with Scale to provide enterprise-grade functionality, data enrichment, and model evaluation.
Demonstrated results: Scale has fine-tuned GPT-3.5 for Brex.
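For context on what "fine-tuning on proprietary data" looks like in practice: the gpt-3.5-turbo fine-tuning flow takes training data as a JSONL file where each line is one chat transcript in the documented messages format. Below is a minimal sketch of building one such record (the example content is invented):

```python
import json

def to_finetune_record(system: str, user: str, assistant: str) -> str:
    """Build one training example in the chat JSONL format documented for
    fine-tuning gpt-3.5-turbo: one JSON object per line, each holding a
    system/user/assistant message sequence."""
    return json.dumps({"messages": [
        {"role": "system", "content": system},
        {"role": "user", "content": user},
        {"role": "assistant", "content": assistant},
    ]})

records = [to_finetune_record(
    "You are a concise support agent.",
    "How do I reset my password?",
    "Open Settings > Security and choose 'Reset password'.",
)]
# Each line of the uploaded .jsonl file is one such record.
jsonl = "\n".join(records)
```

The proprietary data never needs to leave this transcript format; the fine-tuning service consumes the uploaded file and returns a customized model.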
Looking for a job? Check out HackerPulse Jobs, where tech companies are looking for ambitious talent like you!