First Hallucination-Free LLM; Fine-Tune or Retrieval; Privacy Issues in LLMs; New Embedding Model by Google; What Resilience Means and More.
Image generated by the author using DALL-E


Editor's Paper Recommendations

Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs: The ability of large language models (LLMs) to answer questions across a wide range of domains demonstrates that they encode significant factual information in their pre-trained weights. However, this knowledge is inherently limited and depends heavily on the characteristics of the training data. Consequently, using external datasets to incorporate new information, or to refine the model's handling of previously seen information, poses a significant challenge. This study compares two common approaches: fine-tuning and retrieval-augmented generation (RAG). We evaluate both approaches on a variety of knowledge-intensive tasks across different topics. Our findings reveal that while fine-tuning offers some improvement, RAG consistently outperforms it, both for knowledge encountered during training and for entirely new knowledge. Moreover, LLMs struggle to learn new factual information through fine-tuning; exposing them to numerous variations of the same fact during training may alleviate this problem.
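To make the fine-tuning-versus-RAG comparison concrete, here is a minimal Python sketch of the retrieval-augmented side. It is not the paper's pipeline: the knowledge base is a toy list, the retriever is a simple word-overlap scorer rather than a learned embedding model, and generate is a hypothetical stand-in for whatever LLM call you use. The point it illustrates is that RAG injects knowledge through the prompt while the model's weights stay frozen, whereas fine-tuning would update the weights themselves.

# Minimal RAG sketch: retrieve the most relevant passages from an external
# knowledge base and prepend them to the prompt, instead of fine-tuning the
# model's weights. Knowledge base, retriever, and `generate` are illustrative.
from collections import Counter

KNOWLEDGE_BASE = [
    "The Eiffel Tower was completed in 1889 for the Paris World's Fair.",
    "Retrieval-augmented generation conditions an LLM on retrieved passages.",
    "Fine-tuning updates a model's weights on task-specific examples.",
]

def score(query: str, passage: str) -> int:
    """Toy relevance score: count of overlapping lowercase word tokens."""
    q, p = Counter(query.lower().split()), Counter(passage.lower().split())
    return sum((q & p).values())

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the top-k passages by overlap score."""
    return sorted(KNOWLEDGE_BASE, key=lambda p: score(query, p), reverse=True)[:k]

def generate(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call (e.g., an API request)."""
    return f"<model completion for: {prompt[:60]}...>"

def rag_answer(question: str) -> str:
    """Inject retrieved knowledge via the prompt; the weights stay frozen."""
    context = "\n".join(retrieve(question))
    prompt = f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"
    return generate(prompt)

print(rag_answer("When was the Eiffel Tower completed?"))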

Mutual Enhancement of Large and Small Language Models with Cross-Silo Knowledge Transfer: Large language models (LLMs) hold broad knowledge, but their task-specific performance is often suboptimal. Improving it requires fine-tuning on task-specific data, which may be inaccessible due to privacy concerns. This paper proposes a novel approach to enhance LLMs with smaller language models (SLMs) that are trained on clients using their private task-specific data. We propose CrossLM, in which the LLM and the SLMs enhance each other: the SLMs steer the LLM toward generating high-quality, task-specific data, which is then used to improve both the LLM and the SLMs. We evaluate CrossLM using publicly accessible language models across benchmark tasks. The results show that CrossLM improves the task-specific performance of both the SLMs on the clients and the LLM on the cloud server while preserving the LLM's ability to generalize.
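CrossLM's mutual-enhancement loop is easiest to see schematically. The sketch below is a simplified reading of the abstract, not the paper's algorithm: llm_generate, slm_score, and the commented fine-tuning steps are hypothetical placeholders, and the actual training objectives are in the paper. The property it illustrates is that the client's private data never leaves the client; only the SLM's feedback on LLM-generated synthetic samples crosses the silo boundary.

# Schematic sketch of one CrossLM-style mutual-enhancement round, under
# simplifying assumptions (the real method's objectives are in the paper).
# The client's SLM never shares its private data; it only scores synthetic
# samples produced by the cloud LLM.

def llm_generate(task_description: str, n: int) -> list[str]:
    """Hypothetical cloud-LLM call producing n synthetic task samples."""
    return [f"synthetic sample {i} for '{task_description}'" for i in range(n)]

def slm_score(sample: str) -> float:
    """Hypothetical client-side quality score from an SLM trained on private data."""
    return 0.9 if "sample" in sample else 0.1  # stand-in for a learned scorer

def crosslm_round(task_description: str, threshold: float = 0.5) -> list[str]:
    """One round: generate on the cloud, filter with client feedback."""
    candidates = llm_generate(task_description, n=8)
    # 1) SLM feedback: keep only samples the task-specialized SLM rates highly.
    accepted = [s for s in candidates if slm_score(s) >= threshold]
    # 2) Both models would then be updated on the accepted synthetic data:
    #    fine_tune(slm, accepted)  # improves the client SLM
    #    fine_tune(llm, accepted)  # improves the cloud LLM's task skill
    return accepted

print(crosslm_round("sentiment classification"))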

Privacy Issues in Large Language Models: A Survey: This is the first survey of the active area of AI research focusing on privacy issues in large language models (LLMs). Specifically, we focus on work that red-teams models to highlight privacy risks, attempts to build privacy into the training or inference process, enables efficient data deletion from trained models to comply with existing privacy regulations, and mitigates copyright issues. We summarize technical research that develops algorithms, proves theorems, and runs empirical evaluations. While an extensive body of legal and policy work addresses these challenges from a different angle, that is not the focus of this survey; nevertheless, those works and recent legal developments inform how the technical problems are formalized, so we discuss them briefly in Section 1. While we have made our best effort to include all the relevant work, we may have missed some recent work due to the fast-moving nature of this research. If we have missed any of your work, please get in touch with us, and we will attempt to keep this survey up-to-date. We maintain a publicly available repository with the list of papers covered in this survey and any relevant code at this https URL.

--

Are you looking to advertise a product, job opening, or event to an audience of over 40,000 AI researchers and engineers? Feel free to contact us on LinkedIn to explore your options.

Enjoy the newsletter? Help us make it bigger and better by sharing it with colleagues and friends.

--

Industry Insights

Growth Zone

What Resilience Means and Why It Matters

Expert Advice




