The Large Language Model chapter is online!

The Large Language Model chapter is online!

Hey! The Large Language Model chapter of my book is now online, and it features not one, not two, not three, but six (6) Colab notebooks! Not only have you learned to code a Transformer language model from scratch in the previous Transformer chapter, but in this LLM chapter, you will also learn how to finetune it for various business scenarios. These include:

  1. Finetune an LLM for an emotion classification task as a generative model.
  2. Finetune an LLM for an emotion classification task as a multi-class classifier.
  3. Compare the classifiers' performance to a bag of words + logistic regression combination (a baseline).
  4. A full finetune of a pretrained LLM to answer questions.
  5. A LoRA finetune (for cost efficiency).

The code in these notebooks can be used for solving your real business problems by simply replacing the data file and the prompting format.

The chapter also includes the sampling strategies from a language model and prompt engineering techniques.

Enjoy! The printed book is coming very soon. Stay tuned.


Vasilis Kalyvas

Senior Data Scientist at Coca-Cola HBC | AI/ML online articles & tutorials

1w

Andriy Burkov actually only the 1st link with code works, is there anything wrong on my side?

Like
Reply
David Brabant

Principal Software Architect at EVS Broadcast Equipment

2w

"The printed book is coming very soon." Not soon enough ;-) Any idea when it will be available for ordering?

Ron J Beliard

Senior consultant at Page Marketing Inc.

2w

AMAZING... Can't wait to buy the book. It is worth it. Congrats Andriy. Great Treasure ! 👏

Vasyl Stetsyuk

Software Automation Engineer / SQAE / Cypress / Nightwatch.js / Playwright /

2w

"read first, buy later" is an interesting concept in the best open-source/freeware traditions.

  • No alternative text description for this image

To view or add a comment, sign in

More articles by Andriy Burkov

  • My new book on Language Models is here

    My new book on Language Models is here

    After 9 months of hard work, I'm excited to announce that my book, The Hundred-Page Language Models Book is now…

    8 Comments
  • Artificial Intelligence #258

    Artificial Intelligence #258

    Hey, in this issue: 📚my new book on language models is available for purchase; how to use Google Deep Research to save…

    9 Comments
  • It’s here: My new book on Language Models

    It’s here: My new book on Language Models

    After 9 months of hard work, I'm excited to announce that my book, The Hundred-Page Language Models Book is now…

    63 Comments
  • Artificial Intelligence #257

    Artificial Intelligence #257

    Hey, in this issue: 10 breakthrough technologies; what’s next for AI in 2025; AI means the end of internet search as…

    6 Comments
  • Artificial Intelligence #257

    Artificial Intelligence #257

    Hey, in this issue: 10 breakthrough technologies; what’s next for AI in 2025; AI means the end of internet search as…

    8 Comments
  • The Transformer chapter is online!

    The Transformer chapter is online!

    Hey! I’ve been thinking about a special Christmas gift for my subscribers. How about the sixth chapter of my upcoming…

    21 Comments
  • Artificial Intelligence #256

    Artificial Intelligence #256

    Hey, in this issue: the 8 worst technology failures of 2024; scaling test-time compute with open models; is the tech…

    11 Comments
  • Artificial Intelligence #256

    Artificial Intelligence #256

    Hey, in this issue: the 8 worst technology failures of 2024; scaling test-time compute with open models; is the tech…

    7 Comments
  • Artificial Intelligence #255

    Artificial Intelligence #255

    Hey, in this issue: a state-of-the-art quantum chip from Google; the AI revolution is running out of data; new database…

    16 Comments
  • Artificial Intelligence #255

    Artificial Intelligence #255

    Hey, in this issue: a state-of-the-art quantum chip from Google; the AI revolution is running out of data; new database…

    11 Comments

Explore topics