Silicon Valley Rises Again - Combat Layoffs by Embracing LLMs (Part 1)
Slide Credit - Weights & Biases Carey Phelps
Disclaimer: The views expressed here are my own and not that of my employer
Thank you to @Bhargav Shubra, an intern who examined over 500+ AI startups, their mission and finances. Thanks Kumar G. for the introduction.
I am writing this article to share my views on the changing AI landscape (part 1) and how we can participate in this LLM wave (part 2)
In recent months, a transformative movement has emerged to combat the devastating impact of layoffs (~210,000), marked by an inspirational sentiment: "We will rise again." At conferences and meetups, including the FullyConnected conference hosted by Weights & Biases on June 7th ( Weights & Biases 's Phillip Reinhart 🪄🐝 , Lukas Biewald , Carey Phelps , Shawn Lewis , Brent Chalker , Casey Penrose , Jillian Clark ), attended by 2,500 people, Silicon Valley is witnessing a resurgence.
This resurgence can be attributed to the exponential growth of data generation (View previous blog), acting as a catalyst for the development of Large Language Models (LLMs). These powerful models, such as GPT, Claude, PaLM, LLaMA, have revolutionized the prediction of word sequences, text analysis, summarization, and even art generation from text with innovations like OpenAI 's DALL-E Open Ai , Stability AI 's Stable diffusion, and Midjourney . As a result, influential technology influencers, venture capitalists, and corporations like Microsoft and NVIDIA have eagerly invested in this domain.
Here is a leaderboard (lmsys.org) of top-ranked LLM models as of May 2023 shared by Andrej Karpathy
Why should you care about the buzz surrounding Generative AI LLMs? Here are three reasons why this train is about to hit you:
Credit: Andrej Karpathy 's slide
Let's explore the Hot investment area theory:
As we have more context now, let's get deeper into the Ethics of AI with data as the centerpiece of AI excellence.
With significant investments driving AI research, data takes center stage, making discussions on data sourcing, privacy, protection, bias, and adversarial hacking. The US government's Office of Science and Technology Policy has already laid the groundwork for The White House 's Blueprint for the AI bill of rights, focusing on Safe and effective systems, Algorithmic bias, Data privacy, Observability with Transparency, and Human controls.
Recommended by LinkedIn
Over the past two years, approximately $20 billion has been invested in AI, with around $15 billion directed toward Generative AI with OpenAI securing the lion's share ($10 billion from Microsoft ). These investment amounts span various aspects of the AI flywheel, including data sourcing, annotation, synthesis, LLM/Generative AI model creation, aggregation, and ML infrastructure optimization.
As humans are expected to generate a staggering 460 Exabytes of data per day by 2025, it becomes crucial to establish guardrails for Generative AI model creators, who utilize diverse forms of open-source data for training. Data ownership, data utilization by models, and fair compensation for data providers and crowd-sourced creators emerge as significant concerns. Examples like Reddit's API charging dispute and Twitter's pricing model for tweets exemplify these issues.
Next, the psychological support for moderators' (and image classifying labelers) viewing disturbing data is at center stage with lawsuits against Meta 's and its labeling contract with @Sama
To address data discipline and AI ethics, I simplify it into three key areas:
a) Data sourcing with privacy, compliance, and auditing controls.
b) Model bias and transparency, incorporating observability.
c) Deployment security, including adversarial testing.
In navigating the AI landscape, these considerations play a vital role in ensuring responsible and ethical AI practices.
To address the growing demand for transparency in AI, startups have secured close to US$ 0.5 Billion in funding for Observability, Risk & Compliance, and error detection.
Notably, Nvidia has contributed to this cause by open-sourcing its NeMo guardrails project, featuring Topical, Safety, and Security Guardrails. Thanks to Jonathan Cohen , VP of Applied research at NVIDIA for his insightful presentation at the Fullyconnected Weights & Biases conference.
LLMs have solidified their position with substantial investments and widespread interest. A thrilling opportunity awaits for technology companies to join forces and define ethical guardrails, safeguarding data and model privacy while fortifying deployment against emerging threats. Collaboration with government agencies further enhances this initiative.
To unlock the immense value of LLMs, we must revamp our technology stack and embrace the latest models for diverse use cases. In the upcoming Part 2, we will explore this topic and its possibilities.
Disclaimer: The views expressed here are my own and not that of my employer.
Managing Director
1yAarthi, thanks for sharing!
MLOps & GTM Nerd interested in ✌🏻 things: 1) Working with good-hearted, unstoppable human-beings and 2) Improving our world with AI/ML, data and analytics.
1yGlad we could be part of the story! Cc Weights & Biases
Product Management Executive | eCommerce Retail and Marketplaces | Social Commerce, AI, Personalization | Dad
1yGreat read!