Silicon Valley Rises Again - Combat Layoffs by Embracing LLMs (Part 1)
Slide Credit - Weights & Biases Carey Phelps

Silicon Valley Rises Again - Combat Layoffs by Embracing LLMs (Part 1)

Slide Credit - Weights & Biases Carey Phelps

Part 2 is here

Disclaimer: The views expressed here are my own and not that of my employer

Thank you to @Bhargav Shubra, an intern who examined over 500+ AI startups, their mission and finances. Thanks Kumar G. for the introduction.

I am writing this article to share my views on the changing AI landscape (part 1) and how we can participate in this LLM wave (part 2)

In recent months, a transformative movement has emerged to combat the devastating impact of layoffs (~210,000), marked by an inspirational sentiment: "We will rise again." At conferences and meetups, including the FullyConnected conference hosted by Weights & Biases on June 7th ( Weights & Biases 's Phillip Reinhart 🪄🐝 , Lukas Biewald , Carey Phelps , Shawn Lewis , Brent Chalker , Casey Penrose , Jillian Clark ), attended by 2,500 people, Silicon Valley is witnessing a resurgence.

This resurgence can be attributed to the exponential growth of data generation (View previous blog), acting as a catalyst for the development of Large Language Models (LLMs). These powerful models, such as GPT, Claude, PaLM, LLaMA, have revolutionized the prediction of word sequences, text analysis, summarization, and even art generation from text with innovations like OpenAI 's DALL-E Open Ai , Stability AI 's Stable diffusion, and Midjourney . As a result, influential technology influencers, venture capitalists, and corporations like Microsoft and NVIDIA have eagerly invested in this domain.

Here is a leaderboard (lmsys.org) of top-ranked LLM models as of May 2023 shared by Andrej Karpathy

No alt text provided for this image
MSFT developer talks by @andrej Karpathy

Why should you care about the buzz surrounding Generative AI LLMs? Here are three reasons why this train is about to hit you:

  1. A Hot Investment Area: Brace yourself for a surge in hiring and economic fuel as this becomes a hot investment opportunity. The demand for talent in this field is skyrocketing, making it an exciting prospect for job seekers and entrepreneurs alike.
  2. Skillset Development and Retention: Developers are eager to jump on board and work with this cutting-edge technology. By honing their skills in Generative AI, they position themselves at the forefront of innovation. For companies, this also presents a retention challenge as skilled professionals seek opportunities in this rapidly evolving field.
  3. The Superiority of Gen AI Models: This latest wave of AI technology surpasses previous versions of machine learning. With LLMs, theoretical research has been practically implemented, resulting in backpropagating Gen AI Models that boast enhanced capabilities in predicting the next word with a larger input token size (32,000 vs. 512 tokens with BERT). The incorporation of techniques like Supervised Fine Tuning (SFT) with Rewards Modeling (RM) and Reinforcement Learning with Human Feedback (RLHF) has taken AI to new heights.

No alt text provided for this image
@Andrej Karpathy's slide at MSFT's BUILD Conference

Credit: Andrej Karpathy 's slide

Let's explore the Hot investment area theory:

  1. OpenAI and Microsoft: OpenAI, initially a non-profit organization co-founded by Tesla 's @ElonMusk and OpenAI 's @SamAltman, transitioned into a (capped) for-profit company after Microsoft 's $10 billion investment in 2019. This partnership brought increased marketing resources, expanded technology reach, and the ability to attract top talent. The user-friendly interface ChatGPT is also driving universal awareness and engagement and OpenAI recently announced that GPT-4 (the latest and greatest), is now generally available.
  2. Cloud Giants in the Race: Microsoft , Google , and Amazon are vying to become leading providers of Generative AI. Microsoft invested in OpenAI and integrated GPT into its Office suite and offers commercial API access through its cloud platform. Google invested in Cohere (ex-Googler Brain researchers Aidan Gomez and Nick Frosst ), and Swami Sivasubramanian announced Amazon's US$ 100 MM investment along with the Bedrock product to host LLM models from various startups like AI21 Labs , Anthropic and Stability AI
  3. VC Investments in Developer Infrastructure: Venture capitalists are pouring substantial funding (US$ ~20B already deployed in GenAI startups - Thanks @Bhargav for the detailed analysis) into the infrastructure supporting Generative AI and LLMs. VC investments in the developer infrastructure space for Generative AI and LLM models have surged, with a promise of an additional US$ 10 billion in funding. Notably, Inflection AI recently secured US$ 1.3 billion from major players including Microsoft , Reid Hoffman , Bill Gates , Eric Schmidt and NVIDIA . This significant investment showcases Silicon Valley's prowess, as proven serial entrepreneurial founders receive funding from tech tycoons. Inflection AI 's Pi chatbot, though not an entirely novel concept of a personal assistant, gained a longer runway to find the right Product Market Fit. This is made possible through ample funding and network effects from renowned researcher Mustafa Suleyman , founder of Google DeepMind (acquired by Google ), as well as investor portfolio synergies. This strategic funding round underscores the ongoing developments and competition in the world of Generative AI.

As we have more context now, let's get deeper into the Ethics of AI with data as the centerpiece of AI excellence.

With significant investments driving AI research, data takes center stage, making discussions on data sourcing, privacy, protection, bias, and adversarial hacking. The US government's Office of Science and Technology Policy has already laid the groundwork for The White House 's Blueprint for the AI bill of rights, focusing on Safe and effective systems, Algorithmic bias, Data privacy, Observability with Transparency, and Human controls.

No alt text provided for this image
The White House Blueprint for an AI Bill of Rights

Over the past two years, approximately $20 billion has been invested in AI, with around $15 billion directed toward Generative AI with OpenAI securing the lion's share ($10 billion from Microsoft ). These investment amounts span various aspects of the AI flywheel, including data sourcing, annotation, synthesis, LLM/Generative AI model creation, aggregation, and ML infrastructure optimization.

As humans are expected to generate a staggering 460 Exabytes of data per day by 2025, it becomes crucial to establish guardrails for Generative AI model creators, who utilize diverse forms of open-source data for training. Data ownership, data utilization by models, and fair compensation for data providers and crowd-sourced creators emerge as significant concerns. Examples like Reddit's API charging dispute and Twitter's pricing model for tweets exemplify these issues.

No alt text provided for this image
Open-sourced data used for training GPT-3

Next, the psychological support for moderators' (and image classifying labelers) viewing disturbing data is at center stage with lawsuits against Meta 's and its labeling contract with @Sama

To address data discipline and AI ethics, I simplify it into three key areas:

a) Data sourcing with privacy, compliance, and auditing controls.

b) Model bias and transparency, incorporating observability.

c) Deployment security, including adversarial testing.

In navigating the AI landscape, these considerations play a vital role in ensuring responsible and ethical AI practices.

To address the growing demand for transparency in AI, startups have secured close to US$ 0.5 Billion in funding for Observability, Risk & Compliance, and error detection.

No alt text provided for this image
Investments in Ethics based infrastructure - last 12 months (Thanks @Bhargav Subra for your analysis)

Notably, Nvidia has contributed to this cause by open-sourcing its NeMo guardrails project, featuring Topical, Safety, and Security Guardrails. Thanks to Jonathan Cohen , VP of Applied research at NVIDIA for his insightful presentation at the Fullyconnected Weights & Biases conference.

No alt text provided for this image
Source: @Jonathan Cohen Nvidia's Nemo Guardrails

LLMs have solidified their position with substantial investments and widespread interest. A thrilling opportunity awaits for technology companies to join forces and define ethical guardrails, safeguarding data and model privacy while fortifying deployment against emerging threats. Collaboration with government agencies further enhances this initiative.

To unlock the immense value of LLMs, we must revamp our technology stack and embrace the latest models for diverse use cases. In the upcoming Part 2, we will explore this topic and its possibilities.

Disclaimer: The views expressed here are my own and not that of my employer.

Aarthi, thanks for sharing!

Like
Reply
Phillip Reinhart 🪄🐝

MLOps & GTM Nerd interested in ✌🏻 things: 1) Working with good-hearted, unstoppable human-beings and 2) Improving our world with AI/ML, data and analytics.

1y

Glad we could be part of the story! Cc Weights & Biases

Andy Feierfeil

Product Management Executive | eCommerce Retail and Marketplaces | Social Commerce, AI, Personalization | Dad

1y

Great read!

To view or add a comment, sign in

Insights from the community

Others also viewed

Explore topics