This week's latest generative AI updates

Welcome to this week’s SymphonyAI generative AI weekly newsletter summarizing all the important AI industry developments and technology advancements you need to know.

While the spotlight often shines on massive AI models, the true revolution is unfolding with Small Language Models (SLMs). These efficient models thrive by using smarter, leaner reasoning. With advancements in model compression, knowledge distillation, and retrieval-augmented generation (RAG), SLMs have evolved beyond their earlier limitations. These innovations allow developers to shrink large models without compromising intelligence. RAG in particular allows SLMs to tap into external knowledge bases on demand, letting them remain nimble without needing to store vast amounts of data internally. The result? AI that’s not just smaller—it’s specialized, efficient, and lightning-fast.

This week, we cover how leading companies are rapidly adopting SLMs and the key developments driving this transformation.

SymphonyAI news

Customer success—Retail: Save A Lot automates food traceability

Awards—Financial Services: SymphonyAI triumphs as risk management provider of the year at ALB Pan Asian Regulatory Awards 2024

Webinar recording—Enterprise IT: What you really need to know before you invest in a copilot for IT

Webinar recording—Media: AI in the media tech stack

Next-gen AI: Small models and multimodality

Precision Meets Efficiency in AI: NVIDIA introduced Llama 3.1-Nemotron-51B, a language model derived from Meta’s Llama-3.1-70B. It achieves 2.2x faster inference, enabling 4x larger workloads, reducing costs, and facilitating scalable AI solutions.

Phi-3: Redefining what’s possible with SLMs: Microsoft recently announced the availability of Phi-3.5 MoE (Mixture of Experts) in Azure AI Studio, offering dynamic model scaling so enterprises can use powerful AI while optimizing compute efficiency.

Salesforce’s xLAM-1B: AI Efficiency Leader: Salesforce released xLAM-1B, a 1B parameter SLM outperforming larger models in function-calling tasks. Designed to deploy autonomous AI agents, xLAM-1B facilitates complex task execution while maintaining a balance between power and operational limitations.

Zamba2 Launch: Open-Source SLM: Zyphra introduces Zamba2-2.7B, a high-performance SLM. Released as open-source, this model's reduced computational demands make it suitable for a wide range of industries.

AI Voice Interaction for All: Google's Gemini Live voice mode is now available for all Android users. It enhances AI accessibility with hands-free, multi-turn conversations and supports personalized interactions.

Voice Innovations: Opportunities and Shortcomings: OpenAI's new Advanced Voice Mode for ChatGPT Plus enhances voice interactions but falls short on promised features like real-time video analysis and emotion recognition.

Boosting Productivity with Voice Agents: Supernormal has introduced Voice Agents, an AI platform to streamline sales calls, customer support, surveys, and scheduling.

AI-Driven Engagement in Social Media: Meta introduced new multimodal AI features, including voice interaction, photo editing, and translation. The updates aim to increase user engagement and enhance ad performance.

Enhancing Video AI: OpenAI's Sora Upgrade: OpenAI is upgrading its video AI, Sora, to produce longer, high-quality clips faster, addressing issues like inconsistent styles, physics errors, and bias.

ChatGPT Voice Upgrade: Global Divide: OpenAI is enhancing ChatGPT Plus and Team with an "Advanced Voice" feature, introducing five new voices, custom instructions, and memory.

AI in financial services

Navigating AI in Stock Picks: Israeli startup Bridgewise will soon launch Stocktalk, a stock-picking chatbot approved for use by Israel Discount Bank customers. Stocktalk offers financial disclosure summaries, firm background info, and market-based stock recommendations. However, concerns about AI-driven market instability persist, highlighted by SEC head Gary Gensler.

Enhanced Analytics to Combat Financial Crime: Nasdaq Verafin's Targeted Typology Analytics advances AI-based detection for terrorist financing ($11 billion) and drug trafficking ($800 billion), using data from 2,500+ financial institutions.

Generative AI impact, adoption, and projections

Rapid Rise of Generative AI Usage: A recent study shows 39.4% of Americans adopted generative AI within two years, surpassing early PC and internet adoption rates. AI is broadly used across sectors, saving time in tasks like writing (57%) and information searches (49%), with 28% of employees using AI at work and 25% weekly. Potential productivity boosts are estimated at 0.125%-0.875%.

Study Shows AI Coding Assistant Improves Developer Productivity: Researchers from Microsoft and universities found GitHub Copilot boosts developer productivity by 26%, particularly aiding less experienced developers.

Enterprise Spending on GenAI Expected to Rise 50% in 2025, as Focus Shifts From Efficiency to Expertise: Enterprise spending on Generative AI is expected to rise by 50% in 2025, shifting towards enhancing human expertise. 54% of companies expect efficiency ROI. Currently, 43% are in live pilot stages.

Navigating the New AI Infrastructure Paradigm: There is a shift towards hybrid infrastructures driven by generative AI demands, with 85% of cloud buyers deploying or planning hybrid solutions. Key use cases include latency-sensitive applications.

Fellows Fund Releases Comprehensive Enterprise AI Report: The AI-Native Paradigm report highlights trends in Enterprise AI. It underscores the emergence of autonomous "Agentic AI" and identifies opportunities in developing autonomous agents and domain-specific solutions.

From Cost Center to Competitive Edge: The Strategic Value of Custom AI Infrastructure: Custom AI infrastructure is transforming into a strategic business asset, essential for optimizing specific workloads like data preparation and model training.

Ethical Risks of Generative AI Remain a Major Concern, Deloitte Report Reveals: The report reveals that 54% of professionals view generative AI as the highest ethical risk, with only 27% of organizations having ethical guidelines. Data privacy concerns 40% of respondents.

Big tech

OpenAI Addresses Language Barriers with New Multilingual AI Dataset: OpenAI released the Multilingual Massive Multitask Language Understanding dataset, evaluating AI performance in 14 languages and created with professional translators.

Sam Altman Predicts AI Superintelligence Within a Few Thousand Days: Sam Altman predicts artificial superintelligence could emerge within a few thousand days. Despite labor market impacts, he is optimistic about unprecedented achievements in physics, sustainable energy, and societal transformation.

Sam Altman and Former Apple Executives Collaborate on AI Device: Sam Altman, Jony Ive, and Laurene Powell Jobs have launched Tin, an AI device company, combining Altman's AI expertise and Ive's design skills to innovate consumer AI hardware.

Llama 3.2: Revolutionizing Edge AI and Vision with Open, Customizable Models: Meta launched Llama 3.2, featuring vision models and text models that support on-device tasks like summarization and instruction following, enhance visual data interaction, and ensure privacy.

Meta's Llama Stack Simplifies Enterprise AI Adoption: Meta's Llama Stack simplifies AI deployment with a standardized API. Collaborating with AWS and Dell, it supports flexible hybrid or multi-cloud strategies.

Meta Tests Personalized AI Content in Facebook and Instagram Feeds: Meta is testing AI-generated content on Facebook and Instagram, targeting 400 million monthly users. AI features enable real-time personalization, enhanced by a voice-capable AI assistant.

Google Gemini Enhances Gmail with Contextual Smart Replies: Google's Gemini-powered smart replies in Gmail now offer context-aware suggestions by analyzing entire email threads.

Google DeepMind Launches AlphaChip for AI-Driven Chip Design: Google DeepMind's AlphaChip uses reinforcement learning to enhance chip design efficiency, yielding a 6.2% wire length reduction in Google’s latest TPU.

Google Integrates Gemini AI Chat App into Workspace for Enhanced Productivity: Google integrated the Gemini chatbot into Workspace. Specific features include email suggestions, meeting notes, and information summaries. A survey showed 75% of users improved work quality, saving 105 minutes weekly.

Fine-Tuning and New Model Support in Azure AI: Microsoft Azure AI introduces fine-tuning for GPT-4o and GPT-4o mini. New models like Phi-3.5-MoE and Llama 3.2 enhance Azure's capabilities in multilingual processing and image reasoning.

Microsoft Unveils AI Hallucination Correction Tool: Microsoft's new Azure AI Content Safety feature enhances AI reliability by detecting inconsistencies and triggering a smaller AI model to correct unsupported text. The Groundedness Detection tool reduces inconsistencies to 0.1-1%, aiming to mitigate hallucinations for broader AI applications.

Microsoft Launches Azure AI Inference SDK for .NET: Microsoft's Azure AI Inference SDK for .NET simplifies integration of generative AI models from Azure AI Studio's catalog. It supports advanced AI functionalities with minimal setup, enhancing tasks like chat integration.

Microsoft Copilot Enhances Transparency with Web Search Query Citations: Microsoft's latest Copilot update introduces exact web search query transparency.

Responsible AI and public policy

Empowering Futures Through AI Education: Sundar Pichai announced a $120 million Global AI Opportunity Fund to enhance AI education and equitable access worldwide.

AI Access for All: Empowering Innovation: OpenAI has launched the OpenAI Academy, providing $1 million in API credits to developers in low- and middle-income countries.

Empowering Futures: Quantum AI Challenge: Flapmax and Intel's Quantum AI Challenge invites HBCU students to tackle real-world problems using quantum computing and AI. Participants will access advanced tools, collaborate with experts, and address sustainability challenges.

US Launches AI Partnership with Meta, OpenAI, and NVIDIA: The U.S. government launched the Partnership for Global Inclusivity on AI, committing over $100 million. Key initiatives include increasing AI model access, building technical capacity, expanding datasets, and ensuring responsible governance.

California Enacts 18 New AI Laws Addressing Key Issues: California Governor Gavin Newsom signed 18 AI-related bills. Key measures include requiring AI providers to disclose data sources, extending privacy laws to generative AI, criminalizing AI-generated pornography, and mandating AI literacy in schools.

Major Tech Firms Unite to Support EU AI Act: Over 100 companies signed the EU AI Pact, committing to principles of the forthcoming EU AI Act. The pact emphasizes AI governance, risk mitigation, and transparent labeling.

FTC's Crackdown on Deceptive AI Practices Signals Industry Warning: The FTC launched “Operation AI Comply,” targeting deceptive AI claims. The action emphasizes consumer protection and accountability in AI services, and highlights that AI-marketed products must comply with existing consumer protection laws.

ValidMind Launches Advantage Program for AI Model Compliance: ValidMind's Advantage program aids fintech companies in validating AI models and complying with banking regulations like SR11-7 and the EU AI Act.

Other generative AI models

Molmo: Accessible AI for All: AI2's Molmo, a multimodal AI model family, rivals major tech firms by offering efficient, open-access AI solutions. Using a smaller, curated dataset of ~1 million images, it excels in visual interpretation with fewer errors and faster training.

Tailored AI for Insurance Efficiency: EXL has launched EXL Insurance LLM, an industry-specific language model. The model, leveraging NVIDIA's AI Enterprise platform, improves accuracy by 30%. It offers structured data ingestion, contextual classification, and real-time insights.

AI-Powered Insights for Climate Science: IBM and NASA have launched an open-source AI model for weather and climate applications. Pre-trained on 40 years of NASA data, it offers 12x resolution localized forecasts and supports diverse challenges, from high-resolution forecasts to global model improvements, enhancing predictive accuracy and environmental data analysis.

Unleashing AI Power on Single GPU: Solar Pro Preview is a single GPU AI model providing superior MMLU Pro and IFEval scores.

Notable research

Small Language Models Survey: A comprehensive survey on SLMs across architectures, training datasets, and training algorithms, this study analyzes 59 open-source SLMs and capabilities such as reasoning, in-context learning, math, and coding. Other discussions include on-device runtime costs, latency, memory footprint, and valuable insights.

Logic-of-Thought: Enhancing LLM Reasoning: Logic-of-Thought (LoT) is introduced as a prompting technique that improves logical reasoning by incorporating logical propositions into model inputs. LoT enhances model reasoning capabilities, outperforming other prompting techniques across multiple reasoning benchmarks.

LLMs Still Can’t Plan: This study finds that a domain-independent planner can solve all instances of Mystery Blocksworld but LLMs struggle, even on small instances. OpenAI’s o1-preview shows progress on more challenging planning problems, but degrades in performance as the plan length increases, showing that the accuracy gains cannot be considered general or robust.

This week's latest generative AI updates - October 8, 2024

SymphonyAI

Tailored AI applications that solve core business challenges to deliver rapid, real, relevant results.

SymphonyAI news

Next-gen AI: Small models and multimodality

AI in financial services

Generative AI impact, adoption, and projections

Big tech

Responsible AI and public policy

Other generative AI models

Notable research

Eureka

29,395 followers

More articles by this author

Insights from the community

Explore topics

SymphonyAI news

Next-gen AI: Small models and multimodality

AI in financial services

Generative AI impact, adoption, and projections

Big tech

Responsible AI and public policy

Other generative AI models

Notable research

Eureka

29,395 followers

This week's latest AI industry updates - December 17, 2024

Dec 17, 2024

This week's latest AI industry updates - December 10, 2024

Dec 10, 2024

This week's latest AI industry updates - December 3, 2024

Dec 3, 2024

This week's latest AI industry updates - November 26, 2024

Nov 26, 2024

This week's latest generative AI updates - November 19, 2024

Nov 19, 2024

This week's latest generative AI updates - November 12, 2024

Nov 12, 2024

This week's latest generative AI updates - October 1, 2024

Oct 1, 2024

This week's latest generative AI updates - September 24, 2024

Sep 24, 2024

This week's latest generative AI updates - September 17, 2024

Sep 17, 2024

This week's latest generative AI updates - September 10, 2024

Sep 10, 2024

Insights from the community

Explore topics