Google Gemini 1.5 Pro Leaps Ahead In AI Race, Challenging GPT-4o: An anonymous reader quotes a report from VentureBeat: Google launched its latest artificial intelligence powerhouse, Gemini 1.5 Pro, today, making the experimental "version 0801" available for early testing and feedback through Google AI Studio and the Gemini API. This release marks a major leap forward in the company's AI capabilities and has already sent shockwaves through the tech community. The new model has quickly claimed the top spot on the prestigious LMSYS Chatbot Arena leaderboard (built with Gradio), boasting an impressive ELO score of 1300. This achievement puts Gemini 1.5 Pro ahead of formidable competitors like OpenAI's GPT-4o (ELO: 1286) and Anthropic's Claude-3.5 Sonnet (ELO: 1271), potentially signaling a shift in the AI landscape. Simon Tokumine, a key figure in the Gemini team, celebrated the release in a post on X.com, describing it as "the strongest, most intelligent Gemini we've ever made." Early user feedback supports this claim, with one Redditor calling the model "insanely good" and expressing hope that its capabilities won't be scaled back. "A standout feature of the 1.5 series is its expansive context window of up to two million tokens, far surpassing many competing models," adds VentureBeat. "This allows Gemini 1.5 Pro to process and reason about vast amounts of information, including lengthy documents, extensive code bases, and extended audio or video content." Read more of this story at Slashdot.
Slashdot’s Post
More Relevant Posts
-
Google Gemini 1.5 Pro Leaps Ahead In AI Race, Challenging GPT-4o: An anonymous reader quotes a report from VentureBeat: Google launched its latest artificial intelligence powerhouse, Gemini 1.5 Pro, today, making the experimental "version 0801" available for early testing and feedback through Google AI Studio and the Gemini API. This release marks a major leap forward in the company's AI capabilities and has already sent shockwaves through the tech community. The new model has quickly claimed the top spot on the prestigious LMSYS Chatbot Arena leaderboard (built with Gradio), boasting an impressive ELO score of 1300. This achievement puts Gemini 1.5 Pro ahead of formidable competitors like OpenAI's GPT-4o (ELO: 1286) and Anthropic's Claude-3.5 Sonnet (ELO: 1271), potentially signaling a shift in the AI landscape. Simon Tokumine, a key figure in the Gemini team, celebrated the release in a post on X.com, describing it as "the strongest, most intelligent Gemini we've ever made." Early user feedback supports this claim, with one Redditor calling the model "insanely good" and expressing hope that its capabilities won't be scaled back. "A standout feature of the 1.5 series is its expansive context window of up to two million tokens, far surpassing many competing models," adds VentureBeat. "This allows Gemini 1.5 Pro to process and reason about vast amounts of information, including lengthy documents, extensive code bases, and extended audio or video content." Read more of this story at Slashdot.
To view or add a comment, sign in
-
Google Gemini 1.5 Pro Leaps Ahead In AI Race, Challenging GPT-4o: An anonymous reader quotes a report from VentureBeat: Google launched its latest artificial intelligence powerhouse, Gemini 1.5 Pro, today, making the experimental "version 0801" available for early testing and feedback through Google AI Studio and the Gemini API. This release marks a major leap forward in the company's AI capabilities and has already sent shockwaves through the tech community. The new model has quickly claimed the top spot on the prestigious LMSYS Chatbot Arena leaderboard (built with Gradio), boasting an impressive ELO score of 1300. This achievement puts Gemini 1.5 Pro ahead of formidable competitors like OpenAI's GPT-4o (ELO: 1286) and Anthropic's Claude-3.5 Sonnet (ELO: 1271), potentially signaling a shift in the AI landscape. Simon Tokumine, a key figure in the Gemini team, celebrated the release in a post on X.com, describing it as "the strongest, most intelligent Gemini we've ever made." Early user feedback supports this claim, with one Redditor calling the model "insanely good" and expressing hope that its capabilities won't be scaled back. "A standout feature of the 1.5 series is its expansive context window of up to two million tokens, far surpassing many competing models," adds VentureBeat. "This allows Gemini 1.5 Pro to process and reason about vast amounts of information, including lengthy documents, extensive code bases, and extended audio or video content." Read more of this story at Slashdot.
To view or add a comment, sign in
-
Google has announced the launch of its next-generation AI model, Gemini 1.5. 🌟This new model is designed to be a significant improvement over its predecessor, with a focus on enhanced performance, longer context window, and better understanding. Gemini 1.5 is built on a new version of Mixture-of-Experts (MoE) architecture, allowing it to learn and selectively activate the most relevant pathways in its neural network, thus increasing efficiency. 🌟One of the most notable features of Gemini 1.5 is its enormous context window, which allows it to handle much larger queries and look at much more information at once compared to its predecessor and other existing models. 🌟The context window of Gemini 1.5 is a whopping 1 million tokens, compared to 128,000 for OpenAI’s GPT-4 and 32,000 for the current Gemini Pro. 🌟This means it can process vast amounts of information in one sitting, including up to 1 hour of video, 11 hours of audio, and codebases with over 30,000 lines of code or over 700,000 words. 🌟Gemini 1.5 will initially be available to business users and developers through Google’s Vertex AI and AI Studio, with a full consumer rollout expected soon. 🌟The standard version of Gemini Pro will be upgraded to 1.5 Pro with a 128,000-token context window. Google is also testing the model’s safety and ethical boundaries, particularly regarding the newly larger context window. The company has conducted extensive evaluations to ensure the safe and responsible deployment of this advanced model. Gemini 1.5 represents a significant advancement in AI technology and is expected to have wide-ranging applications across various industries and sectors. #artificialintelligence #geminiai Source: https://lnkd.in/e-wRuBpR
To view or add a comment, sign in
-
Back in May, Google augmented its Gemini AI model with SynthID, a toolkit that embeds AI-generated content with watermarks it says are "imperceptible to humans" but can be easily and reliably detected via an algorithm. Today, Google took that SynthID system open source, offering the same basic watermarking toolkit for free to developers and businesses. The move gives the entire AI industry an easy, seemingly robust way to silently mark content as artificially generated, which could be useful for detecting deepfakes and other damaging AI content before it goes out in the wild. But there are still some important limitations that may prevent AI watermarking from becoming a de facto standard across the AI industry any time soon.
Google offers its AI watermarking tech as free open source toolkit
arstechnica.com
To view or add a comment, sign in
-
Bites from last week AI news 1/ Microsoft announced that their browser Edge will offer live translation and subtitles for YouTube, Coursera, Linkedin and other popular platforms 🔥 Irrespective of whether you use Edge, this is a feature that will probably be replicated across browsers and products very soon which will make a lot of content more accessible. https://lnkd.in/gitCT7iU 2/ Google introduced AI overview so search a few weeks ago which marks the biggest change in search for years. The feature does not come withouts its flaws as users quickly pointed out. This demonstrates the difficult balance an incumbent has to face compared to disruptors such as OpenAI and Perplexity. https://lnkd.in/gE4J42hi 3/ Scale the data platform behind most LLM providers, released a leaderboard that promises to be impartial and not contaminated ✨ They ensure the latter by keeping the evaluation data private. Unsurprisingly GPT4 comes first in all categories except math where its a close second. Gemini 1.5 Pro and Claude Opus are close behind. Interestingly Llama 3 70b and Mistral large perform very close on instruction following which is the most common use case for everyday users. https://lnkd.in/gUsUqSyX 4/ A really insightful blog series from AI experts on the ground shares many similarities with our own experience and previous posts. Some highlights include to start with prompting and using it rightly including few shots, proper chain of thoughts and break big prompts to smaller deterministic agentic workflows, using hybrid RAG before fine tuning and evaluating using binary or pairwise tasks which are easier and more cost effective for annotators https://lnkd.in/gn4GQJ3i 5/ Sony announces that is looking into AI to cut costs in movie production 😮 This a difficult terrain to navigate given the many ethical issues on how the technology can be used responsibly in that area. Used rightly, AI can boost the productivity of many creatives. Used wrongly it may destroy the very thing it tries to improve. https://lnkd.in/djTz5PdA
To view or add a comment, sign in
-
Google's Gemini Pro 1.5 Enters Public Preview on Vertex AI: Gemini 1.5 Pro, Google's most capable generative AI model, is now available in public preview on Vertex AI, Google's enterprise-focused AI development platform. From a report: The company announced the news during its annual Cloud Next conference, which is taking place in Las Vegas this week. Gemini 1.5 Pro launched in February, joining Google's Gemini family of generative AI models. Undoubtedly its headlining feature is the amount of context that it can process: between 128,000 tokens to up to 1 million tokens, where "tokens" refers to subdivided bits of raw data (like the syllables "fan," "tas" and "tic" in the word "fantastic"). One million tokens is equivalent to around 700,000 words or around 30,000 lines of code. It's about four times the amount of data that Anthropic's flagship model, Claude 3, can take as input and about eight times as high as OpenAI's GPT-4 Turbo max context. A model's context, or context window, refers to the initial set of data (e.g. text) the model considers before generating output (e.g. additional text). A simple question -- "Who won the 2020 U.S. presidential election?" -- can serve as context, as can a movie script, email, essay or e-book. Read more of this story at Slashdot.
To view or add a comment, sign in
-
Google Rolls Out Updated AI Model Capable of Handling Longer Text, Video: An anonymous reader shares a report: Alphabet's Google is rolling out a new version of its powerful artificial intelligence model that it says can handle larger amounts of text and video than products made by competitors. The updated AI model, called Gemini 1.5 Pro, will be available on Thursday to cloud customers and developers so they can test its new features and eventually create new commercial applications. Google and its rivals have spent billions to ramp up their capabilities in generative AI and are keen to attract corporate clients to show their investments are paying off. [...] Gemini 1.5 can be trained faster and more efficiently, and has the ability to process a huge amount of information each time it's prompted, according to Vinyals. For example, developers can use Gemini 1.5 Pro to query up to an hour's worth of video, 11 hours of audio or more than 700,000 words in a document, an amount of data that Google says is the "longest context window" of any large-scale AI model yet. Gemini 1.5 can process far more data compared with what the latest AI models from OpenAI and Anthropic can handle, according to Google. In a pre-recorded video demonstration for reporters, Google showed off how engineers asked Gemini 1.5 Pro to ingest a 402-page PDF transcript of the Apollo 11 moon landing, and then prompted it to find quotes that showed "three funny moments." Read more of this story at Slashdot.
To view or add a comment, sign in
-
"Google's Gemini Emerges as Strong Contender Against OpenAI's GPT-4, Ignites Excitement Among Enterprises and Start-ups" In a significant development, Google has introduced Gemini, a powerful multi-modal AI model that has quickly garnered attention from both enterprises and start-ups. Industry experts note that Gemini presents a compelling alternative to OpenAI's GPT-4, marking a noteworthy shift in the ongoing generative AI competition. This unveiling follows Google's introduction of its largest and most advanced AI model to date. Analysts view this move as a substantial advantage for Google in the competitive landscape, particularly against rivals like Microsoft-OpenAI's GPT and Meta's Llama 2. With Gemini, Google aims to reclaim a leading position by integrating the AI model into key offerings such as Search, Google Ads, and cloud platform services. Google's legacy in pioneering transformer technology, a crucial element in the development of GPT, reinforces its standing in the AI space. Transformers, rooted in the concept of 'self-attention,' have revolutionized how computers comprehend language. Gemini's significance lies in its multi-modal capabilities, enabling it to seamlessly process various forms of information, including text, code, audio, image, and video concurrently. Mahesh Makhija, Partner and Technology Consulting Leader at Ernst & Young (EY) India, expressed excitement about Gemini, noting that clients who are already on Google Cloud are eager to leverage this credible alternative to GPT-4. As Google re-enters the generative AI arena, the integration of Gemini signals a promising chapter, with potential widespread adoption and impact across diverse industries. https://lnkd.in/dC79EVe3
Gemini - Google DeepMind
deepmind.google
To view or add a comment, sign in
-
In the dynamic world of artificial intelligence, staying ahead of the curve is crucial. Here are three recent developments that have caught the tech community’s attention: - OpenAI has set expectations high for its upcoming May 13 announcement. The intriguing part? They've definitively stated it's neither GPT-5 nor a new search engine they're unveiling. What innovative breakthrough has OpenAI been perfecting in the lab? We can only speculate until the big reveal. [Read more](https://lnkd.in/dJPgHXic) - Microsoft is ramping up AI integration, offering OpenAI's GPT models to government cloud customers. Adding these large language models, including the sophisticated GPT-4, to its suite, Microsoft is bolstering its position in providing cutting-edge AI solutions to the public sector. [Read more](https://lnkd.in/dv54S_6h) - WIRED reports a bold move by Microsoft to utilize generative AI, based on GPT-4, for the U.S. intelligence community. This deployment is exclusive to a US government-only network, underlining the growing trust in AI's potential for sensitive applications and the continuous pursuit of leveraging technology for national security. [Read more](https://lnkd.in/dU_UVKtu) As the landscape of AI expands, these developments signal a common theme: the tailor-made and strategic application of AI tools. From enabling governments to enhancing company portfolios, the future of AI continues to promise both excitement and innovation. Will OpenAI's big reveal be the next significant leap? Time will tell. Stay tuned.
To view or add a comment, sign in
-
🚀 AI Inference Engines Performance Showdown: SGLang vs. vLLM 🚀 I’ve just published a new article comparing two powerful AI inference engines: SGLang and vLLM. If you’re interested in the performance of LLMs in a local AI environment, this deep dive into real-world testing will give you valuable insights. 👉 Curious to see which engine comes out on top? Check out the full article on Medium and let me know your thoughts! #AI #LLM #PerformanceTesting #LocalAI #SGLang #vLLM
LLM inference engines performance testing: SGLang VS. vLLM
medium.com
To view or add a comment, sign in
8,227 followers