Vaibhava Lakshmi Ravideshik’s Post

Ambassador @ DeepLearning.AI and @ Women in Data Science Worldwide

1mo

Incredible news from Anthropic 🎉 🎊 !!! It has just announced significant upgrades to its AI portfolio, including the enhanced Claude 3.5 Sonnet and the upcoming Claude 3.5 Haiku. There's also a remarkable new "computer control" feature now in public beta. 🤖💡

🔧 The upgraded Claude 3.5 Sonnet has set a new benchmark in AI-powered coding, achieving a stunning 49.0% on the SWE-bench Verified benchmark. This surpasses all publicly available models, including those from OpenAI and specialized coding systems. GitLab has reported up to a 10% boost in reasoning across various use cases without any additional latency. 🏆

🖥️ Pioneering computer interaction capabilities, Claude 3.5 Sonnet can now view screens, control cursors, click, and type, mirroring human actions. According to Anthropic, this makes it the first frontier AI model to offer such computer control in public beta. Initial benchmarks are promising, with an impressive 14.9% on screenshot-only OSWorld tests, nearly doubling the performance of the next-best system!

📈 And let's not forget about the upcoming Claude 3.5 Haiku, set for release later this month. It promises to match the performance of Claude 3 Opus while maintaining cost-effectiveness and speed, achieving a noteworthy 40.6% on SWE-bench Verified, outperforming the original Claude 3.5 Sonnet and even GPT-4o.

Anthropic remains committed to safety and responsible scaling, having conducted rigorous evaluations with both the US and UK AI Safety Institutes, and adhering to the ASL-2 Standard.

#AI #Coding #TechInnovation #Anthropic #MachineLearning #FutureOfWork #AIResearch #TechNews

6 Comments

Dr.Shahid Masood

President GNN | CEO 1950

1mo

This is truly groundbreaking! The advancements in AI capabilities, particularly in coding and human-like computer interaction, are set to revolutionize the tech landscape. The ability of Claude 3.5 Sonnet to control a computer like a human opens up new possibilities for automation and efficiency in various industries. I'm particularly intrigued by the potential applications in fields like cybersecurity, where such precise control could be invaluable. Looking forward to seeing how Claude 3.5 Haiku will further push the boundaries. Kudos to Anthropic for their commitment to safety and responsible innovation. Thanks for sharing this exciting update!

1 Reaction

Vedant Nayak

Quant Research Virtual Intern @JPMorganChase │ CS50 AI │ Exploring Economics & AI

1mo

CodeQwen is also good at coding.

1 Reaction


More Relevant Posts

Sunny Kusawa

AI Expert | Building Gen AI Platform @AMI AI[BMC] | Gold Medalist Engineer | Amazon Best Seller Author | YouTuber | Ex-Oracle | Ex-Sungard[FIS]
10mo
𝐄𝐱𝐜𝐢𝐭𝐢𝐧𝐠 𝐀𝐈 𝐁𝐫𝐞𝐚𝐤𝐭𝐡𝐫𝐨𝐮𝐠𝐡 𝐟𝐫𝐨𝐦 𝐆𝐨𝐨𝐠𝐥𝐞 𝐃𝐞𝐞𝐩𝐌𝐢𝐧𝐝 𝐚𝐧𝐝 𝐎𝐱𝐟𝐨𝐫𝐝! 🚀

🔍 They demonstrate how large-scale, unlabeled real-world data can improve TAP (Tracking Any Point) models with minimal architectural changes, using a self-supervised student-teacher approach.

🌟 𝐇𝐢𝐠𝐡𝐥𝐢𝐠𝐡𝐭𝐬:
- A pioneering large-scale pipeline for improved point tracking
- Uses a substantial dataset of unannotated real-world clips
- Predictions stay consistent under spatial transformations
- Predictions remain stable regardless of which query point along the trajectory is chosen
- Impressive state-of-the-art results on point tracking benchmarks
- Code release includes models and support for both #JAX and #PyTorch

A remarkable stride in AI research, showcasing the power of real-world data and self-supervised learning. Explore the code and stay at the forefront of AI innovation! 💡

Research Paper: https://lnkd.in/dKxtrBKy
𝐅𝐨𝐥𝐥𝐨𝐰 𝐟𝐨𝐫 𝐋𝐚𝐭𝐞𝐬𝐭 𝐀𝐈 𝐓𝐫𝐞𝐧𝐝𝐬: https://lnkd.in/dNZ8Xmfr

#AIResearch #DeepLearning #GoogleDeepMind #OxfordAI
Hydralogic AI

1,046 followers
5mo
🚀 Anthropic Releases Claude 3.5 Sonnet! 🚀

Anthropic just launched Claude 3.5 Sonnet, building on the success of the Claude 3 models released only 3 months ago. This new model boasts a 200k token context window and significant improvements in speed and intelligence over leading AI models.

Key Highlights:
- Superior Performance: Outperforms Claude 3 Opus, GPT-4o, Gemini 1.5 Pro, and Llama-3 400B on benchmarks like GPQA, MMLU, math (MGSM, GSM8K), and coding (HumanEval).
- Speed and Cost: Operates twice as fast as Claude 3 Opus, priced at $3 per million input tokens and $15 per million output tokens.
- Vision Capabilities: Excels in visual math reasoning (MathVista), chart understanding (ChartQA), and document understanding (DocVQA).
- Agentic Coding: Solves 64% of problems in Anthropic's internal agentic coding evaluation, a significant improvement over Claude 3 Opus's 38%.
- Artifacts Feature: Generates dynamic workspaces for content creation like code snippets and text documents, allowing real-time interaction and editing.
- Safety and Privacy: Classified as AI Safety Level 2 (ASL-2), tested rigorously by the UK and US AI Safety Institutes.

Claude 3.5 Sonnet is available for free on Claude.ai and the Claude iOS app, as well as via API, Amazon Bedrock, and Google Cloud's Vertex AI.

Learn More: https://lnkd.in/eddane8F

Anthropic Hydralogic AI Titanisu

#AI #Innovation #Anthropic #Claude3.5Sonnet #TechNews #ArtificialIntelligence #MachineLearning

1 Comment
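To make the pricing quoted in the post above concrete, here is a minimal sketch of a per-request cost estimate. It assumes only the two prices stated in the post ($3 per million input tokens, $15 per million output tokens). The function name `estimate_cost_usd` and the example token counts are hypothetical, and real billing may differ.

```python
# Prices as quoted in the post (USD per million tokens).
INPUT_PRICE_PER_MTOK = 3.00
OUTPUT_PRICE_PER_MTOK = 15.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request at the quoted per-token prices.

    Hypothetical helper for illustration; not part of any official SDK.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_PRICE_PER_MTOK
             + output_tokens * OUTPUT_PRICE_PER_MTOK)
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 10,000 tokens in, 1,000 tokens out.
    print(f"${estimate_cost_usd(10_000, 1_000):.4f}")
```

Under these assumptions, a prompt that fills the full 200k-token context window would cost about $0.60 in input tokens alone.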
Innovation Hacks AI Inc.

5,576 followers
4mo
🧮 𝐆𝐨𝐨𝐠𝐥𝐞’𝐬 𝐀𝐈 𝐬𝐜𝐨𝐫𝐞𝐬 𝐬𝐢𝐥𝐯𝐞𝐫 𝐚𝐭 𝐌𝐚𝐭𝐡 𝐎𝐥𝐲𝐦𝐩𝐢𝐚𝐝

Google DeepMind's AI systems, AlphaProof and AlphaGeometry 2, achieved a significant milestone in AI’s math reasoning capabilities, attaining a silver medal-equivalent score at this year's International Mathematical Olympiad (IMO).

𝐓𝐡𝐞 𝐃𝐞𝐭𝐚𝐢𝐥𝐬:
- The combined system solved 4 out of 6 problems from the 2024 IMO, scoring 28 out of 42 points (each problem is worth 7 points).
- This achievement represents a significant leap from previous AI attempts, which could barely solve 1 in 100 past IMO problems.
- AlphaProof uses a fine-tuned Gemini model to translate math problems into formal language, then searches for proofs.
- AlphaGeometry 2 demonstrated remarkable speed, solving a complex geometry problem in just 19 seconds.

𝐖𝐡𝐲 𝐢𝐭 𝐌𝐚𝐭𝐭𝐞𝐫𝐬: This breakthrough demonstrates AI's growing ability to perform complex math on par with the world’s smartest humans. It’s a big step towards more generalized AI systems capable of advanced reasoning, which could have implications across scientific research, engineering, finance, legal analysis, and more.

AI is not just about automation; it's about augmenting human intelligence. What are your thoughts on AI's role in solving complex problems? Share your insights!

#ArtificialIntelligence #MathOlympiad #GoogleDeepMind #TechInnovation #AIBreakthrough #MathReasoning #AlphaProof #AlphaGeometry

Source: https://lnkd.in/gncrqvrc
Image source: Google DeepMind
José Daniel García Espinel

Head of Innovation Growth at Ferrovial. Senior Engineer specialized in Digital Innovation, Technology, Digital Transf. and Product Development🔹Artificial Intelligence🔹IoT🔹Cloud🔹3D Printing🔹Virtual Reality🔹Metaverse
1mo Edited
"Anthropic unveils new #Claude #AI models and ‘#computer #control’" by Ryan Daws at TechForge Media 🚀 Exciting news in the AI landscape! Anthropic has rolled out significant upgrades to its AI portfolio, featuring the enhanced Claude 3.5 Sonnet model and the introduction of the Claude 3.5 Haiku. With a remarkable 49.0% score on the SWE-bench Verified benchmark, Claude 3.5 Sonnet sets a new standard in #AICoding capabilities, outperforming all existing models, including those from OpenAI. But that's not all! The debut of computer control functionality allows Claude to #interact with #computers like a human—navigating screens and executing commands. This innovative feature, currently in public beta, positions Claude 3.5 Sonnet as a pioneer in the #AI frontier. As major tech firms integrate these capabilities, the future of AI coding looks brighter than ever. 🌟 #ArtificialIntelligence #MachineLearning #TechInnovation #AIModels #Claude3.5 #Coding #ComputerControl #Anthropic Source: https://lnkd.in/eWffxt97
Rana Hasan

Data Scientist at ML1 | Master's Student | Engineer & Data Enthusiast | IBM Certified
5mo
Anthropic Launches Claude 3.5 Sonnet: The Next Leap in AI Innovation!

Anthropic has just launched Claude 3.5 Sonnet, setting new standards in AI performance. Here are the highlights:

Performance Highlights:
- Graduate-Level Reasoning: 59.4% on GPQA Diamond.
- Knowledge Mastery: 88.7% on MMLU.
- Coding Proficiency: 92.0% on HumanEval.
- Multilingual Math: 91.6% on MGSM.
- Text Reasoning: 87.1% on DROP (F1 score).
- Mixed Evaluations: 93.1% on BIG-Bench-Hard.
- Math Problem-Solving: 71.1% on MATH.
- Grade School Math: 96.4% on GSM8K.

Why Claude 3.5 Sonnet Stands Out:
- Speed: Twice as fast as Claude 3 Opus.
- Cost-Effective: $3 per million input tokens, $15 per million output tokens.
- Accessible: Free on Claude.ai and the iOS app, also available via the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.

New Feature: Artifacts
- Dynamic Workspace: Real-time content generation and editing.
- Enhanced Collaboration: Seamless AI integration in projects.

Commitment to Safety and Privacy:
- Rigorous Testing: Maintains ASL-2 safety standards.
- Privacy Focus: No training on user-submitted data without explicit permission.

Claude 3.5 Sonnet also outperforms GPT-4o in many tests, strengthening its claim to be the leading AI model.

#AI #Innovation #ArtificialIntelligence #MachineLearning #AIPerformance #TechNews #Claude3.5Sonnet #Anthropic #AIDevelopment #FutureOfAI #AIResearch #TechInnovation #AIModel #AIFeatures #AICommunity
Manikanta Karthu

UX Designer | Created user-centered designs • Skilled in Figma, Adobe XD • Wireframing, Prototyping
5mo
Exciting News! 🎉 Anthropic has just unveiled Claude 3.5 Sonnet, setting new benchmarks in AI performance and speed. With a 200k token context window and advanced capabilities in math, coding, and document understanding, this model is a game-changer. Plus, it's twice as fast as its predecessor at the same cost. Kudos to Anthropic for pushing the boundaries of AI innovation!

Elias Kouloures

Chief Creative AI Officer, Marketing Polymath & Data-Driven Brand Transformer | Neurodiverse AI Innovator & Creative Director with 20+ years of international experience on 150+ global brands.
2mo
🚀Just emerged from the neural network of brilliance that was #PAKCON2024 in Berlin! 🧠 💻 My GPUs are still overclocking from the information overload! From optimizing EBITDA with ML to the ethical tensor calculations of AI governance, this wasn't just a conference - it was a full-scale distributed learning system! 🚀🤖 💪🏻

Key training datasets acquired:
• German AI ecosystem: More layers than a deep belief network! 🇩🇪🔬
• AI implementation failures: Debugging the human-AI interface 🐛🔧
• Ethical imperatives: Aligning AI utility functions with human values 🎯🌍

The real transformer architecture? The synapse-forming connections with the incredible minds from Phoenix, AI, and Management Hives. You're all quantum leaps ahead! ⚛️💡

My video montage captures the event's latent space. Fair warning: It may cause spontaneous neural net formation! 🎥🧬

Were you part of this training epoch? What activation function lit up your neurons the most? Let's keep this distributed learning going! 💬📈

#AIRevolution #MachineLearning #DeepTech #BerlinAIScene #NeuralNetworks #EthicalAI #StartupSingularity #QuantumComputing #AGI

(P.S.: I was too tired and let Claude write this post 100%.)

Was nice meeting you: Denys Holovatyi Johannes Helberger Dennis Walton Yaqi Li 🇺🇦Andriy 🇩🇪 M. Habib U. R. Bhatti 🔜 Gamescom Ricarda Bannert ✔ Florian Krumb Saman Arefi Vivek Modi Dr. Olivia Lewis Dr. Benedikt Flöter Judith Peterka Ole Bossdorf Michael Wandtke Elvira Bekyrova Viktor von Essen Thuy-Ngan Trinh Christoph Burseg Sebastian Herzog Isabelle Schnellbuegel Paul Becker Simon Walter Rasmus Rothe Dr. Alexander Mrozek Anna Franziska Michel Dr. Alexandra Deichsel Miriam van Straelen Cecilie Hegelund

13 Comments
Rituuraj Bdwai

Digital Transformation Leader | Fractional CMO | Ex-Mahindra, Godrej, Reliance | Marketing Technology | Sustainability Evangelist
7mo
Microsoft's research team has launched Phi-2, following the successful Phi-1 and Phi-1.5 models. Phi-2 shows marked improvements in reasoning, language, math, and coding tasks. What's more impressive is its compact size of just 2.7 billion parameters. Yet on some benchmarks it outperforms far larger models such as the 70B-parameter Meta Llama-2, as well as Google's Gemini Nano 2 with 3.2B parameters. The secret sauce? Phi-2 relies on high-quality synthetic training data and knowledge transfer from its smaller predecessor.

#AI #MachineLearning #MicrosoftResearch #Innovation

Article Source Link - https://lnkd.in/d-n_qtYU
Video Source Link - https://lnkd.in/dAeKm3e9
Sancharika Debnath

Top Machine Learning Voice || Passionate Data Scientist || Machine Learning || Data Science Decoded
5mo
🌟 Day 32 of #100DaysOfLearning! Today was another productive day diving deep into the world of AI and language models:

1️⃣ 5 LeetCode Practices: Strengthened problem-solving skills with 5 new challenges.
2️⃣ Graph Learning: Embarked on a journey into understanding graphs, exploring their applications and algorithms.
3️⃣ Research Paper: Explored "The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits".

Here are the key takeaways:
- BitNet b1.58: Introduces a revolutionary 1-bit variant where every parameter is ternary {-1, 0, 1}.
- Performance Parity: Matches full-precision models in perplexity and task performance with reduced latency and energy consumption.
- Cost-Effectiveness: Offers significant benefits in memory usage, throughput, and computational efficiency.
- New Scaling Law: Defines a pathway for future LLM generations, optimizing both performance and cost.
- Hardware Optimization: Opens possibilities for specialized hardware tailored to 1-bit LLMs, enhancing computational capabilities.

This research marks a pivotal advancement in AI architecture, promising a future of smarter and more efficient language models. Excited to see how this transforms the landscape of AI applications!

Links in comment 🔗

#AI #MachineLearning #DeepLearning #Graphs #ResearchPaper
3 Comments
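The ternary {-1, 0, 1} weights mentioned in the post above can be illustrated with a small sketch. It assumes an absmean-style scheme: scale the weights by their mean absolute value, round to the nearest integer, and clip to [-1, 1]. The function name `absmean_ternary` and the sample weights are hypothetical, and this is not the paper's training code. The "1.58 bits" figure follows from a three-valued weight carrying log2(3) ≈ 1.58 bits of information.

```python
import math

# Information content of one three-valued weight, in bits.
BITS_PER_TERNARY_WEIGHT = math.log2(3)


def absmean_ternary(weights, eps=1e-8):
    """Quantize a list of floats to values in {-1, 0, 1}.

    Hypothetical helper: scales by the mean absolute weight (gamma),
    rounds to the nearest integer, then clips to the range [-1, 1].
    Returns the quantized list and the scale gamma.
    """
    if not weights:
        raise ValueError("weights must be non-empty")
    gamma = sum(abs(w) for w in weights) / len(weights)
    quantized = [max(-1, min(1, round(w / (gamma + eps)))) for w in weights]
    return quantized, gamma


if __name__ == "__main__":
    # Made-up example weights, for illustration only.
    q, gamma = absmean_ternary([0.9, -0.05, 0.4, -1.2])
    print(q, round(gamma, 4), round(BITS_PER_TERNARY_WEIGHT, 3))
```

Small weights collapse to 0 and large ones saturate at ±1, so matrix multiplication against such weights needs only additions and subtractions, which is a plausible source of the latency and energy savings the post refers to.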
