CelerData

CelerData

Software Development

Menlo Park, California 9,497 followers

The only SQL engine that is fast enough to run the most demanding workloads directly on your data lakehouse.

About us

CelerData enables enterprises to quickly and easily grow their business with the world’s most performant query engine for open lakehouses. Powered by StarRocks, CelerData delivers 3X the performance/cost of any other solution on the market and is the only platform uniquely designed to enable users to simplify their lakehouse architecture and ditch the need for a data warehouse. CelerData is used worldwide by market-leading brands including Airbnb, Pinterest, and Trip.com to generate critical new insights for these data-driven companies.

Website
www.celerdata.com
Industry
Software Development
Company size
51-200 employees
Headquarters
Menlo Park, California
Type
Public Company
Founded
2022
Specialties
Data Analytics, Real-time analytics, Database, Lakehouse, and Data Lake

Locations

Employees at CelerData

Updates

  • View organization page for CelerData, graphic

    9,497 followers

    Exciting insights from Alfred Johnson in this fantastic write-up on how Herdwatch leverages StarRocks-powered CelerData Cloud + Apache Iceberg to solve their data silos and slow dashboard problems. Delve into how they achieve faster analytics ("reducing query latency from 2–5 minutes on Athena to 700 milliseconds–1.5 seconds" 🎉), better governance, and lower costs in this article: https://lnkd.in/gUed6S4P #DataAnalytics #DataEngineering  #DataLakeAnalytics #DataLake #DataLakeHouse

    View profile for Alfred Johnson, graphic

    Cloud Data Platform Architect | Building successful data teams | Data Modelling and Governance

    At Herdwatch we've been on a journey to rebuild our data architecture to unify and analyze tens of thousands of animal lifecycle events in one unified layer across all our regions Using Apache Iceberg and analytics powerhouse StarRocks powered by CelerData we've truly been able to leverage the lake house paradigm to build a highly performant and efficient data platform. Watch this space for exciting new products we will now be able to deliver powered by our data platform! Read about our motivation and architecture here :

    Herdwatch Enable Customer-Facing Analytics Using Apache Iceberg And CelerData Cloud

    Herdwatch Enable Customer-Facing Analytics Using Apache Iceberg And CelerData Cloud

    starrocks.medium.com

  • View organization page for CelerData, graphic

    9,497 followers

    🔗 https://lnkd.in/gw3wfx-6 - ⏰ Join Pinterest's Zhenxiao Luo and StarRocks PM Sida Shen for a deep technical discussion—designed by engineers, for engineers. Discover how Pinterest is investing in customer-facing analytics and the impact this work is having on its customers and bottom line. Here's what you can expect: 🌟 Challenges and Requirements for Customer-Facing Analytics at Pinterest: Understand the specific hurdles Pinterest faced in scaling its analytics infrastructure to meet the growing demands of real-time data insights for millions of users and partners worldwide. 🌟 Performance Improvements and Cost Savings Pinterest has Gained: See how Pinterest optimized their Partner Insights platform and achieved a 50% reduction in P90 latency while utilizing only 32% of the instances previously required. 🌟 Interactive Q&A: Engage directly with Pinterest experts. This is your chance to ask questions and gain deeper insights into their success. Don't miss this opportunity to learn from an industry leader and discover how you can achieve faster, more efficient analytics that translates to more revenue for your business. Register Now to Secure Your Spot! #DataAnalytics #DataEngineering

    This content isn’t available here

    Access this content and more in the LinkedIn app

  • 🌟 We’re excited to co-host the Apache Iceberg™ SF Meetup next Thursday, February 15, alongside Daft, Amazon Web Services (AWS), Databricks, Dremio, Greybeam, and Snowflake! If you’re in the area, join us for an evening of insightful discussions, networking, and the latest updates in the Apache Iceberg ecosystem. 📅 Date: Thur February 27 | Doors open at 5pm 📍 Location: San Francisco 🔗 Register: https://lu.ma/77zbx044 Don’t forget to stop by and say hi to the CelerData team—we’d love to chat! #DataAnalytics #DataEngineering #DataLakeAnalytics #DataLake #DataLakeHouse

    Apache Iceberg™ SF Meetup · Luma

    Apache Iceberg™ SF Meetup · Luma

    lu.ma

  • CelerData reposted this

    View organization page for StarRocks, graphic

    1,595 followers

    Shoutout to the amazing NAVER Corp Engineering Team (김영진 and 이모원Moweon Lee) for sharing this in-depth breakdown of how this dominant force in South Korea’s internet industry solved their analytics challenges with StarRocks! NAVER manages 20+ PB of data in its Apache Iceberg Lakehouse, powering real-time insights across 200+ interconnected services in search, e-commerce, AI, and more. The Challenge NAVER’s ClickHouse-based analytics system initially provided fast aggregation but struggled as their needs evolved: ⚠️ Fixed Dimensions – Lack of JOINs forced rigid denormalized tables, limiting analytical flexibility. ⚠️ Scalability Bottlenecks – Balancing data across nodes required manual intervention, making scaling inefficient and resource-intensive. ⚠️ Limited Data Upserts & Deletes – Merge-on-read severely degraded performance, restricting support for real-time mutable data. Why StarRocks? To overcome these limitations, NAVER benchmarked Trino, Pinot, Druid, and StarRocks based on key criteria like multi-table JOINs, upsert performance, federated analytics, and scalability. After extensive testing, StarRocks emerged as the best solution due to: ✅ Native multi-table JOINs – Eliminates the need for denormalized tables, enabling flexible, real-time analytics. ✅ Federated Analytics – Seamless integration with Apache Iceberg, Hive, and other data sources. ✅ Superior Aggregation Performance – Matched or exceeded ClickHouse’s speed while handling dynamic workloads. ✅ Cloud-Native Scalability – Decoupled storage-compute architecture simplifies horizontal scaling in Kubernetes. ✅ Efficient Real-Time Upserts – Enables fast updates without impacting query performance, a key requirement for NAVER. Further Optimizations NAVER optimized query performance using StarRocks’ materialized views, achieving: 🔹 6x faster performance on multi-table and aggregated queries. 🔹 Automated Query Rewrite & Refresh – No manual SQL modifications required for optimizations. The Impact ✅ Real-Time Interactive Queries – Engineers can now directly query raw data using SQL, improving analytics flexibility. ✅ Superior JOIN Performance – StarRocks’ native JOIN engine enables multi-table analytics at scale. ✅ Unified Query Platform – Supports a hybrid analytics ecosystem, integrating Iceberg, Hive, and real-time data. ✅ Seamless Scalability – Kubernetes-native design ensures linear scaling, optimizing resource efficiency and cost. 👉 Read the full story here: https://lnkd.in/gGt_g4Yq #DataAnalytics #DataEngineering #RealTimeAnalytics #DataLakeAnalytics #DataLake

    How JOIN Changed How We Approach Data Infra At NAVER

    How JOIN Changed How We Approach Data Infra At NAVER

    blog.devgenius.io

  • 🎥 [on-demand] 🔗https://lnkd.in/gRjgDXYE - The recording of "Xiaohongshu/RedNote’s Secret for Customer-Facing Analytics With 200M+ Users" is now available! Watch this video to discover how one of the world’s fastest-growing social platforms has scaled its analytics using open-source technologies—all while keeping costs under control. #DataAnalytics #DataEngineering #RealTimeAnalytics #DataLakeAnalytics #DataLakeHouse

  • 🔴 Going LIVE in just 90 minutes! Haven’t signed up yet? Don’t miss out—register here 👉 https://hubs.la/Q035xlL70 🔔 Join us to uncover how #Xiaohongshu/#RedNote, one of the world’s fastest-growing social platforms, is scaling its customer-facing analytics with open-source technologies—maximizing efficiency while keeping costs in check. Attend the session to learn how Xiaohongshu: 🌟 Reduced ad platform costs after migrating to StarRocks while boosting performance to support 10,000+ concurrent users.
 🌟 Achieved sub-100ms queries across customer-facing dashboards, elevating the user experience for advertisers and internal teams.
 🌟 Transitioned to an open lakehouse architecture on Apache Iceberg, improving data governance and providing greater flexibility. 
 #DataAnalytics #DataEngineering

    Xiaohongshu/RedNote’s Secret for Customer-Facing Analytics With 200M+ Users

    Xiaohongshu/RedNote’s Secret for Customer-Facing Analytics With 200M+ Users

    celerdata.wistia.com

  • View organization page for CelerData, graphic

    9,497 followers

    👏 Great insights from TRM Labs in their latest blog on building a petabyte-scale data analytics platform. Dive into their detailed journey to see how StarRocks + Apache Iceberg tackled their previous challenges and unlocked ultra-fast, scalable, and customer-facing analytics! #DataAnalytics #DataEngineering #DataLakeAnalytics #DataLake #DataLakeHouse

    View organization page for TRM Labs, graphic

    45,561 followers

    At TRM Labs, we process petabytes of #blockchain data across 30+ chains and handle over 500 customer queries per minute — all with ultra-low latency, thanks to our distributed Postgres and BigQuery powerhouse. After years of optimizing BigQuery to scale efficiently, we recognized the need for an open, self-hosted, secure, and high-performance solution to meet our expanding demands. In our latest Data Engineering deep-dive, discover: 🚀 What drove our leap to a petabyte-scale data lakehouse 🔬 Why we bet on Apache Iceberg and StarRocks, and the experiments that shaped our decision ⚡ How our Next Generation Data Platform supercharges user-facing analytics Get all the details here ➡️ https://bit.ly/4aP00Mf #DataEngineering #TRMEngineering #BlockchainIntelligence #Hiring

    • No alternative text description for this image
  • 👥 Join us this Thursday, February 6, as Sida Shen shares how RedNote is scaling customer-facing analytics while keeping costs in check. 👉As TikTok faces uncertainty, #RedNote (#Xiaohongshu) has surged into the spotlight, welcoming a wave of new users on top of its 200M+ monthly active base. But how is it handling this surge while optimizing its data infrastructure for ads and analytics? Get the inside scoop on how RedNote: 🌟 Reduced ad platform costs after migrating to StarRocks while boosting performance to support 10,000+ concurrent users. 🌟Achieved sub-100ms queries across customer-facing dashboards, elevating the user experience for advertisers and internal teams. 🌟Transitioned to an open lakehouse architecture on Apache Iceberg, improving data governance and providing greater flexibility. https://hubs.la/Q035kTGc0 #DataAnalytics #DataEngineering #DataLakeAnalytics#DataLake #DataLakeHouse

    Xiaohongshu/RedNote’s Secret for Customer-Facing Analytics With 200M+ Users

    Xiaohongshu/RedNote’s Secret for Customer-Facing Analytics With 200M+ Users

    celerdata.wistia.com

  • Keeping up with performance SLAs and achieving low query latency for high-performing analytics can feel overwhelming. But with Materialized Views in StarRocks and CelerData Cloud, you don’t have to sweat it. 📕This article breaks down how Automatic Materialized Views make performance optimization effortless—so you can stay ahead with ease. 💡 Here’s why Auto-MV is a game-changer: ✅ Ease of Use: Automatically recommends and creates MVs based on query patterns—no expertise required. ✅ Performance Optimization: Delivers faster queries using StarRocks’ advanced query rewrites without any SQL changes. ✅ Cost Efficiency: Reduces system overhead and total cost of ownership (TCO). Dive deeper into the details: https://hubs.la/Q0356-kZ0 #DataAnalytics #DataEngineering

    Faster Customer-Facing Analytics With Automatic Materialized Views

    Faster Customer-Facing Analytics With Automatic Materialized Views

    celerdata.com

  • What an incredible turnout at the #ApacheIceberg Bay Area Meetup yesterday! 🔥 Yan Zhang, Staff Engineer at CelerData, took the stage to walk through best practices and key strategies for optimizing query performance for Apache Iceberg at scale. Big thanks to PuppyGraph, Snowflake, Amazon Web Services (AWS), Databricks, and the Apache Iceberg Community for organizing such a fantastic event! 🚀 #DataAnalytics #DataEngineering #DataLakeAnalytics#DataLake #DataLakeHouse

    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image
    • No alternative text description for this image

Similar pages

Browse jobs