New Conversational AI Tool Lets You “Chat” With Your Data

New Conversational AI Tool Lets You “Chat” With Your Data

As an ML engineer, one area where I spend a lot of time is data engineering. Can we use conversational AI technologies like ChatGPT to help boost productivity here?

Turns out Akkio recently shipped an amazing tool called “Chat Data Prep” that lets you “chat” with your data. That’s right! You can transform your data by simply using instructions. 

Data Transformation 

When preparing data to train an ML model, you need to process, transform and clean it. What if I told you that you don’t need SQL or Python to do this?

Akkio ’s new exciting feature “Chat Data Prep” uses the power of conversational AI to interact and transform your data just using text instructions. Let’s check out a short example.

Let’s say we want to train a sentiment classifier on a restaurant reviews dataset. Here are two transformations you can perform on the text data:

  • Removing punctuations
  • Lowercase text

Chat Data Prep is brilliant for this! You can transform the data with natural language instead of using code.

Here is a view of how the loaded data looks on the interface, including the columns and rows of reviews and sentiment labels:

No alt text provided for this image

Data is never perfect, especially text. You should expect to do a lot of cleaning. I like the idea of just using instructions to get this job done. That’s where Chat Data Prep comes in.

As a quick example, here is how I instructed Chat Data Prep to remove punctuations:

I am also interested in converting all the text into lowercase, which is common when processing NLP datasets. Here is how to easily do it with one text command via Chat Data Prep:

No alt text provided for this image

This a great feature because you can just continue chaining all your data transformations.

And you can do a lot more! Here is a demo of two more popular and useful data transformations you can do with Chat Data Prep:

  • 3 sigma outlier removal
  • Date reformatting

Train A Sentiment Classification Model

One really cool feature I particularly like about Akkio is the ability to quickly train a model to test the impact of the data transformations. Again, no code is needed!

I can see that the transformations I applied boosted the performance of the model. And I can easily go back and continue adding more data transformations to see if I get better performance without leaving the interface. ML is iterative so this is really useful!

Akkio also has useful integrations like Snowflake, BigQuery, Google Sheets, and others. These are useful as you can easily access your data stores and start finding value in your data.

You can even deploy a web app or production-ready API. This really accelerates experimentation and deployment.

No alt text provided for this image

Last Words

I get super excited to see how AI-powered tools like Chat Data Prep can boost productivity.

No-code tools are redefining what it is to build and deploy ML-powered apps. Excited about this partnership with Akkio and what’s ahead! Check them out here: Akkio

Emilio Grimaldi

SWE | Artificial Intelligence | Large Language Models | Research

1y

It's a fantastic product to use for ML and analytics

Like
Reply
Charlotte Ledoux

Data & AI Governance Expert 🔎 | Data Governance coaching | Women in AI | Speaker | LinkedIn™️ Top Voice in AI 🇫🇷

1y

Impressive ! Analysts are enough then for data prep, the analytics engineer can focus on other tasks

Like
Reply
Flor Elisa Trillo-Tinoco

Records and Information Management Coordinator

1y

Fascinating!

Like
Reply
Gerardo Figueroa

Engineering Lead | Prev. Co-founder & CTO

1y

And here I was thinking I had some job security :D In all seriousness though, this looks super useful, especially when data manipulation is not an everyday thing and you have to look up how to remove duplicate rows from a DataFrame, for the nth time.

Lauren M. Kaplan, PhD

Ex-Meta AI | UXR for PyTorch, Privacy-Preserving ML & AI Hardware | Mixed Methods AI UXR | Exploring New Opportunities

1y

So much potential - Hunter Mills you might find this useful! Getting to know your data just got much more interesting 🧐 Elvis S. would love to also see examples of how this might be used for numerical data e.g. for doing regression, ANOVA etc On another note I’m curious how privacy of data is ensured.

To view or add a comment, sign in

More articles by Elvis S.

  • My Favorite LLM Papers for October

    My Favorite LLM Papers for October

    Here's a list of my favorite LLM papers I read this month: 1/ Zephyr LLM - a 7B parameter model with competitive…

    2 Comments
  • Tracking LLMs with Comet

    Tracking LLMs with Comet

    When building with LLMs, you will spend a lot of time optimizing prompts and diagnosing LLMs. As you put your solutions…

    3 Comments
  • How To Build a Custom Chat LLM on Your Data

    How To Build a Custom Chat LLM on Your Data

    This is one of the fastest ways to build a custom ChatGPT-like system on top of your data. It's called ChatLLM (by…

    2 Comments
  • Data Exploration with Chat Powered by GPT-4

    Data Exploration with Chat Powered by GPT-4

    As an ML Engineer, this is one of the most useful applications of GPT-4 I've seen. Chat Explore is a powerful…

    6 Comments
  • Open Source Solution Replicates ChatGPT Training Process

    Open Source Solution Replicates ChatGPT Training Process

    ChatGPT is the biggest buzz in AI today! ChatGPT demonstrates remarkable capabilities so there is a high interest to…

    7 Comments
  • Analyzing Worldwide Energy Production with Kibana Lens

    Analyzing Worldwide Energy Production with Kibana Lens

    While there are many tools that can be used to perform a quick analysis of large-scale data, data analysis in itself is…

    1 Comment
  • XLNet outperforms BERT on several NLP Tasks

    XLNet outperforms BERT on several NLP Tasks

    Two pretraining objectives that have been successful for pretraining neural networks used in transfer learning NLP are…

    1 Comment

Insights from the community

Others also viewed

Explore topics