NVIDIA Data Science Module 2 Part 1 (Data & Scraping)


Mike's Bio
Upcoming Events
CPU vs GPU
RAPIDS
RAPIDS API
It's all about the goal!
Ice Breaker
NVIDIA Modules
Part 1 Data & Scraping
Homework Review
Building a brain in 10 minutes
How I would solve this


Lots of data, but wait there is more, much more!
Emphasis on understanding processes
Data Mining
The good, the bad, the ugly
LLMs to the rescue
Data you can download
Data from APIs
Data to Scrape
Google Play Example
Scrape node-to-node
Scraping Google Play Reviews
The refactoring way - MVP!
As fast as you can think way
Scraping Libraries
Beautiful Soup
Scrapy
Exercise 1, Build a double scraper
Exercise 2, Put it up on Hugging Face
Data Annotation
Prepping for the Neural Net
TensorFlow Playground
MSE and Backpropagation
Labeling
Label Me
CVAT
CVAT.AI
CVAT
CVAT Homework
Data Quality
Data Quality
Data Quality
80% of our time
Companies need high-quality data
How do you get HQ data?
Sentiment Analysis
Don't start from scratch
Emotion Wheel
Exercise 3, what prompt would you use?
Q&A
More articles by Michael Lively
