Kick start your company's data science practice (Part 1)

Kick start your company's data science practice (Part 1)

Past 5 years there has been lot of noise generated surrounding implementation of a data science practice within a company. Quite often, I have seen companies jump into establishing this program without thinking through long term ramifications. In this day and age of open source technology disruptions; companies need to be careful on what paths they choose as a go forward strategy that can be sustainable for the next several years.

There are several factors companies need to look for before attempting to get into this space:

What kind of data you possess internally?

Understanding data already available within the company is a critical step in the process. We have seen when companies grow inorganically through acquisitions same data will be processed and churned through multiple groups. It would be helpful if all of these can be collated in to central repository for further analysis. By doing so you have started building the foundation for data lakes.

Segregate or Anonymous data

Data Lakes established should have built-in security and privacy controls to help segregate and anonymous data

Search within your enterprise for all external data partnerships

Large companies always tend to buy or utilize external data (3rd party data sources) to enhance and enrich their own data. As an example; marketers may look for demographic data sold by 3rd parties to enrich their campaigns.

Look for some to lead your data science practice

When you want to make your first hire ensure the candidate meets many of the following criteria:

  • Statistical Background
  • Exposure to open source tools and languages
  • Exposure to machine learning algorithms
  • Hands-on person who likes to code and explore
  • Good communicator
  • Prior leadership experience
  • Worked on your specific data domain
  • Implemented production ready use cases
  • Does the individual has specific public profile in public competitions like KDD, Kaggle and others?

Decide on tools

There are two categories (Exploration and Production) of needs when you start looking for tools to implement data science within the company.

Best avenue to expedite GTM

The greater chance to success for GTM needs would be to partner with a company who can provide platform based consulting to identify and build your use cases. Platform based approach should address the need for reducing the effort to build new use cases as you continue to mature your offerings. External company should be able to offer seasoned consulting resources who have past experience in building these in the same or similar domain. They should be a long term candidate for some kind of OEM or partnership model.

In Part 2, I am going to talk about the state of the market place for tools, technologies, machine learning models and other measurable KPI's.



To view or add a comment, sign in

More articles by Venkatesh Guruprasad

  • LLM on Structured Data Sets

    LLM on Structured Data Sets

    Large Language Models (LLMs) on Structured Data Sets: A Synopsis of Current Challenges and Research Directions The…

  • LLMs and PDF Data Extraction (Semi-Structured and Unstructured)

    LLMs and PDF Data Extraction (Semi-Structured and Unstructured)

    Here is my proposal to solve PDF Data Extraction using LLMs. I have also highlighted some of the pitfalls and options…

  • My reflections of 2020

    My reflections of 2020

    Beginning of the year (1st 3 months) no one wanted to believe that we are living in a global pandemic Companies always…

  • Self Service Analytics

    Self Service Analytics

    What is “Self Service Reporting”? It is the concept where business users (semi-technical or non-technical) users can…

  • Contactless Payments still sucks

    Contactless Payments still sucks

    I recently started exploring utilization of contactless payments on my phone. I have found a bunch of challenges which…

  • Fundamental of Deep Learning (NN) for non technical audience

    Fundamental of Deep Learning (NN) for non technical audience

    As of now, I am not a data science practitioner on a day-to-day basis. I manage product strategy around build out of…

    1 Comment
  • Solving customer refund challenges from biller's

    Solving customer refund challenges from biller's

    Recently I went through an experience which started me thinking on why does it take so much time to get a refund from a…

  • Retail Experience in stores needs a major overhaul

    Retail Experience in stores needs a major overhaul

    I always wonder why are traditional retailers struggling? Is it because of digital economy? Is it because of eCommerce?…

    1 Comment
  • Kick start your company's data science practice (Part 2)

    Kick start your company's data science practice (Part 2)

    Know your primary need Picking tools and technologies is more of an art rather than science in the analytic world…

  • The power of Power BI

    The power of Power BI

    For the past several weeks I have been experimenting the capabilities that Microsoft's brand new Power BI tool. The…

Insights from the community

Others also viewed

Explore topics