People Data Excellence: Driving Quality through Empowerment, Standardization, and Automation
GIGO ("Garbage in, garbage out") is a significant risk for professionals working with data and AI. Ensuring high-quality (people) data is crucial for building leaders' trust in data-driven talent decisions and reducing the need for manual reconciliation. Moreover, maintaining top-tier data quality is essential for the successful implementation of AI and GenAI technologies.
When building Workforce360, IBM's internal people data platform, our goal was to go beyond abstract data quality metrics. We concentrated on initiatives that have measurable impacts on processes and outcomes, enhancing data quality through empowerment, standardization, and automation.
A. Empower: Putting Data & Knowledge in the Hands of Users
As we began developing our solution, it quickly became apparent that many data quality issues were not caused by technical flaws, but rather by user misunderstandings. Data errors were often introduced at the source (e.g., HRMS), and insights provided by our team were frequently misinterpreted by end users, particularly when the data passed through multiple hands. To address this, we prioritized giving users greater control over their data and direct access to relevant knowledge.
B. Standardize: Establishing a Unified Approach for data and processes
A second crucial aspect of data quality is addressing the issue of 'multiple versions of the truth,' which often occurs when different parts of an organization, whether within HR or across the company, collect and process their own data for a common purpose. I vividly recall seeing a slide a few years ago that displayed ten different values for what should have been a straightforward headcount metric—astonishingly, eight of these values pointed to the same IT system as the source of truth!
I vividly recall seeing a slide a few years ago that displayed ten different values for what should have been a straightforward headcount metric—astonishingly, eight of these values pointed to the same IT system as the source of truth!
To address this issue, we implemented Common Governance and radically simplified the processes and experiences that deliver these insights.
C. Automate: Enhancing Efficiency Through Technology
Finally, we focused on improving data quality from a technical perspective by automating as much of the validation process as possible. This proactive approach allows us to identify data errors and inform users before they encounter issues themselves, thereby enhancing trust.
At IBM, we recognize that high-quality people data goes beyond metrics—it’s about building trust, driving focused actions, and enabling AI to deliver meaningful recommendations. By empowering users, standardizing reports and governance, and automating data integrity validation, we've ensured that our data quality directly enhances business operations.
A Note on AI: When it comes to AI, additional quality mechanisms are essential. While strategies like bias and ethical mitigation, model fine-tuning, and hallucination detection are crucial, they can only be effective if the underlying data is of high quality. Otherwise, it’s a classic case of “garbage in, garbage out” (GIGO).
Thank you for taking the time to read this article. If you're interested in these topics and IBM’s internal people data transformation, I would be delighted if you joined my People Data Platform monthly newsletter.
Senior Business Analyst
3moVery informative and insightful
#PeopleAnalyticsforHR | Connecting the dots between HR and data/AI
3moThanks for another great article about data quality for HR. It's such an important topic. I'm curious to hear if IBM has done much work around process design (and training) to better capture data. As you mention, often data quality issues are often user generated. I believe that if processes are designed to optimally capture data, and those involved in the process understand the importance of how the data will be used, it will help with data quality.
great article Pietro Mazzoleni... "Trust in Accuracy"
AI + Organizational transformation
3mothanks for writing this. quick q, whats an example of a business metric rule that checks the 18M records daily.