Automating the Competence Matrix of a Quality Control Laboratory with the Relational Database Management Framework—A Case Study with the Analytical Chemistry Unit of a National Regulatory Agency ()
1. Introduction
Many national regulatory agencies (NRAs) monitor the manufacturing, testing, and distribution of medicines to conform to international standards as part of their core functions [1] . These NRAs’ Quality Control (QC) laboratories are facilities that conduct tests on medicines and generate reports used to make administrative decisions on pharmaceutical products that are utilized within countries. The oversight functions of an NRA depend on the competencies of the laboratory personnel who are exposed to several pieces of training that equip them with the skills requisite for their various job roles. These functions are driven by the competence matrix, which appropriately aligns and ties personnel skillsets, training, and competence assessment statuses to job roles, in form of a data management mechanism [2] . Institutions have challenges maintaining a competency matrix to map core functions for their personnel especially when they depend on manual systems which are associated with various inefficiencies [3] [4] .
The National Agency for Foods and Drugs Administration and Control (NAFDAC), Nigeria’s NRA has several laboratories that carry out QC functions for drug products used within the country [5] . The analytical chemistry (ANACHEM) unit is one of the key departments in NAFDAC laboratories saddled with the QC core function. The unit comprises several cadres of scientists who undergo different pieces of training to equip them with the skills needed for their job roles. A competence matrix provides comprehensive information on the training and competency assessment status of all personnel. It also links job roles to personnel’s current levels of proficiencies attained after satisfactory evaluations of designated competency assessments. It, therefore, forms a configuration that supports personnel data management on training and competence. However, human errors were inherent in managing the personnel training records manually and led to inefficiency in tracing personnel proficiency statuses in the drug laboratory, in addition to process pace issues [6] . In general, the skills required for laboratory QC analysts are broadly categorized as functional, managerial, and behavioral [2] . Some studies have reported variations in the proposed timelines required for QC analysts to acquire these competencies [6] [7] [8] . This is understandable given that a personnel’s previous work experience, occupation, skill, educational background, etc., can affect the ease with which they can successfully pass a competency assessment.
However, using a manual process to perform the required data management operations in NAFDAC’s ANACHEM is a complex task which involves determining the sequence of personnel training needs, pooling training records, and keeping timelines of training activities. These requirements pose challenges related to mental operations, human errors and delays in conducting events. Consequently, the personnel competence training and assessments are conducted slowly and inefficiently [6] . This study used relational database management system (RDBMS) to improve the time inefficiencies associated with ANACHEM data management process. The RDBMS described in this study can be adapted and customized for other potential users, similar to Sohne’s recommendations that a system he created for automating revenue collection can be adapted by other communities with low infrastructures [8] .
The problems with paper-based records have been extensively documented in some research reviewed. Redundancy of hospital records and inadequate information from data structure were identified as major challenges in a study that elucidated the benefits of computerizing the documentation procedures in some health-related facilities [9] [10] . To address the challenges identified with the manual operations, a prior effort developed an Excel-based system to manage personnel competence mapping for the ANACHEM unit [11] . Although the Excel-based competence matrix supported creating a list of the pieces of training and competence assessment tests and generated the duration (hours) required for each piece, it did not adequately mitigate those challenges. Like every other spreadsheet package, Excel does not essentially handle database management operations efficiently. For example, Melissa in his article pointed out some of the shortcomings in using Excel to include performance slowness, no support for data integrity, and defect in handling relational database management operations [12] . Furthermore, Gupta delineated the deficiencies of using such an Excel-based process in handling database tables [13] . The flaws observed by Gupta imply that Excel tables are not logically linked, and data query operations are difficult. Nevertheless, according to Azam, Powell, et al., and Rahman et al., data queries among other operations, are essential in effective management of the information for personnel training and competence assessments [14] [15] [16] .
Therefore, this paper provides an automation model that improves on the sluggishness and human error problem type that characterized the manual and Excel-based data management processes in the personnel competence matrix. The model aligns with the World Health Organization (WHO) recommendation to regulatory agencies in relation to automating their manual procedures for training and competence assessment data, to achieve international standard practices [17] . NRAs use the WHO guidelines to accomplish an essential strategy to deliver internationally recognized drug testing, inspection, and assessment of Good Manufacturing Practices [5] . Other research automated or computerized the business processes of health agencies to increase profits and reduce costs. A study provided details about the computerization of an erstwhile manual process and evaluated various benefits within a hospital [18] . Another attempt at automating erstwhile manual systems were earlier published in a low-income resource country [New Guinea]. The country computerized the processes in its health management systems as a means of improving healthcare delivery services [19] .
Purpose of Study
This study addressed the time wastage associated with the manual or Excel- based data management processes in personnel skills training and acquisition in the ANACHEM laboratory. It also determined how the existing Excel-based system affects the time demand in administering personnel data management in the competence matrix. It compared the existing system to the automated-based model to determine if the time for the Excel-based operation demand was substantially high. Therefore, it generated the study research question: to what extent did computerization of data management operations reduce the time demand for administering the personnel competence training and assessments in the ANACHEM laboratory?
Aside determining time demands for conducting personnel competence training and assessment, other deliverables for this study were that the computerized competence matrix system should be able to: register and keep track of personnel details, record the sequence of exposures to SOPs and OJTs for all cadres of personnel. It should also track date stamps for the start and completion of individual SOP per personnel, provide the time-point for assessing personnel competence, and access personnel proficiency status on SOPs, OJTs, and competence assessments.
2. Fundamental Principles
The NAFDAC’s ANACHEM laboratory competence matrix is a framework that maps each personnel’s skills and job roles to a definite proficiency level. Each proficiency level is associated with defined weight/credit (in hours) for standard operating procedures (SOPs) training, on-the-job tasks (OJTs) and competence assessment tests. Table 1 illustrates a matrix that maps personnel proficiency levels to their SOP trainings.
A status (quantitatively denotes a weight) is a Boolean declaration that states whether the staff is competent or not with respect to a particular SOP when a data query is executed. The competency matrix specified different minimum proficiency levels for all categories of personnel [6] . The various job roles, proficiency levels, and the number of hours needed to attain the recommended levels
Table 1. Matrix representation of personnel vs. number of available SOPs.
are listed in Table 2. The number of weights each employee (or staff) accumulates determines the achieved proficiency level in that department. The new (automated) competence matrix developed by this study utilized personnel demographics, job roles and corresponding minimum proficiency levels to create database tables, and subsequently track and update the matrix. The study categorized the new matrix into two broad functionalities: staff registration and SOP administration (or staff training).
There are four levels of competency assessments for the ANACHEM unit. The weight allocated to an increasing level of assessment is higher than the previous lower competency tests. For example, competency level 1 assessment for determining accuracy in gravimetric measurements weighs lower than Level 2, etc. The rationale is that higher levels of skills and SOP training are required as personnel progresses in the QC activities over time. Some examples of weight hours associated with some SOP and competency assessments are listed in Table 3.
The Automated-Based Model Structure
RDBMS was used as a framework to automate personnel records management as a structured database system in this study. This championed the computerization of the data management operations to enhance the operational pace and efficiency. Computer-based data management is a concept that indicates a database management prototype to integrate DBMS for efficient handling of data storage, retrieval, manipulation, and security. Computerization of some processes reported where the use of an RDBMS provided an interface between “users and application” and the database. Such studies highlighted using RDBMS as a tool in administrative functions for managing data storage, access, and performance both in the health sector, and other related fields [20] - [28] . It was identified from these reviews that computerizing processes with well-structured RDBMS was crucial in data management operations. Unfortunately, ANACHEM’s Excel-based spreadsheet did not apply any relational database management structure. However, this study addressed this limitation by extensively making use of RDBMS structure in the “New” system that it developed.
Table 2. Hours of training requirements for attaining the minimum proficiency levels for each role in the analytical chemistry unit.
Table 3. Rationale for weights allocated to some SOPs and competency assessments in determining cumulative hours of training gained by a QC lab personnel.
ANACHEM’s Excel spreadsheet for competence matrix did not adequately address the sluggishness of its personnel data management system. Some studies documented using automated-based system to explore time reduction as a means of improving the management of an erstwhile slower system. Daphine used visual studio 2005 to improve the efficiencies of the management system in a school library. Computerizing the manual process allowed the Liberians to add users and books into the system, simplified search from the system database, and managed borrowing from the library resources more efficiently [29] . Other studies demonstrated how RDBM framework improved the efficiencies of handling data; Albalkhi used the framework to increase the time efficiency of querying large heterogeneous datasets in his study core design [30] . Another study elaborated on how access control was applied to secure medical records. This study implemented the mechanism using a relational database system. Employing ANSI standard SQL statements the project created a set of access-control rules encoded as relational table rows. The mechanism determines via a single SQL query if a particular user is authorized to perform a specific operation on a definite data object [31] .
Essentially, one of the main objectives of this research is to use RDBMS-based platform to improve the turnaround time to perform the task of tracking personnel competency in the Nigerian FDA lab, this is similar to the efficiencies described by some researchers who used RDBM to improve time efficiencies of operations within healthcare and other systems [32] [33] [34] .
3. Methods
3.1. Research Design
The methods for implementing the manual and Excel-based competence matrix were reviewed as the backbone of this study. A model system was integrated with an RDBMS structure to automate the data management of the personnel training and competence assessment operation in the laboratory. This structure enables all required fields and attributes for the personnel demographics, job details, training, and competence statuses to be effectively incorporated into well- connected database tables and schema. This study designed two process methods (staff registration and staff assessment modules) using SQL language to mobilize the RDBMS structure. Specifically, MySQL database software was employed to develop the “New” system because it is an open-source database tool that centers on the specification of standard SQL [35] [36] .
3.2. Model System Design
A model system was designed as data management software to simplify the manual/Excel-based process-induced difficulty in managing the personnel competence matrix in the laboratory. Python programming language and structured query language (SQL) were employed to design the “New” system that adopted the RDBMS framework. In creating the “New” system two methodical steps were covered, namely design criteria and testing criteria. At the design criteria stage, the process flow for the manual/Excel procedures and “New” system were evaluated and validated to determine the elements required to create the backend design of the RDBMS-structured system. In designing the frontend, the backend design of the framework was integrated into the system and the process flow validated. Flowcharts, algorithms, and source codes generations were computational problem-solving tools engaged both at the frontend and backend stages. At the testing criteria level, all flowcharts and program source codes developed to generate results (inference/decision) from interoperability of the designs were evaluated and validated. The designs are illustrated in Figure 1.
Key data objects employed to design the desired relational database structure include tables for personnel’s demographics, departments, job roles, and SOPs/ OJTs. RDBMS framework was utilized to interconnect the tables for designing the “New” system, which is the automated personnel competence training and assessments data management format of the competence matrix. This study setup and configured MySQL software in a Microsoft Windows system to enhance the tasks integration. In operating the “New” system that this study developed, the ANACHEM administrator takes a decision on two tasks. Either to conduct a data management operation (on personnel enrollment using the Staff Registration module) or employee SOP/OJT or competence assessment (using the Staff Training/Assessment module). Figure 2 illustrates the process flow part for the “New” system.
However, either option will lead the operational control to flow down through
Figure 1. Flowchart of the design procedure for the “New” system.
the point where the process outcome is committed into the DBMS repository.
3.3. Data Collection and Analysis
The population in this study was the data management tasks that a laboratory administrator (head of lab.) conducts for personnel competence training and assessment operations in ANACHEM. The model or “New” system handles the stages of those tasks as modules. They include 5 stages: personnel registration (new employees or transfers into ANACHEM unit), first SOP allotment (SOP #1 to personnel), complete SOP (to confirm the previously assigned SOP is done), administer subsequent SOPs (SOP # 2, #3, #4, etc., and competence level #1),
Figure 2. The “New” system process flow is determined by a task option (Staff Training or Assessment) module selected by an administrator.
and personnel assessment (evaluates the cumulative hours of SOP modules at competence level #1 and determine/report current proficiency level). In collecting the core data, the times (T1, T2, T3, and T4) for performing 4 tasks were computed at the stipulated 5 stages of the operation.
The data samples used in this research were time measurements obtained from 40 simulated tasks that laboratory administrator conducts for competence training and assessment operations in ANACHEM. These laboratory administrative tasks encompassed the five stages of operation as highlighted above in this section. The tasks were run within four consecutive days to obtain the times t1, t2, t3, and t4 spent in performing tasks T1, T2, T3, and T4. Essentially, these 4 tasks were run on 5 hypothetical subjects (P1, P2, P3, P4, and P5) using the Excel and the New system respectively. The sampling was performed by simulating the laboratory administrator randomly. The rationale for the sampling method and size includes:
· The need to accommodate the absence of the human subject approach that this research employed.
· The sampling size of 40 is significantly adequate for a two-sample (or 20 matched pairs) problem from a randomized comparative experiment.
The “New” system (which is the automated software) used hypothetical subjects to generate data to create an assumed competence matrix alternative to the Excel system. Thus, comparative experimental design was a good fit for determining the relative performances of the alternatives. Using each option, data was obtained for performing the predefined core tasks for 5 hypothetical subjects. RStudio was used to evaluate the test statistics for the core data [37] .
This study used the “two-sample problem” approach to answer the research question by comparing the means of the response variable in two populations (operations performed with Excel system, μExcel, and the “New” system, μNew), with an assumption that the two groups were independent. Measurement used is by “matched pair” approach. The two populations were compared by obtaining the difference in the measurement for each pair of the samples. Therefore, the following null and alternative hypotheses were formulated:
Null Hypothesis: The mean in the amount of time spent within the population of tasks performed for completing the scheduled activities in ANACHEM did not differ using the Excel system and the “New” system.
Alternative Hypothesis: The difference in the mean of time spent within the population of tasks performed for completing the scheduled activities in ANACHEM using the Excel system and the “New” system was greater than zero.
A t-test statistical test was performed to determine which of the hypotheses to reject.
4. Results
4.1. A Review of the Old System
The old system was a manual register that contained employee name, job role, and date employed. It also contained a list of schedules of SOPs/OJTs and competence assessment tests for all cadres of staff. The schedules were arranged in sequence administered in the laboratory. One of the lists (or sheets) contained in the Excel competence matrix independently monitored duration, in terms of the hours required for each SOP/OJT training, see Figure 3. This list also aggregated the man-hours required for analyst to attain each competence level. As earlier observed the tables contained in Excel competence matrix are neither physically nor logically linked. Consequently, queries that were crucial for achieving efficient personnel data management in the competence matrix were difficult to attain as Excel spreadsheets were inherently not structured for database management.
4.2. Advantages of SQL over the Excel System
Unlike the Excel spreadsheet setup, the SQL used for the “New” system linked the tables in the competence matrix while defining their relationships in a database. This configuration provided a medium where tables for storing specific
Figure 3. Ensample of ANACHEM Excel competence matrix spreadsheet.
objects, for example, employees, SOPs, and Job Roles, were linked to easily communicate with each other. Therefore, SQL was a fit database programming language choice to effectively deploy RDBMS. The RDBMS framework contains features that support the strategy required to address the deficiencies that characterize the manual or Excel-based procedures in handling the tables associated with the personnel competence matrix.
4.3. Time Efficiency of the New System
The time for performing the 40 simulated tasks using the Excel and the “New” system to generate the core data is summarized in Table 4. The data set provides the simulated times it took a laboratory administrator to perform 4 different types of tasks involved in competence training and assessment using the 2 systems for 5 hypothetical subjects in the laboratory.
This study used R software as a statistical tool to compare the means of the response variable in two populations (operations performed with Excel system and with “New” system). The values for the mean difference for a sample (
), the standard deviation of the differences for the sample (
), and the number of observations (n), were computed as
,
, and
. The standard error
and test statistic
were computed as
and
The critical value
followed
distribution and the degree of freedom,
Table 4. Time spent in performing the 4 core tasks within the population for T_New (TN) and T_Excel (TE).
. Therefore, the values of critical value
and p-value were computed as
and p-value = 0.001908888. The significant p-value obtained (p = 0.00191, α = 0.05), provided enough evidence to reject the null hypothesis. Therefore, the alternate hypothesis was upheld: the mean difference for the entire population was greater than zero. This means that the time taken to accomplish the tasks using the “New” system (37.08 seconds) is significantly lower than the time measurements for the Excel system (77.39 seconds).
Therefore, there was substantial evidence that the average time of completing the scheduled activities in ANACHEM was reduced when performed using the “New” system. This implied that the difference between the average time taken to complete an operation using the Excel System and using the “New” System was found to be greater than zero. A confidence interval (CI) for mean difference (
) was computed as estimate ± margin of error:
margin of error,
.
The CI for mean difference estimated the mean increase to be about 2.01, with a margin of error of ~1.28 points. Therefore, with 95% confidence the mean increase
in using the Excel System instead of the “New” (computerized data management) system was between 0.7 and 3.3:
. The outcome verified that the Excel System performed the operation with longer time than using the “New” System. The time efficiency of the “New” system in performing data management operations of the competence matrix in ANACHEM laboratory is summarized in Figure 4 below.
Figure 4. Time efficiency of the “New” system in performing personnel data management in ANACHEM laboratory.
The computerized competence matrix (“New” system) developed using RDBMS framework is able to register and keep track of personnel details and record the sequence of exposures to SOPs and OJTs. It also tracks date stamps for the start and completion of individual SOPs and therefore provides the time-point for assessing competence, see Figure 5. These features make it possible to access personnel proficiency status on SOPs, OJTs, and competence assessments on demand.
5. Discussion
A Purdue doctoral project provided the sequence of activities and timelines required by newly recruited personnel to attain different maturity levels of competence in analytical testing in an NRA QC laboratory. This project created linkages for the various activities in the unit, mapping the various recommended pieces of training. Danziger and Kraemer reported that in organizations where decision-making is highly data dependent, personnel efficiency was increased by using automated data management processes over paper records [38] . A similar study evaluated the impact of computerized information in healthcare system
Figure 5. The Staff Status Report is a key deliverable of the computerized competence matrix with the RDBMS framework.
and recommended distinctive roles and tasks for technical personnel for effective operation [39] . This study identified various deficiencies with the manual record-keeping system in the NRA QC laboratory to include human errors, loss of data, difficulties with archival processes, and long processing and retrieval times. It also delineated the limitations of an Excel spreadsheet system that was used to mitigate the observed deficiencies. The study provided a more efficient computerized system that provides evidence that automation substantially addressed the limitations experienced in the personnel data management in NAFDAC ANACHEM. Statistical test results showed a significant decrease (p = 0.00191, α = 0.05)) in the average time demand required to perform 4 scheduled tasks regarding data management operation in ANACHEM using the “New” system compared to using the Excel system. This study result confirmed that the computerization of personnel data management operations using RDBMS reduced the time demand for administering the personnel competence training and assessments in ANACHEM.
The “New” system mitigated the limitations of the old system and was able to: register and keep track of personnel details; record the sequence of SOPs and OJTs for all cadres of personnel; track date stamps for start and completion of individual SOP/personnel; provide the time-point for assessing personnel competence; and access personnel proficiency status on SOP & OJT and competence assessments.
In line with the WHO guidance for NRAs, personnel competence training and its assessment is a process that relies on good data management (GDM) [17] . In this study, the principles of GDM were useful to transform ANACHEM’s manual personnel data management and scale up the Excel spreadsheet developed by a Purdue dissertation project [6] . This study reviewed the current competence matrix in the ANACHEM laboratory and automated its personnel data management processes. The “New” system can map the sequence and date stamps of personnel training and competence assessments. It also provides a comprehensive record of personnel training history. Therefore, the “New” system developed by this project provides a more reliable platform in line with WHO recommendation for GDM in ANACHEM laboratory.
It is envisaged that implementing a computerized system will enhance the laboratory’s quality management system. Data privacy strategy in manually driven processes cannot support long-term maintenance. Filing either raw or processed data inside folders and storing them in cabinets does not achieve the intended data secrecy. It gives false security assurance as keeping documents in file cabinets can engender security threats to the object [40] . Other associated challenges include increased cost of man-hours, human errors associated manual operations, and costs accrued to data loss. These shortcomings were addressed by the RDBMS-based competence matrix system developed in this study.
This automated “New” system supported substantially lower time in executing the same operation in comparison to the Excel system in the ANACHEM laboratory. It is recommended that this “New” system can be customized to handle other quality management processes.
6. Conclusion
This study ascertained that there was a significant decrease in the average time (37.08 seconds Vs 77.39 seconds, p = 0.00191, α = 0.05) for completing 4 scheduled sets of activities in ANACHEM laboratory when RDBMS-based “New” system was used to automate the Excel system. Unlike the Excel system, the “New” system software supports real-time and on demand access to personnel proficiency levels thereby enhancing time efficiency of the laboratory. Finally, computerization of the data management operations achieved with this study will support ANECHEM laboratory’s desire for a more efficient process for administering competence, training, and assessment.
Acknowledgements
We acknowledge the supports of Stephen Byrn and Mercy Okezue from Purdue University IPPH Department and the Nigerian FDA Analytical Chemistry laboratory.
Availability of Data and Materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Authors’ Contributions
Marcel Okezue: Conceptualization, Methodology, Data curation, Validation. Eric Dietz: Supervision, Funding Acquisition.