A Web-Based Approach to Measure Skill Mismatches and Skills Profiles for a Developing Country:. Jeisson Arley Cárdenas Rubio
Appendix B: Text Mining
Appendix C: Detailed Process Description for the Classification of Companies
C.1. Manual coding
C.2. Word-based matching methods (“Fuzzy merge”)
C.3. A return to manual coding
Appendix D: Machine Learning Algorithms
Appendix E: Support Vector Machine (SVM)
Appendix F: SVM Using Job Titles
Appendix G: Nearest Neighbour Algorithm Using Job Titles
Appendix H: Additional Tables
Figure 2.1. Labour market structure
Figure 2.2. Composition of informal economy
Figure 2.3. Labour market equilibrium under perfect competition
Figure 2.4. Labour market segmentation
Figure 3.1. Labour structure in Colombia
Figure 3.2. Participation, employment, unemployment, and informality rate trends, 2001-2018
Figure 4.1. IP traffic by source, 2016-2021
Figure 5.1. Job advertisement comparison between job portals
Figure 6.1. Steps for extracting more value from job vacancy information
Figure 6.2. Word cloud: Frequency analysis
Figure 6.3. Word association: Frequency analysis
Figure 6.4. Summary of steps carried out to obtain the Colombian vacancy database
Figure 7.1. Distribution of job placements by departments, 2016-2018
Figure 7.2 Ratio of job placements to EAP by departments, 2016-2017
Figure 7.3. Job placements by minimum educational requirements
Figure 7.4. Word cloud: Most frequent job titles by job portals
Figure 7.5. Distribution of job placements by major occupational ISCO-08 groups
Figure 7.6. Job placements by experience requirements
Figure 7.7. Trends of the labour demand by major occupational ISCO-08 groups
Figure 7.8. Trends of the most demanded occupations at a four-digit level
Figure 7.9. Occupations at a four-digit level with a positive trend
Figure 7.10. Occupations at a four-digit level with a negative trend
Figure 7.11. Wage density
Figure 7.12. Jobs by type of contract
Figure 7.13. Duration density (monthly)
Figure 8.1. Education and wages (Colombian pesos)
Figure 8.2. Occupations and wages (Colombian pesos)
Figure 8.3. Years of experience and wages
Figure 8.4. Job placements and employment distribution by occupational groups (ISCO-08)
Figure 8.5. Wage distributions
Figure 8.6. Time series: Total employment and job placements, 2016-2018
Figure 8.7. Time series: Total unemployment and job placements, 2016-2018
Figure 8.8. Time series: New hires and job placements, 2016-2018
Figure 9.1. Occupational distribution of the Colombian workforce by skill level
Figure 9.2. Unemployment and informality rates and duration of unemployment by skill level
Figure 9.3. Average wages of formal and informal workers by skill level
Figure 9.4. Labour market composition of Colombian workers by skill level, 2010-2018
Figure 9.5. Employment growth by skill level, 2011-2018
Figure 9.6. Evolution of the unemployment rate by skill level, 2015-2018
Figure 9.7. Evolution of the informality rate by skill level, 2010-2018
Figure 9.8. Beveridge curve by (major) occupational groups
Figure 9.9. Percentage change in unemployed individuals by sought occupation
Figure 9.10. Percentage change in formal employment by occupation
Figure 9.11. Percentage change in new hires by occupation
Figure 9.12. Percentage change in hours worked for formal employees by occupation
Figure 9.13. Percentage change in job placements by occupation
Figure 9.14. Percentage change in mean real hourly wage for formal employees by occupation
Figure 9.15. Occupational hourly pay premia
Figure 9.16. Occupational pay premia within job placements
Figure 9.17. Number of occupations according to the percentage of indicators that suggest skill shortages
Figure A.1. Job portal comparison
Figure A.2. Job advertisement comparison within the same job portal
Figure A.3. Code comparison between job portals
Figure A.4. HTML code structure
Figure C.1. Fuzzy merge: The classification of companies
Figure E.1. SVM classification with job titles
Table 3.1. Characteristics of the Colombian workforce
Table 4.1. OECD quality framework and guidelines
Table 4.2. Possible sources that affect the quality of information from job portals