Text Codification for Statistical Production using Machine Learning
Keywords:
Algorithms; Data; ClassificationAbstract
Objective
The main objective of this research is the usage and evaluation of Machine Learning algorithms for automatic text codification in statistical production process.
Prior Work
This is an evolving area of research due to rapid changes in technology as well as the new data ecosystem. The paper will build on previous research done on text classifications techniques.
Approach
In this paper Machine Learning algorithms will be used and evaluated for text codification. Natural Language Processing and classification algorithms will be implemented in Python.
Results
Machine Learning is powerful in the process of automation and modernization of statistical production lifecycle. Machine Learning algorithms performance is different for text classification. Data pre-processing and balance on the training data set are important to achieve good results.
Implications
This study shows that machine learning can be used in automating part of the statistical codification process. The results of this paper will serve the work of Albanian administration and more specifically statistical production.
Value
This research is a contribution to the usage of Machine Learning for the modernization of the codification process. It will serve as an initial work towards improving the timeliness and lowering statistical production costs
References
.
Downloads
Published
Issue
Section
License
Copyright (c) 2023 EIRP Proceedings
This work is licensed under a Creative Commons Attribution 4.0 International License.
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material
- for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.
Under the following terms:
-
Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- No additional restrictions — You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.