Comparing with Python: Text Analysis in Stata
Abstract
Text analysis is the process of constructing structured data from unstructured textual content, and it is usually implemented in Python. In principle, basic text analysis requires only a program that can read a file and match its contents against a regular expression. However, few researchers have used Stata as their main text analysis tool. In this paper, I take a step-by-step approach to the practical process, giving examples of how text analysis can be performed in Stata and comparing the code and running time with Python.
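The principle stated in the abstract, reading a file and matching it with a regular expression, can be illustrated with a minimal Python sketch. This is not code from the paper: the year pattern, the sample text, and the names `YEAR_PATTERN`, `extract_years`, and `demo` are all hypothetical and chosen only for illustration.

```python
import re
import tempfile
from pathlib import Path

# Hypothetical pattern (an assumption for this sketch): a four-digit year, 1900-2099.
YEAR_PATTERN = re.compile(r"\b(?:19|20)\d{2}\b")


def extract_years(path):
    """Read a text file and return one record per line that mentions a year."""
    records = []
    text = Path(path).read_text(encoding="utf-8")
    for line_number, line in enumerate(text.splitlines(), start=1):
        years = YEAR_PATTERN.findall(line)
        if years:
            # Each record is one structured row built from unstructured text.
            records.append({"line": line_number, "years": years})
    return records


def demo():
    """Write a small made-up sample file, then extract structured rows from it."""
    sample = "Founded in 1998.\nNo date here.\nRevised in 2015 and 2023.\n"
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "sample.txt"
        path.write_text(sample, encoding="utf-8")
        return extract_years(path)
```

Calling `demo()` returns a list of dictionaries, one per matching line, which is the kind of structured output that could then be loaded into a dataset for further analysis.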
Cite
@article{arxiv.2307.10480,
title = {Comparing with Python: Text Analysis in Stata},
author = {Xiangtai Zuo},
journal = {arXiv preprint arXiv:2307.10480},
year = {2023}
}
Comments
Declaration: I am Xiangtai Zuo, and I also use the English name Shutter Zor. This can be verified from my Google Scholar or ORCID profile. Thanks to the arXiv Content Management & User Support Team.