txttool: Utilities for text analysis in Stata

This article describes txttool, a command that provides a set of tools for managing free-form text. The command integrates several built-in Stata functions with new text capabilities. These latter functions include a utility to create a bag-of-words representation of text and an implementation of Porter’s (1980, Program: Electronic library and information systems 14: 130–137) word-stemming algorithm. Collectively, these utilities provide a text-processing suite for text mining and other text-based applications in Stata.
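For readers unfamiliar with the term, a bag-of-words representation stores, for each document, a count of every word in a shared vocabulary and discards word order. The sketch below is a minimal, generic Python illustration of that idea. It is not txttool's implementation or syntax, and the function name `bag_of_words` and the sample sentences are hypothetical.

```python
import re
from collections import Counter


def bag_of_words(documents):
    """Return (vocabulary, rows); rows[i][j] counts vocabulary[j] in documents[i].

    Hypothetical helper for illustration only; not part of txttool.
    """
    # Lowercase each document and split it into alphanumeric tokens.
    tokenized = [re.findall(r"[a-z0-9]+", doc.lower()) for doc in documents]
    # The vocabulary is the sorted set of all tokens seen in any document.
    vocabulary = sorted({tok for toks in tokenized for tok in toks})
    rows = []
    for toks in tokenized:
        counts = Counter(toks)
        # One count per vocabulary word; words absent from the document get 0.
        rows.append([counts[word] for word in vocabulary])
    return vocabulary, rows


docs = ["The cat sat on the mat", "The dog sat"]
vocab, matrix = bag_of_words(docs)
print(vocab)
print(matrix)
```

Each row of `matrix` corresponds to one document and each column to one vocabulary word, which is the same shape as a dataset with one count variable per word.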


Issue Date:
2014
Publication Type:
Journal Article
ISSN:
1536-8634
Language:
English
Published in:
Stata Journal, Volume 14, Number 4
Page range:
817-829

 Record created 2018-01-24, last modified 2018-01-25
