Translation from narrative text to standard codes variables with Stata

In this article, we describe screening, a new Stata command for data management that can be used to examine the content of complex narrative-text variables to identify one or more user-defined keywords. The command is useful when dealing with string data contaminated with abbreviations, typos, or mistakes. A rich set of options allows a direct translation from the original narrative string to a user-defined standard coding scheme. Moreover, screening is flexible enough to facilitate the merging of information from different sources and to extract or reorganize the content of string variables.


Issue Date:
2010
Publication Type:
Journal Article
DOI and Other Identifiers:
dm0050 (Other)
PURL Identifier:
http://purl.umn.edu/159034
Published in:
Stata Journal, Volume 10, Number 3
Page range:
458-481
Total Pages:
24

Record appears in:



 Record created 2017-04-01, last modified 2017-08-27

Fulltext:
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)