Tag: quanteda
-
R-Conference 2022
Presentation materials for “A Math-Fearing Social Scientist’s Basic R Toolkit: Scraping, Content and Network Analysis”, presented at the R-Conference 2022 organized by the Malaysian R-User Group (MyRUG), 26-27 Nov 2022. Materials: slides (PDF); R code for the Google News example; R code for the Tale of Two Cities example (and the text data).
-
Basic Text Analysis and Visualization in R
At its most basic level, text analysis is about counting words. If words are frequently used, we assume that they are important. If words occur together, we assume that they are related. Obviously, that is not always the case, but a discerning researcher like yourself will be able to filter this information, provide context and…
-
Corpus Loading and Text Cleaning with Quanteda in R
Assuming you have a file with text data (perhaps a spreadsheet that you have exported as a CSV file, or data scraped from Google News), you can now start to build and clean your corpus. Fortunately, functions in the quanteda package make this very easy. First, we load quanteda and the corpus (mycorpus.csv…
-
(Korean) Text Analysis in R and Pajek (incomplete)
R and its almost endless library of packages and plug-ins (CRAN) mean that you can do almost anything in R, including text analysis and network analysis. While you could do everything in R, that doesn’t mean you should. Specialized network analysis software can also be very useful when interpreting, analyzing or visualizing a network, as…