Skip to main content

DIGHT 260

Tools for Text Processing and Analysis

College of Humanities

Course Description

Introduction to tools and techniques suited for use with text analysis, including basic text encoding, text preparation, basic regular expression, and concordance programs.

When Taught

Fall

Grade Rule

Grade Rule 8: A, B, C, D, E, I (Standard grade rule)

Min

3

Fixed

3

Fixed

3

Fixed

0

Title

Text Representation in Computers

Learning Outcome

 Students will understand how text is represented in computers (e.g., ASCII, Unicode, emojis) and how to resolve issues with character encodings and fonts.

Title

Regular Expressions and Wildcards

Learning Outcome

Students will construct regular expressions and wildcards to search for patterns to reformat text and add markup.

Title

Basic Scripting

Learning Outcome

Students will be familiar with a scripting language and know how to modify templates for analyzing text files.

Title

Text Analysis Tools

Learning Outcome

Students will use tools to analyze texts for word frequency, keywords, collocations, n-grams, and other patterns.

Title

Corpus Design and Creation

Learning Outcome

Students will develop a WordCruncher corpus to facilitate research and text analysis.