Coursework requirements for the MS in HLT program are the same in-person and online. See the Department of Linguistics Graduate Handbook for the MS in HLT program requirements.
- LING 503: Foundations of Syntactic Theory
- LING 538: Computational Linguistics
- LING 539: Statistical Natural Language Processing (see public Course Overview page for LING 539)
- LING 578: Speech and Language Technology
- LING 581: Advanced Computational Linguistics
- LING 593: Internship (6-9 units)
While not technically required, the following courses are strongly recommended:
- LING 529: Human Language Technology I (see public Course Overview page for LING 529)
- LING 531: Human Language Technology II
- LING 582: Advanced Statistical Language Processing
The MS in HLT requires a minimum of 36 units.
For optimal sequencing, we recommend the following progression:
Remaining courses as available; see the Course Schedule for more information.
LING 529: Human Language Technology I
This class serves as an introduction to human language technology (HLT), an emerging interdisciplinary field that encompasses most subdisciplines of linguistics, as well as computational linguistics, natural language processing, computer science, artificial intelligence, psychology, philosophy, mathematics, and statistics. Content includes a combination of theoretical and applied topics such as (but not limited to) tokenization across languages, ngrams, word representations, basic probability theory, introductory programming, and version control.
LING 531: Human Language Technology II
This intermediate-level course is a continuation of LING 529 and covers a combination of theoretical and applied topics such as (but not limited to) unsupervised learning (clustering), decision tree classifiers, and the basics of information retrieval.
LING 538: Computational Linguistics
Fundamentals of formal language theory; syntactic and semantic processing; the place of world knowledge in natural language processing. Graduate-level requirements include a greater number of assignments and a higher level of performance.
LING 539: Statistical Natural Language Processing
This course introduces the key concepts underlying statistical natural language processing. Students will learn a variety of techniques for the computational modeling of natural language, including: n-gram models, smoothing, Hidden Markov models, Bayesian Inference, Expectation Maximization, Viterbi, Inside-Outside Algorithm for Probabilistic Context-Free Grammars, and higher-order language models. Graduate-level requirements include assignments of greater scope than undergraduate assignments. In addition to being more in depth, graduate assignments are typically longer and additional readings are required.
LING 581: Advanced Computational Linguistics
This course provides a hands-on project-based approach to particular problems and issues in computational linguistics.
LING 582: Advanced Statistical Natural Language Processing
This course focuses on statistical approaches to pattern classification and applications of natural language processing to real-world problems.
LING 578: Speech Technology
Topics include speech synthesis, speech recognition, and other speech technologies. This course gives students background for a career in the speech technology industry. Graduate students will do extra readings, extra assignments, and have an extra presentation. Their final project must constitute original work in a speech technology.
LING 503: Foundations of Syntactic Theory
An introduction to syntactic theory with an emphasis on data analysis, critical thinking, and theory development. Taught within the generative Principles and Parameters approach to syntax. Graduate-level requirements include a greater number of problems.
LING 508: Computational Techniques for Linguists
Students are introduced to computer programming as it pertains to collecting and analyzing linguistic data. The particular programming language is chosen at the discretion of the instructor. Graduate-level requirements include more challenging exams; 50% greater contribution to their respective group projects; 9 instead of 6 assignments; additional readings from the primary literature.
NOTE: The version offered in the online program is closer to something like “advanced programming techniques for computational linguists”
LING 696G: Topics in Computational Linguistics
The development and exchange of scholarly information, usually in a small group setting with an in-depth investigation of computational linguistics theory and application. The scope of work shall consist of research by course registrants, with the exchange of the results of such research through discussion, reports, and/or papers.
NOTE: The topic/focus may vary from offering to offering. The next offering will be structured as a followup to the Speech Technology course (LING 578).
Electives outside of the Department of Linguistics
Due to their relevance and regular availability, you may want to also consider the following non-LING courses as electives.
INFO 523: Data Mining and Discovery
This course will introduce students to the concepts and techniques of data mining for knowledge discovery. It includes methods developed in the fields of statistics, large-scale data analytics, machine learning, pattern recognition, database technology and artificial intelligence for automatic or semi-automatic analysis of large quantities of data to extract previously unknown interesting patterns. Topics include understanding varieties of data, data preprocessing, classification, association and correlation rule analysis, cluster analysis, outlier detection, and data mining trends and research frontiers. We will use software packages for data mining, explaining the underlying algorithms and their use and limitations. The course include laboratory exercises, with data mining case studies using data from many different resources such as social networks, linguistics, geo-spatial applications, marketing and/or psychology
INFO 536: Data Science and Public Interests
This course focuses on the use of modern data science methods to help learners make socially responsible decisions and mitigate harm that arises from issues like bias, discrimination, and threats to one’s personal privacy. More and more individuals are needing to make data-driven decisions in a wide variety of contexts including non-governmental organizations, not-for-profit industries, human services, environmental organizations, refugee camps, and more. Students in this class will thus learn about data science and how it can be utilized in contexts where socially-good decisions are desired and emphasized. This active learning class is designed for students who have an interest in the topic but who may have little to no previous experience with data science or programming.