AINS graduate course

AINS6004 · Natural Language Processing

Core sequence · catalog 2026–27

Description

Text as data: tokenization through modern language models, evaluation, and guardrails for NLP systems in production.

Course shells on the Castalia LMS are provisioned per license; this link opens the LMS to explore the guest demo or landing experience.

Open Castalia LMS Back to catalog

Buy license Continue on the purchase hub to request a license or institutional quote.

Syllabus outline

  1. Modules 1–2 · Representations

    • Tokenization and embeddings
    • Classification and sequence labeling
    • Retrieval and lexical resources
  2. Modules 3–4 · Modern LMs

    • Encoder and decoder families (conceptual)
    • Prompting and instruction tuning (survey)
    • Evaluation beyond accuracy
  3. Modules 5–6 · Deployment

    • Safety filters and content policies
    • Latency/cost tradeoffs
    • Domain adaptation patterns