LANG 2090
Exploring Language and Communication with Big Data
Course Description
LANG 2090 introduces corpus linguistics, a method for studying language through large-scale data. Students learn how to uncover patterns in authentic language use by analysing existing corpora and building their own language datasets. By exploring how language functions across domains such as health, business, and digital communication, students develop digital literacy, quantitative research skills, and a deeper understanding of the role of language in society.
Highlights
- Using corpus tools to conduct hands-on analyses of real-world language data
- Exploring topics of personal interest (e.g. public discourse, online communication) through a group project
- Enhancing the ability to interpret and communicate insights through data visualization and analytical writing
Assessments
- An individual analytical essay that develops skills in comparative analysis across genres or registers
- A group-based corpus project that investigates language in a domain relevant to students’ interests
Target Students
- Suitable for students from all disciplines
- No prior experience in programming or linguistics needed
- Ideal for students curious about language, communication, or data
Learning Experience
Course Outcomes
- Practical experience working with digital language data
- Transferable skills in research, critical thinking, and data interpretation
- Ability to uncover hidden patterns in everyday language, yielding insights applicable to fields such as communication, education, media, and business