Skip to content

AgnieszkaFalenska/Language-in-Social-Context

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Language in Social Context: Bridging NLP and Sociolinguistics (ESSLLI 2024)

Area: Language and Computation (LaCo)

Level: Introductory

Lecturer(s): Agnieszka Faleńska and Filip Miletić

Help and practical exercises: Pema Gurung

Course Link: click here

Abstract: This course explores the relationship between language and society by drawing on complementary perspectives from sociolinguistics and natural language processing. It discusses variation in language use in connection with

  • (i) speakers’ sociodemographic properties such as age, gender, and socioeconomic status;
  • (ii) speakers’ patterns of interaction; and
  • (iii) the social meaning conveyed through the use of different linguistic variants.

In bringing together NLP and sociolinguistics, we aim to establish a comparable level of depth across the disciplines.

  • For students already acquainted with fundamental computational linguistics, the course will provide insights into the richness of language when viewed within a broader social context.
  • For students with a background in sociolinguistics, we will present examples of various computational techniques that can be used to analyze linguistic variation.
  • Students with previous experience in programming will be able to run the practical exercises independently and engage with different types of attested linguistic data.

Slides

Day Title Slides
1 Introduction Day 1
2 Demographic factors (I) Day 2
3 Demographic factors (II) Day 3
4 Language variation in interaction Day 4
5 NLP applications and challenges Day 5

Practical exercises

Students with previous experience in programming can run additional practical exercises and play with different types of attested linguistic data. They can either use Google Colab (recommended) or run the code locally (see instructions below).

Day Task Dataset Google Colab (recommended) Jupyter notebook (run locally)
1 Data Analysis Trust Data Google Colab Local Notebook
2 Geographical Variability Twitter Google Colab Local Notebook
3 Gender Variability Trust Data Google Colab Local Notebook
4 Style Accomodation Reddit CMV Google Colab Local Notebook

Pre-requirements for practical exercises

Basic Python knowledge: Python tutorial

Running code locally

Python Installation: Python Downloads

To run the code locally instead of Google Colab

  • Step1: Download all the notebooks (also available in the notebook folder)
  • Step2: Open jupyter notebook through the activated env
  • Step3: Follow the instructions at the top on how to download the data
  • Step4: Run the notebook

Virtual Enivironment Setup

To install the packages used for this project run the commands in the terminal:

  • Step1: Install pip install virtualenv
  • Step2: Run the command (replace the <version> with your version and <virtual-environment-name> with your choice of name) python<version> -m venv <virtual-environment-name>
    Example: python -m venv my_venv
  • Step3: Activate the Virtual env source my_venv/bin/activate
    Note: We will be installing all the packages to my_venv environment and using it further to run the code.

About

Repository for the ESSLLI 2024 summer school

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages