Arnaud Vincent: "Introduction to the concepts and tools of Corpus Linguistics for the analysis of textual data."
Description
- Language: French
- Level: introductory
- Prerequisite: none
- Targeted field: Social and human Sciences
- Targeted audience: Any researcher wishing to learn more about and analyse text corpora
- Software mainly used: Lancsbox (free)
- The following items will be discussed:
- Place Corpus Linguistics in the Big Data, digital humanities and text mining landscape
- Create a corpus (advice, words of caution, DIY corpora vs ready-made corpora)
- Collocations
- Frequency and dispersion
- Concordances
- Extraction of keywords, N-Grams, key N-grams
- Detection of"plagiarism" between two texts and identification of idiolects
- Duration: 9am to 4pm
- Registration: Compulsory before the 20/08/20 - see "Registration" in the section below
Practical Details
Université Saint-Louis
Boulevard du Jardin botanique 38
Room D16 (4th floor)
1000 Brussels
Écrire commentaire