Using the Corpus of Contemporary American English

 

http://corpus2.byu.edu/coca/

 

1. Searching for a word or phrase

 

dissertation

LIST    Frequency of the words and phrases that match the query

CHART   Table of their distribution across the sections of the corpus and across its successive five-year periods

KWIC    Display of the contexts (keyword in context), with parts of speech shown in colour

 

2. Comparing words or phrases

 

COMPARE

 

a) dissertation vs thesis

 

SORT BY FREQUENCY

 

W1 and W2 stand for the two words being compared.

Dark green marks the collocates that are used preferentially with one of the two words.

 

SORT BY RELEVANCE

The collocates are sorted by relevance.

 

For example, doctoral dissertation is 5 times more frequent than doctoral thesis.

 

The default window is 4 words to the left and 4 words to the right. This value can be changed:

 

COLLOCATES         *        1        0

 

This setting (1 word to the left, 0 to the right) returns the premodifiers of the search term.

 

Doctoral and unpublished are the only adjectives that modify the noun dissertation in the corpus.
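The effect of the two window settings can be sketched in a few lines of Python. This is only an illustration on an invented toy sentence, not COCA's own implementation; the function name collocates and the sample tokens are assumptions introduced for the example.

```python
from collections import Counter

def collocates(tokens, node, left=4, right=4):
    """Count the words found up to `left` words before and `right` words after `node`."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok != node:
            continue
        start = max(0, i - left)
        # Words before the node, then words after it (the node itself is excluded).
        window = tokens[start:i] + tokens[i + 1:i + 1 + right]
        counts.update(window)
    return counts

# Toy data invented for the example (not taken from COCA).
tokens = ("she defended her doctoral dissertation and cited "
          "an unpublished dissertation on syntax").split()

wide = collocates(tokens, "dissertation")          # default window: 4 left, 4 right
premod = collocates(tokens, "dissertation", 1, 0)  # 1 left, 0 right: premodifiers only
print(premod)
```

With the window set to 1 and 0, only the word immediately before the node is counted, which is why this setting isolates premodifiers.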

 

b) committee vs commission

 

When searching for collocates, a lexical restriction can be specified:

 

COLLOCATES         *al      1        0

 

Or a restriction on the part of speech:

 

COLLOCATES         [jj]     1        0

 

 

c) strange vs odd

 

COLLOCATES         [nn]   0        1

 

duck?

 

RESET

 

odd duck

 

LIST

CHART

 

strong language

 

3. Diachronic and diastratic variation

 

facebook

twitter

hypothesize

 

4. Comparing some synonyms in the ACADEMIC section

 

COMPARE

0        1

 

efficient vs effective

strong vs powerful

weak vs feeble

fast vs rapid

 

adverb-verb collocations

0        1

assumed vs supposed

[r*]

 

 

 

5. Comparing syntactic structures

 

Omission of the conjunction THAT introducing a complement clause.

 

5.1. CLAIM THAT pronoun verb

 

ACADEMIC

[claim].[v*] that [p*] [v*]   574

[claim].[v*] [p*] [v*]        385

 

SPOKEN

[claim].[v*] that [p*] [v*]   624

[claim].[v*] [p*] [v*]        1427

 

 

5.2. KNOW THAT THE noun verb

 

ACADEMIC

[know] that the [n*] [v*]   304

[know] the [n*] [v*]        257

 

 

SPOKEN

[know] that the [n*] [v*]   758

[know] the [n*] [v*]        1182

 

 

5.3. CONTEND THAT THE noun

 

ACADEMIC

[contend] that the [nn]   271

[contend] the [nn]          18

 

SPOKEN

[contend] that the [nn]   17

[contend] the [nn]          28
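The raw counts above are easier to compare once they are turned into proportions. The following sketch computes, for each verb and section, the share of hits without THAT among the hits of both queries. The counts are copied from the results listed in 5.1 to 5.3; the function name omission_rate is introduced for the example, and the calculation assumes that the two queries in each pair capture the same construction, which is only approximately true (a query without THAT may also match strings that are not complement clauses).

```python
# Counts copied from sections 5.1 to 5.3: (hits with "that", hits without "that").
counts = {
    "claim":   {"ACADEMIC": (574, 385), "SPOKEN": (624, 1427)},
    "know":    {"ACADEMIC": (304, 257), "SPOKEN": (758, 1182)},
    "contend": {"ACADEMIC": (271, 18),  "SPOKEN": (17, 28)},
}

def omission_rate(with_that, without_that):
    """Share of hits lacking 'that' among all hits of the two queries."""
    return without_that / (with_that + without_that)

rates = {
    verb: {section: omission_rate(*pair) for section, pair in by_section.items()}
    for verb, by_section in counts.items()
}

for verb, by_section in rates.items():
    for section, rate in by_section.items():
        print(f"{verb:8} {section:9} {rate:.1%}")
```

On these figures, the form without THAT accounts for a larger share of hits in SPOKEN than in ACADEMIC for all three verbs (for claim, about 40% in ACADEMIC against about 70% in SPOKEN).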

 

 

6. Using parts of speech

 

Help / Information / Contact

         SEARCHES    Query syntax

                   [pos] More information

                            Click here for a list of these part of speech tags.

 

List of part of speech tags

You can also use part-of-speech tags by selecting them from the drop-down list (click on [POS LIST] to show it).

 

 

 

Syntax      Meaning                               Examples                      Sample matches

[pos]       Part of speech (exact)                [vvg]                         going, using
[pos*]      Part of speech (wildcard)             [v*]                          find, does, keeping, started
[lemma]     Lemma (all forms of a word)           [sing]                        sing, singing, sang
                                                  [tall]                        tall, taller, tallest
[=word]     Synonyms                              [=weak]                       low, tired, soft, vulnerable, etc.
word|word   Any of these words                    stunning|gorgeous|charming    stunning, charming, gorgeous
*xx         Wildcard: * = any number of letters   un*ly                         unlikely, unusually
x?xx        Wildcard: ? = one letter              s?ng                          sing, sang, song
x?xx*                                             s?ng*                         song, singer, songbirds
-word       NOT                                   -[nn*]                        the, in, is
            (followed by PoS, lemma, word, etc.; most useful for "multiple slot" queries, see below)
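The two wildcards can be tried out offline with Python's standard fnmatch module, whose * and ? happen to have the same meaning for plain-letter patterns. This is a sketch on a small hand-made word list, not a reproduction of COCA's search engine; the helper name matches and the word list are assumptions introduced for the example. Note that fnmatch also gives square brackets a special meaning, so the sketch is only suitable for patterns made of letters, * and ?.

```python
from fnmatch import fnmatchcase

def matches(pattern, words):
    """Return the words matching a pattern where * = any number of letters, ? = one letter."""
    return [w for w in words if fnmatchcase(w, pattern)]

# Toy word list: the sample matches from the table plus two non-matching words.
words = ["unlikely", "unusually", "sing", "sang", "song",
         "singer", "songbirds", "string", "unable"]

print(matches("un*ly", words))
print(matches("s?ng", words))
print(matches("s?ng*", words))
```

Since * may also stand for zero letters, s?ng* returns sing, sang and song in addition to singer and songbirds.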

Combinations of preceding (samples)

You can limit to a particular part of speech by adding a period (full stop) and then the part of speech tag in brackets.

word.[pos]        Exact word and part of speech           strike.[v*]       strike (only as a verb)
word*.[pos]       Substring and part of speech            dis*.[vvd]        discovered, disappeared, discussed
[lemma].[pos]     Lemma and part of speech                [strike].[v*]     strike, struck, striking
[=word].[pos]     Synonym and part of speech              [=beat].[v*]      hit, strike, defeat (but not nouns, like rhythm or drumming)

You can add "lemma" to any other type of search, such as synonym or customized list, to see all forms of the matching words. Just use an extra set of brackets.

[[=word]]         Synonym and lemma                       [[=publish]]      announced, circulating, publishes, issue (no part of speech specified, so some noun uses)

You can also choose lemma and part of speech by combining the preceding symbols.

[[=word]].[pos]   Synonym and lemma and part of speech    [[=clean]].[v*]   mop, scrubs, polishing

Multiple "slots": create sequences of words using any of the preceding query types. Note that in each case there is a space between the word "slots" in the query. These are just a few examples from an unlimited number of combinations.

Query                           Sample matches

nooks and crannies              nooks and crannies

fast|quick|rapid [nn*]          fast food
                                rapid transit

pretty -[nn*]                   pretty smart
                                pretty as
                                (but not pretty girl, pretty picture, etc.)

[get] her to [v*]               get her to stay
                                got her to sleep

.|,|; nevertheless [p*] [v*]    . Nevertheless it is
                                ; nevertheless he said
(Notice that punctuation can be used like any "word"; just make sure that it is separated from words by a space.)

[break] the [nn*]               break the law
                                broke the story

[[beat]].[v*] * [nn*]           beat the Yankees
                                beaten to death

[=gorgeous] [nn*]               beautiful woman
                                attractive wife
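How a multiple-slot query is matched against tagged text can be sketched as follows. This is a simplified illustration, not COCA's implementation: it handles only plain words, word|word alternatives, the * wildcard, part-of-speech slots such as [nn*], and negation with a leading hyphen. Lemma and synonym slots are left out because they would need a lemma list and a thesaurus. The function names and the hand-tagged sentence are assumptions introduced for the example.

```python
from fnmatch import fnmatchcase

def slot_matches(slot, word, tag):
    """Test one query slot against one (word, tag) token."""
    if slot.startswith("-"):
        # -slot: any token that does NOT match the rest of the slot
        return not slot_matches(slot[1:], word, tag)
    if slot.startswith("[") and slot.endswith("]"):
        # [nn*]: part-of-speech tag, possibly with a * wildcard
        return fnmatchcase(tag, slot[1:-1])
    # word, word|word, or a wildcard pattern such as * or un*ly
    return any(fnmatchcase(word.lower(), alt) for alt in slot.lower().split("|"))

def search(query, tagged):
    """Return every token sequence that matches the space-separated slots of `query`."""
    slots = query.split()
    hits = []
    for i in range(len(tagged) - len(slots) + 1):
        span = tagged[i:i + len(slots)]
        if all(slot_matches(s, w, t) for s, (w, t) in zip(slots, span)):
            hits.append(" ".join(w for w, _ in span))
    return hits

# Toy sentence with hand-assigned tags, invented for the example.
tagged = [("a", "at1"), ("pretty", "rg"), ("smart", "jj"), ("girl", "nn1"),
          ("likes", "vvz"), ("fast", "jj"), ("food", "nn1"), ("and", "cc"),
          ("pretty", "jj"), ("pictures", "nn2")]

print(search("pretty -[nn*]", tagged))
print(search("fast|quick|rapid [nn*]", tagged))
print(search("[jj] [nn*]", tagged))
```

Each slot is tested against exactly one token, which is why the slots in a query must be separated by spaces.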

 

 

7. Using Google Books through the Google Ngram Viewer