免费的英语语料库汇总
免费的英语语料库汇总
Some are not corpora, but (I think) they are corpus-related. The list is incomplete and just let me know if I omit any corpora.
1. The best corpora
COCA:
BNC-BYU:
TIME-BYU:
JustTheWord:
BNCweb: Jukuu(句酷): for learners
Leeds:
Lextutor:
Web Concordancer:
2. General Corpora
Jiaoda(上海交大): click on “guest” Brown/lob Corpus: Corpuseye:
Corpus swb : BNC:
Bank of English: ANC:
ICE Corpora
3. English-Chinese Parellel Corpora(英汉双语语料库)
CEO:
Babel:
The Dream Of Red Chamber(红楼梦): http://score.crpp.nie.edu.sg/hlm/index.htm
HK Poly U(香港理工大学):
Laozi(老子):
Xiamen U(厦门大学):
4. Textbook Corpora
College English:
New Horizon College English(NHCE):
New Concept English:
Family Album USA:
5. Business and Financial Corpora
Business English Corpus (BEC):
PolyU Business
Corpus:
Business Letter Corpus: Financial Corpus:
6. Literary Corpora
The Online Corpus of Old English Poetry
(OCOEP):
Shakespeare's Sonnets
Corpus:
Blues Lyric Poetry Corpus: (search Catalog).
Canadian Poets Anthology Corpus: (search Catalog). CAPA (contemporary American Poetry
Archive):
Claremont Corpus of Elizabethan Verse: (search Catalog)
Late Modern English Prose Corpus: (search Catalog) New Dragon Book of Verse Corpus : (search
Catalog).
Northwest Coast Indian mythology Corpus: (search Catalog).
Online Classics Horror and Phantasy
Fiction:
SETIS Australian Literary and Historical
Texts:
Corpus of Middle English Prose and
Verse:
Harry Potter Corpus:
Towneley Plays Corpus: (search Catalog)
Web Concordances Site: York Miracle Play Cycle Corpus: (search Catalog) ME Texts Anthology Corpus: (search Catalog)
7. Web As Corpus
Web As Corpus :
Web Corp:
WebCONC: =en&art=google
8. Learner Corpora
Chinese Learners of English(中国英语学习
者): http://www.clal.org.cn/corpus/EngSearchEngine.aspx
Corpus of Hungarian students' essays:
The Multimedia Adult English Learner Corpus:
The Uppsala Student English Corpus (USE):
Dowloadable data at
Michigan Corpus of Upper-level Student
Papers:
IWILL Corpus: Wordneighbours:
PICLE Corpus: EVA Corpus:
PolyU Language Bank Concordancer:
The Montclair Electronic Language Learners' Database under
construction)
Singapore Corpus of Research in Education:
Birkbeck Spelling Error Corpus: (search Catalog) Open Mind Commonsense Corpus: Corpus for Higher Education:
National Taiwan Normal University Corpora:
ELISA corpus: VLC:
9. News Corpora
Reuters Corpus: arpers Magazine 1879-1880 Corpus: (search Catalog).
Hong Kong South China Morning Post
Corpus: (search Catalog)
New York Newspaper Advertisements and News Items
1777-1779:
VOA Special English
Corpus:
VOA Special English audio and text
corpus:
American News Stories Corpus: (search Catalog). MPQA Opinion Corpus: