From SWRC
70 million eojeol Korean text Corpus, POS-annotated Corpus, Tree-annotated Corpus, Korean-Chinese parallel corpus, Korean-English parallel corpus...
Qualified Corpus
Title
| Language
| Size
| Release
| DOWN
|
KAIST Raw corpus
|
| 70,000,000 phrases
| 1997
| DOWN
|
High quality morpho-syntactically annotated corpus
|
| 1,000,000 phrases
| 2000
| DOWN
|
Automatically Analyzed Large Scale KAIST Corpus
|
| 40,000,000 phrases
| 1997
| DOWN
|
Korean Tree-Tagging Corpus
|
| 3,000 sentences
| 1998
| DOWN
|
Tree-tagged corpus 2
|
| 30,000 sentences
| 2000
| DOWN
|
Chinese Tagged corpus
|
| 10,000 sentences
| 2001
| DOWN
|
Chinese-English-Korean multilingual corpus
|
| 60,000 sentences
| 2000
| DOWN
|
Chinese-English multilingual corpus
|
| 60,000 sentences
| 2005
| DOWN
|
Chinese-Korean multilingual corpus
|
| 60,000 sentences
| 2005
| DOWN
|
English-Korean multilingual corpus
|
| 60,000 sentences
| 2005
| DOWN
|
Newspaper corpus (Hankyoreh)
|
| 620 files
| 2005
| DOWN
|
Newspaper corpus (Donga-Korean, English, Japanese, Chinese)
|
| 1791 files
| 2005
| DOWN
|
Processed Resources
Title
| Language
| Size
| Release
| DOWN
|
Nouns Definition of General Vocabulary
|
| 29,038 words (57,391 meaning)
| 2004
| DOWN
|
Noun Definitions Corpus of General Vocabulary
|
| 29,042 words (57,400 meaning)
| 2003
| DOWN
|
Co-occurrence data
|
| 35,731,121 (KOR) 12,504,329 (ENG)
| 2002
| DOWN
|
Alignment Model for Extracting English-Korean Translations of Term constituents
|

| 23,914
| 2003
| DOWN
|
Terminology corpus- medicine
|
| 219,967 sentences
| 2000
| DOWN
|
Terminology corpus- architectural engineering
|
| 3,681 sentences
| 2000
| DOWN
|
Terminology corpus- economics
|
| 27,690 sentences
| 2000
| DOWN
|
Terminology corpus- engineering
|
| 13,627 sentences
| 2000
| DOWN
|
Terminology corpus- physical metallurgy
|
| 7,468 sentences
| 2000
| DOWN
|
Terminology corpus- mechanical engineering
|
| 50,739 sentences
| 2000
| DOWN
|
Terminology corpus- physics
|
| 106,547 sentences
| 2000
| DOWN
|
Terminology corpus- biology
|
| 83,519 sentences
| 2000
| DOWN
|
Terminology corpus- electronic engineering
|
| 12,887 sentences
| 2000
| DOWN
|
Terminology corpus- computer science
|
| 8,679 sentences
| 2000
| DOWN
|
Terminology corpus- chemistry
|
| 66,652 sentences
| 2000
| DOWN
|
Terminology corpus- chemical engineering
|
| 19,546 sentences
| 2000
| DOWN
|
Terminology corpus- environmental engineering
|
| 21,260 sentences
| 2000
| DOWN
|
Terminology corpus- Adverbs and Frequency
|
| 2,556 words
| 2003
| DOWN
|
Terminology corpus- Nouns and Frequency
|
| 4313,557 words
| 2003
| DOWN
|
Terminology corpus- Adjectives and Frequency
|
| 1,680 words
| 2003
| DOWN
|
Terminology corpus- Verbs and Frequency
|
| 20,024 words
| 2003
| DOWN
|
Terminology corpus- Verbal Case Frames
|
| 135,329
| 2003
| DOWN
|
Terminology corpus- co-occurrence
|
| 16,464,054
| 2003
| DOWN
|
One-syllable nouns
|
| 1,025 words
| 2003
| DOWN
|
|
|
|
|
|
Images