An Introduction to The National Language Research Institute: A Sketch of its Achievements
Third Edition (1988) / HTML Version (1997)


II.3.15 Concordance of Vocabulary with Contexts 1: Conversation-Data from the "Recorded Data" Column of "GENGO SEIKATU" ('Linguistic Life')

(Language Processing Data Source 2)
This data source is a concordance, with contexts, of the vocabulary collected from the "Recorded Data" column that appeared in the monthly journal "GENGO SEIKATU" (Chikuma Syobo~ Publishing Co. Ltd.), Numbers 1 through 344. The concordance constitutes a previously unmatched body of data. It covers the 30-year period from 1951 to 1980, contains conversational data on 421 topics, involves participants from a wide range of backgrounds in terms of age, sex, and occupation, and contains slightly fewer than 500,000 running (total) words, including auxiliaries and symbols.

Unlike a simple "index of vocabulary," this "concordance of vocabulary with context" indicates the context in which each vocabulary item was used. It is therefore useful both as research data for language information processing and for linguistic research on the vocabulary, grammar, etc. used in conversation.

The contents of this data source are as follows.

1. Concordance of Vocabulary with Context, microfiche, 79 sheets, 494,956 words (including symbols)
   Original Data from "GENGO SEIKATU," microfiche, 11 sheets
2. Explanatory Pamphlet

NAKANO Hirosi of the Department of Computational Linguistics (Section 1) directed the writing of the Explanatory Pamphlet.
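The difference between a simple index of vocabulary and a concordance with context can be illustrated with a minimal sketch. This is not the procedure used to produce the microfiche; the function names, the sample sentence, and the two-word context window are all assumptions made for illustration, and the sample is English text split on spaces rather than segmented Japanese conversation.

```python
from collections import defaultdict


def build_index(tokens):
    """Simple index: maps each word to the positions where it occurs."""
    index = defaultdict(list)
    for pos, tok in enumerate(tokens):
        index[tok].append(pos)
    return dict(index)


def build_concordance(tokens, width=2):
    """Concordance with context: maps each word to a list of
    (left context, word, right context) entries.

    `width` is a hypothetical parameter: the number of words of
    context kept on each side of the headword.
    """
    concordance = defaultdict(list)
    for pos, tok in enumerate(tokens):
        left = tokens[max(0, pos - width):pos]
        right = tokens[pos + 1:pos + 1 + width]
        concordance[tok].append((" ".join(left), tok, " ".join(right)))
    return dict(concordance)


# Invented sample utterance, for illustration only.
tokens = "well I think so but I am not sure".split()

index = build_index(tokens)
concordance = build_concordance(tokens, width=2)

# The index only says where "I" occurs; the concordance shows how it was used.
print(index["I"])
for left, word, right in concordance["I"]:
    print(f"{left:>10} [{word}] {right}")
```

The index entry records only positions, while each concordance entry carries the surrounding words, which is what makes such a resource usable for studying grammar and usage as well as vocabulary.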
