Conference article

Text Data Mining of English Books on Tourism

Hiromi Ban
Nagaoka University of Technology, Japan

Takashi Oyabu
Kokusai Business Gakuin College, Japan


Published in: KEER2014. Proceedings of the 5th Kansei Engineering and Emotion Research International Conference, Linköping, Sweden, June 11-13

Linköping Electronic Conference Proceedings 100:105, pp. 1255-1263


Published: 2014-06-11

ISBN: 978-91-7519-276-5

ISSN: 1650-3686 (print), 1650-3740 (online)

Abstract

Nowadays, approximately sixteen million Japanese travel abroad, and six million foreigners come to Japan for sightseeing. It is fair to say that this is an age of tourism. Knowledge of tourism has therefore become more and more important, and reading materials in English, which can be considered a global common language, has become indispensable. If readers know the features of English in this field beforehand, the texts become easier to read. In this paper, we investigated several English books on tourism, comparing them with journalism in terms of metrical linguistics. Specifically, the frequency characteristics of character and word appearance were investigated using a program written in C++, and these characteristics were approximated by an exponential function. Furthermore, we calculated the percentage of Japanese junior high school required vocabulary and of American basic vocabulary in each material to obtain its difficulty level, as well as its K-characteristic. As a result, it was clearly shown that English materials for tourism have a tendency similar to literary writings in the characteristics of character appearance. In addition, the values of the K-characteristic for the materials on tourism are high, and the older and more specialized books are more difficult than journalism.
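The measures named in the abstract can be illustrated with a minimal sketch. It is written in Python rather than the authors' C++ and is not their program. The abstract gives no formulas, so the following are assumptions: the K-characteristic is taken to be Yule's K in its common form, K = 10^4 (Σ m² V(m) − N) / N², where V(m) is the number of distinct words occurring m times and N is the total number of words; the exponential approximation is taken to be y = c·exp(−b·rank), fitted by least squares on the logarithm of the frequency; and the difficulty measure is taken to be the share of running words found in a given word list. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and return its word tokens (letters, optional apostrophe part)."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def ranked_frequencies(tokens):
    """Return word counts sorted from most to least frequent."""
    return sorted(Counter(tokens).values(), reverse=True)


def fit_exponential(freqs):
    """Fit y = c * exp(-b * rank) by least squares on log(y); returns (c, b).

    Assumed form: the abstract only says an exponential function was used.
    """
    if len(freqs) < 2:
        raise ValueError("need at least two ranked frequencies")
    xs = list(range(1, len(freqs) + 1))
    ys = [math.log(f) for f in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope


def yule_k(tokens):
    """Yule's K (assumed definition): 10^4 * (sum(m^2 * V(m)) - N) / N^2."""
    n = len(tokens)
    if n == 0:
        raise ValueError("need at least one token")
    # V(m): how many distinct words occur exactly m times.
    freq_of_freq = Counter(Counter(tokens).values())
    s2 = sum(m * m * v for m, v in freq_of_freq.items())
    return 1e4 * (s2 - n) / (n * n)


def coverage(tokens, vocabulary):
    """Percentage of running words that appear in the given word list."""
    if not tokens:
        return 0.0
    hits = sum(1 for t in tokens if t in vocabulary)
    return 100.0 * hits / len(tokens)
```

To use it, pass the text of one book through `tokenize`, then give the tokens to `yule_k`, to `coverage` together with a set of basic words, and to `ranked_frequencies` followed by `fit_exponential`. The same ranking and fit apply to characters if the token list is replaced by a list of letters.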

Keywords

English style analysis; Metrical linguistics; Statistical analysis; Text data mining; Tourism

References

Ban, H., Dederick, T., Nambo, H., & Oyabu, T. (2004a). Metrical comparison of English materials for business management and information technology. Proceedings of the 5th Asia-Pacific Industrial Engineering and Management Systems Conference 2004, Gold Coast, Australia, 33.4.1-33.4.10.

Ban, H., Dederick, T., Nambo, H., & Oyabu, T. (2004b). Stylistic characteristics of English news. Proceedings of the 5th Japan-Korea Joint Symposium on Emotion and Sensibility, Daejeon, Korea, 4 pages.

Ban, H., Dederick, T., & Oyabu, T. (2002). Linguistical characteristics of Eliyahu M. Goldratt’s The Goal. Proceedings of the 4th Asia-Pacific Conference on Industrial Engineering and Management Systems, Taipei, Taiwan, 1221-1225.

Ban, H., Dederick, T., & Oyabu, T. (2003). Metrical comparison of English textbooks in east Asian countries, the U.S.A. and U.K. Proceedings of the 4th International Symposium on Advanced Intelligent Systems, Jeju, Korea, 508-512.

Ban, H., & Oyabu, T. (2005a). Metrical linguistic analysis of English interviews. Proceedings of the 6th International Symposium on Advanced Intelligent Systems, Yeosu, Korea, 1162-1167.

Ban, H., Shimbo, T., Dederick, T., Nambo, H., & Oyabu, T. (2005b). Metrical characteristics of English materials for business management. Proceedings of the 6th Asia-Pacific Industrial Engineering and Management Conference, Manila, Philippines, Paper No. 3405, 10 pages.

Ban, H., Sugata, T., Dederick, T., & Oyabu, T. (2001). Metrical comparison of English columns with other genres. Proceedings of the 5th International Conference on Engineering Design and Automation, Las Vegas, USA, 912-917.

Teikyo University. (2006). Department of Tourism Business Administration. http://www.teikyo-u.ac.jp/en//faculty/economics/017.html.

Yule, G. U. (1944). The Statistical Study of Literary Vocabulary. Cambridge University Press.
