Readings

Most journal articles are available through the IU Library. If the links to these articles don't work, try this link to the generic online journals page.

Lexical differences, multi-lingual ontologies and cross-lingual information retrieval

[PDF] Carpuat, M., Ngai, G., Fung, P. and Church, K. W. (2002). Creating a bilingual ontology: a corpus-based approach for aligning WordNet and HowNet. First Global WordNet Conference.
[PDF] Espinoza, M., Pérez, A. G., and Mena, E. (2008). Enriching an ontology with multilingual information. Fourth European Semantic Web Conference.
[PDF] Fellbaum, C., and Vossen, P. (2007). Connecting the universal to the specific: towards the Global Grid. First International Workshop on Intercultural Collaboration (IWIC 2007).
[PDF] Huang, C-R., Lin, W-Y., Hong, J-F., and Su, I-L. (2008). The nature of cross-lingual lexical semantic relations: a preliminary study based on English-Chinese translation equivalents. Third Global WordNet Conference.
[PDF] Ma, N., Yang, Y., and Rogati, M. (2004). Cross-lingual event tracking. Asia Information Retrieval Symposium (AIRS), 2004.
[PDF] Majid, A., Bowerman, M., van Staden, M., and Boster, J. S. (2007). The semantic categories of cutting and breaking events: A crosslinguistic perspective. Cognitive Linguistics, 18, 133-152.
[PDF] Nguyen, D., Overwijk, A., Hauff, D., Trieschnigg, R. B., Hiemstra, D. and de Jong, F. M. G. (2008). WikiTranslate: Query translation for cross-lingual information retrieval using only Wikipedia. CLEF 2008 Workshop.
[PDF] Oard, D. W. (2009). Multilingual information access. In Encyclopedia of Library and Information Sciences (3rd Ed.), edited by M. J. Bates and M. N. Maack. Taylor and Francis,
[PDF] Ploux, S. and Ji, H. (2003). A model for matching semantic maps between languages (French/English, English/French). Computational Linguistics, 29, 155-178.
[PDF] Pouliquen, B., Steinberger, R., Ignat, C., Käsper, E., and Temnikova, I. (2004). Multilingual and cross-lingual news topic tracking. COLING 2004.
[PDF] Rogati, M. and Yang, Y. (2004). Resource selection for domain specific CLIR. ACM SIGIR 2004.
[MS Word zipped] TECHNION. (2006). Ontology-based knowledge sharing in multilingual eGovernment environments. Quality of Service and Legimitacy in eGovernment, Project IST-507767.
[PDF] Vossen, P., Agirre, E., Calzolari, N., Fellbaum, C., Hsieh, S-K., Huang, C-R., Isahara, H., Kanzaki, K., Marchetti, A., Monachini, M., Neri, F., Raffaelli, R., Rigau, G., Tesconi, M., and VanGent, J. (2008). KYOTO: a system for mining, structuring, and distributing knowledge across languages and cultures. Fourth Global WordNet Conference.
[PDF] Welty, C. and Guarino, N. (2001). Supporting ontological analysis of taxonomic relationships. Data and Knowledge Engineering, 39, 51-74.

Translation

[PDF] Avramidis, E. and Koehn, P. (2008). Enriching morphologically poor languages for statistical machine translation. ACL08, 763-770.
[PDF] Brown, P. F., Della Petra, S. A., Della Petra, V. J., and Mercer, R. L. (1993). The mathematics of statistical machine translation: parameter estimation. Computational Linguistics, 19, 263-311.
[PDF] Brown, R. D. (2001). Transfer-rule induction for example-based translation. Workshop on Example-Based Machine Translation, Machine Translation Summit XIII.
[PDF] Byrne, W., Khudanpur, S., Kim, W., Kumar, S., Pecina, P., Virga, P., Xu, P. and Yarowsky, D. (2003). The Johns Hopkins University 2003 Chinese-English Machine Translation System. MT Summit IX.
[PDF] Cahill, P., Du, J., Berndsen, J., and Way, A. (2009). Using same-language machine translation to create alternative target sequences for text-to-speech synthesis. Interspeech 2009.
[PDF] Carl, M. (2001). Inducing translation grammars from bracketed alignments. Workshop on Example-Based Machine Translation, Machine Translation Summit XIII.
[PDF] Carpuat, M. and Wu, D. (2007). Improving statistical machine translation using word sense disambiguation. EMNLP-CoNLL 2007.
[PDF] Casacuberta, F. and Vidal, E. (2004). Machine translation with inferred stochastic finite-state transducers. Computational Linguistics, 30, 205-225.
[PDF] Cheung, P. and Fung, P. (2005). Translation disambiguation in mixed language queries. Machine Translation, 18, 251-273.
[PDF] Chiang, D. (2007). Hierarchical phrase-based translation. Computational Linguistics, 33, 201-228.
[Word] Collins, B. and Somers, H. (2003). EBMT seen as case-based reasoning. In M. Carl and A. Way (Eds.), Recent advances in example-based machine translation. Dordrecht: Kluwer, pp. 115-153.
[PDF] DeNeefe, S. and Knight, K. (2009) Synchronous tree adjoining machine translation. 2009 Conference on Empirical Methods in Natural Language Processing
[PDF] Etzioni, O., Reiter, K., Soderland, S., and Sammer, M. (2007). Lexical translation with application to image search on the Web. MT Summit XI.
[PDF] Fujita, S. and Bond, J. (2008). A method of creating new valence entries. Machine Translation, 21, 1-28.
[PDF] Fung, P. and Chen, B. (2006). Robust word sense translation by EM learning of frame semantics. ACL/COLING 2006, 239-246.
[PDF] Gamallo, P. (2007). Learning bilingual lexicons from comparable English and Spanish corpora. Machine Translation Summit XI.
[PDF] Huang, F. and Papineni, K. (2007). Hierarchical system combination for machine translation. EMNLP-CoNLL 2007.
[PDF] Graehl, J., Knight, K., and May, J. (2008). Training tree transducers. Computational Linguistics, 34, 391-428.
[RTF] Koehn, P., Och, F. J., and Marcu, D. (2002). Statistical phrase-based translation. DARPA IAO Machine Translation Workshop.
[RTF] Knight, K. (1999). A statistical MT tutorial workbook. Summer 1999 Workshop on Statistical Machine Translation. Center for Language and Speech Processing, Johns Hopkins University.
[PDF] Kraaij, W., Nie, J-Y., and Simard, M. (2003). Embedding Web-based statistical translation models in cross-language information retrieval. Computational Linguistics, 29, 381-419.
[PDF] Kumar, S. and Byrne, W. (2004). Minimum Bayes-risk decoding for statistical machine translation. HLT-NAACL 2004.
[PDF] Li, H. and Li, C. (2004). Word translation disambiguation using bilingual bootstrapping. Computational Linguistics, 30, 1-22.
[PDF] Marcu, D., Wang, W., Echihabi, A., and Knight, K. (2006). SPMT: statistical machine translation with syntactified target language phrases. EMNLP 2006.
[PDF] Mariño, J. B., Banchs, R. E., Crego, J. M., de Gispert, A., Lambert, P., Fonollosa, J. A. R., and Costa-jussá, M. R. (2006). N-gram based machine translation. Computational Linguistics, 32, 527-549.
[PDF] McTait, K. (2001). Linguistic knowledge and complexity in an EMBT system based on translation patterns. Workshop on Example-Based Machine Translation, Machine Translation Summit XIII.
[PDF] Mitkov, R., Pekar, V., Blagoev, D., Mulloni, A. (2007). Methods for extracting and classifying pairs of cognates and false friends. Machine Translation, 21, 29-53.
[PDF] Munteanu, D. S. and Marcu, D. (2006). Improving machine translation performance by exploiting non-parallel corpora. Computational Linguistics, 32, 477-504.
[PDF] Nießen, S. and Ney, H. (2004). Statistical machine translation with scarce resources using morpho-syntactic information. Computational Linguistics, 30, 181-204.
[PDF] Och, F. J. (2003). Minimum error rate training in statistical machine translation. ACL03.
[PDF] Och, F. J. and Ney, H. (2004). The alignment template approach to statistical machine translation. Computational Linguistics, 30, 417-449.
[PDF] Pekar, V., Mitkov, R., Blagoev, D., Mulloni, A. (2006). Finding translations for low-frequency words in comparable corpora. Machine Translation, 20, 247-266
[PDF] Sammer, M. and Soderland, S. (2007). Building a sense-distinguished multilingual lexicon from monolingual corpora and bilingual lexicons. MT Summit XI.
[PDF] Shen, L., Xu, J., and Weischedel, R. (2008). A new string-to-dependency machine translation algorithm with a target dependency language model. ACL08.
[PDF] Shieber, S. (2007). Probabilistic synchronous tree-adjoining grammars for machine translation: the argument from bilingual dictionaries. Workshop on Syntax and Structure in Statistical Translation, NAACL-HLT 2007.
[Word] Somers, H. (2003). An overview of EBMT. In M. Carl and A. Way (Eds.), Recent advances in example-based machine translation. Dordrecht: Kluwer, pp. 3-57.
[PDF] Utsuro, T., Hino, K., Kida, M., Nakagawa, S. and Sato, S. (2004). Integrating cross-lingually relevant news articles and monolingual web documents in bilingual lexicon acquisition. COLING 2004.
[PDF] Zollmann, A. and Venugopal, A. (2006). Syntax augmented machine translation via chart parsing. NAACL 2006, Workshop on Statistical Machine Translation.

Computer-assisted language learning

[PDF] Amaral, L., and Meurers, D. (2007). Conceptualizing student models for ICALL. User Modeling 2007.
[PDF] Nagata, N. (2002). BANZAI: An application of Natural Language Processing to Web based language learning. CALICO Journal, 19, 583-599. Noriko Nagata
[PDF] Sisson, C. (2007). Text-to-speech in vocabulary acquisition and student knowledge models: a classroom study using the REAP intelligent tutoring system. Language Technologies Institute, Carnegie Mellon University, Technical report CMU-LTI-07-009.

Lexical semantics

[PDF] McCarthy, D., Koeling, R., Weeds, J. and Carroll, J. (2007). Unsupervised acquisition of predominant word senses. Computational Linguistics, 33, 553-590.
[PDF] Padó, S. and Lapata, M. (2007). Dependency-based construction of semantic space models. Computational Linguistics, 33, 161-199.
[PDF] Turney, P. D. (2006). Similarity of semantic relations. Computational Linguistics, 32, 379-416.

Bilingual lexicons

[PDF] (username: nlp, password: neno) Dijkstra, A., and Van Heuven, W.J.B. (2002). The architecture of the bilingual word recognition system: From identification to decision. Bilingualism: Language and Cognition, 5, 175-197.
[PDF] French, R., and Jacquet, M. (2004). Understanding bilingual memory: models and data. Trends in Cognitive Sciences, 8, 87-93.
[PDF] Hernandez, A., Lee, P, and MacWhinney, B. (2005). The emergence of competing models in bilingualism. Trends in Cognitive Sciences, 9, 220-225.
[HTML] Kroll, J. F. (2008). Juggling two languages in one mind. Psychological Science Agenda, 22:1.
[HTML] Thomas, M. S. C. and Van Heuven, W. (2005). Computational models of bilingual comprehension. In J. F. Kroll and A. M. B. De Groot (Eds.), Handbook of bilingualism: psycholinguistic approaches. Oxford University Press.

home

calendar

coursework

notes

resources


IU | SoIC

contact instructor