My research interests include computational linguistics, corpus linguistics, NLP (natural language processing) methods for educational assessment and instruction, and computer-assisted language learning. I have also done work in the following areas: computational morphology, computational semantics/pragmatics, corpus construction and analysis, computational stylistics and authorship attribution.
Courses Taught
LING 1330/2330 Introduction to Computational Linguistics
LING 1340/2340 Data Science for Linguists (Spring 2024) (past runs: 2023, 2022, 2021, 2019, 2017)
LING 1682/2682 Introduction to Semantic Theory
LING 1000 Introduction to Linguistics
LING 1930 Applications of Linguistics
LING 1901 Fundamentals of Text Processing for Linguists (Independent Study)
LING 2050 Special Topics in Linguistics: Corpus linguistics
Computational Linguistics
Studying Computational Linguistics at Pitt: a Guide
PyLing (Pitt Python Linguistics Group): a community of computationally minded linguists at Pitt and CMU
Tutorials and Presentations
Python 3 Notes (with annotated MyBringBack.com videos)
Python 2.7 Tutorial (with videos by MyBringBack.com)
Research Projects
Selected Publications
- Jinho D. Choi, Na-Rae Han, Jena D. Hwang, Hansaem Kim. “Penn Korean Universal Dependency Treebank,” Linguistic Data Consortium (LDC) catalog number LDC2023T05, 2023.
- Na-Rae Han. "Transforming Data". In A. Berez-Kroeker, B. McDonnell, E. Koller, L. Collister (Eds.), The Open Handbook of Linguistic Data Management, MIT Press, 2022.
- Ben Naismith, Alan Juffs, Na-Rae Han, Daniel Zheng. "Handle it in-house? Learner corpora frequency lists and lexical sophistication". International Journal of Corpus Linguistics, 1-30, 2022.
- Ben Naismith, Na-Rae Han, Alan Juffs. "The University of Pittsburgh English Language Institute Corpus (PELIC)". International Journal of Learner Corpus Research, 8 (1), 2022.
- Shyam Visweswaran, Jason B Colditz, Patrick O'Halloran, Na-Rae Han, Sanya B Taneja, Joel Welling, Kar-Hai Chu, Jaime E Sidani, and Brian A Primack. "Machine Learning Classifiers for Twitter Surveillance of Vaping: Comparative Machine Learning Study". Journal of Medical Internet Research 22(8), 2020.
- Kanayama, Hiroshi, Na-Rae Han, Masayuki Asahara, Jena D Hwang, Yusuke Miyao, Jinho D Choi, and Yuji Matsumoto. "Coordinate structures in universal dependencies for head-final languages". Proceedings of the Second Workshop on UniversalDependencies (UDW 2018), 2018.
- Naismith, Ben, Na-Rae Han, Alan Juffs, Brianna Hill, and Daniel Zheng.
"Accurate Measurement of Lexical Sophistication with
Reference to ESL Learner Data." Proceedings of Educational Data Mining 2018, 2018.
- Littell, Patrick, Tom McCoy, Na-Rae Han, Shruti Rijhwani, Zaid Sheikh, David Mortensen, Teruko Mitamura, and Lori Levin. "Parser combinators for Tigrinya and Oromo morphology." Proceedings of LREC 2018, 2018.
- Chun, Jayeol, Na-Rae Han, Jena D. Hwang, and Jinho Choi.
"Building Universal Dependency Treebanks in Korean." Proceedings of LREC 2018, 2018.
- Hwang, Jena D., Archna Bhatia, Na-Rae Han, Tim O’Gorman, Vivek Srikumar, and Nathan Schneider.
"Double trouble: the problem of construal in semantic annotation of adpositions." Proceedings of the 6th Joint Conference on Lexical and Computational Semantics, 2017, 178--188.
- Hwang, Jena D., Archna Bhatia, Na-Rae Han, Tim O’Gorman, Vivek Srikumar, and Nathan Schneider.
"Coping with construals in broad-coverage semantic annotation of adpositions." AAAI Spring Symposium on Construction Grammar and NLU, 2017.
- Han, Na-Rae, Joel Tetreault, Soo-Hwa Lee and Jin-Young Ha. "Using an Error-
Annotated Learner Corpus to Develop an ESL/EFL Error Correction System."
Proceedings of the 7th International Conference on Language Resources and Evaluation
(LREC 2010), Malta, 2010.
- Han, Na-Rae, Martin Chodorow, and Claudia Leacock. “Detecting Errors in
English Article Usage by Non-Native Speakers.” Natural Language Engineering: Special
Issue on Educational Applications, 12(2), Cambridge University Press, UK, 2006.
- Han, Na-Ree, Shijong Ryu, Sook-Hee Chae, Seung-yun Yang, Seunghun Lee, and
Martha Palmer. “Korean Treebank Annotations Version 2.0.” Linguistic Data
Consortium (LDC) catalog number LDC2006T09 and ISBN 1-58563-381-X, 2006.
- Han, Na-Rae. “Klex: A Finite-State Transducer Lexicon of Korean.” Proceedings of the 5th International Workshop on Finite-State Methods and Natural Language Processing, Springer LNCS, 2005.
|
|