If you see this when accessing some of my apps, this is NORMALClick 'Yes, get this app back up!'. It might take a few seconds for them to 'wake up' properly.
A new Corpus Query System: CORPUS TEXT EXPLORER version alpha. CORTEX manual.
CORTEX-LANG-DOC: a small web based tool for language documentation
CORTEX-DICTIONARY-LAB: a corpus based dictionary editor
A tokeniser and POS tagger in TreeTagger format for corpus indexing preparation (now only available for Indonesian (very slow), Japanese and English). Check TTR's manual
JL-PRO: Vocabulary profiler for Japanese
An interactive vocabulary profiler for Japanese. Check JL-PRO's manual.
An Interactive Stylometric analyzer. Check Stylo Profiler's manual
paste a youtube link, create a corpus of transcript and comments automatically, sentence split, POS tagged, and sentiment tagged
IPA transcriber for Indonesian
a rule based automatic transcriber. May be inaccurate. use with caution
Audio Splitter (best mode by silence)
segment a long record into sentences
2024
Principal-investigator: DICO-JALF (Diponegoro Corpus of Japanese Learners as a Foreign Language) v.1.0
Co-investigator: Corpus of Indonesian ******
Principal Investigator: Dr. Susi Yuliawati, Unpad, Indonesia (in-progress )
Consultant: Corpus of ******
Principal Investigator: Evynurul Laily Zen, Ph.D, Universitas Negri Malang, Indonesia (in-progress )
2023
Coordinator, co-author, corpus data indexer of Contemporary Indonesian Grammar (2023) -- Access by request to Badan Bahasa
Sponsored by : The ministry of Education Indonesia, Language Development and Cultivation Agency
2023
Annotator, Information distribution and language structure - correlation of grammatical expressions of the noun/verb distinction and lexical information content in Tagalog, Indonesian and German
Sponsored by : Deutsche Forschungsgemeinschaft (DFG)
Principal investigators: Prof. Dr. Gerhard Heyer (Universität Leipzig) and Professor Dr. Nikolaus Himmelmann (Universität zu Köln), German
2022
Corpus Data Collector: KOIN (Korpus Indonesia), mass media subset (2022) -- currently server is down
Sponsored by : The ministry of Education Indonesia, Language Development and Cultivation Agency
2021
Co-investigator: ICNALE (International corpus network of Asian learners of English) : rating- completed
Principal Investigator: Dr. Shinichiro Ishikawa, Kobe University, Japan
2018
Co-investigator: ICNALE (International corpus network of Asian learners of English) : spoken dialogue
Principal Investigator: Dr. Shinichiro Ishikawa, Kobe University, Japan
2015
Co-investigator: ICNALE (International corpus network of Asian learners of English) : spoken monologue
Principal Investigator: Dr. Shinichiro Ishikawa, Kobe University, Japan
2014
Co-investigator: DICORA (Digital language and content research association) sentiment analysis corpus - completed (2014)
Principal Investigator: Prof. Nam Jeesun, Hankuk University of Foreign Studies, Korea
2024
Principal Investigator: Indonesian Clitic separator and updated TreeTagger parameter file for Indonesian
Principal investigator: Prihantoro, Ph.D
2023
Co-investigator: Information distribution and language structure - correlation of grammatical expressions of the noun/verb distinction and lexical information content in Tagalog, Indonesian and German (2023)
Principal investigators: Prof. Dr. Gerhard Heyer (Universität Leipzig) and Professor Dr. Nikolaus Himmelmann (Universität zu Köln), German
2023
Principal Investigator: Indonesian Word Sketch grammar for use in Sketch Engine
Principal investigator: Prihantoro, Ph.D
2022
Principal Investigator: TreeTagger parameter file for Indonesian POS tagging and headword annotation (multi word unit support available); in addition to be used by TreeTagger, it is also used by Sketch Engine, LancsBox and CQPweb Lancaster; docker available with the help of UCREL people (2022). Full Package Indonesian TreeTagger
Principal Investigator: Prihantoro, Ph.D
2021
SANTI-morf (Sistem ANalisis Teks Indonesia - morfologi) : a morphological annotation system for Indonesian texts (alpha version)
Principal Investigator: Prihantoro, Ph.D
SANTI-pos (Sistem ANalisis Teks Indonesia - POS tagger) : a POS tagger for Indonesian texts (alpha version) (2021)
Principal Investigator: Prihantoro, Ph.D
...