I am a Postdoctoral Research Fellow at MBZUAI. Previously, I did my PhD at Unimelb, fortunately supervised by Prof. Timothy Baldwin and Dr. Jey Han Lau, and sponsored by Australia Awards scholarship. I was also an applied scientist intern at Amazon.

  • Fajri Koto
  • Research Fellow
  • Natural Language Processing
  • Abu Dhabi, UAE
  • fajri.koto91@gmail.com
  • fajri91

What's New?

  • 2023-09-05: Paper accepted to AACL 2023

    2023-08-30: We release JAIS and JAIS-chat, the largest Arabic LLM

    2023-05-10: Paper accepted to ACL 2023

    2023-05-06: We won the Outstanding Paper Award for EACL 2023

    2023-01-22: Paper accepted to EACL 2023

    2022-12-06: I will attend EMNLP in person

    2022-10-22: I join MBZUAI as a new Postdoctoral Research Fellow

    2022-09-15: Invited Talk to University of Toronto, University of Queensland, and Binus University

    2022-09-15: 1st Place of ALTA 2022 shared task

    2022-09-07: Paper accepted to CODI at COLING 2022

    2022-08-16: Paper accepted to COLING 2022

    2022-06-28: I'll be one of the keynote panels at ACL 2022, Dublin, Ireland

    2022-05-14: We won the Best Paper Award for CSRR 2022 and I got married at the same day

    2022-04-04: Paper accepted to ECNLP at ACL 2022

    2022-03-28: Paper accepted to CSRR at ACL 2022

    2022-01-23: Paper accepted to ACL 2022

    2021-12-14: Paper accepted to JAIR 2022

    2021-10-15: 2nd Place of ALTA 2021 shared task

    2021-09-26: Paper accepted to EMNLP 2021

    2021-08-21: I'll be the speaker in DaTalk, Jakarta Artificial Intelligence Research

    2021-08-02: I'll join Amazon as Applied Scientist Intern

    2021-05-06: Paper accepted to Findings of ACL 2021

    2021-03-28: Selected as one of Nominee for Data Researcher, Data Science Indonesia Award

    2021-03-10: Paper accepted to NAACL 2021

    2021-01-12: Paper accepted to EACL 2021

    2021-01-08: I'll be the speaker in INACL Webinar BEDAH PAPER #15

    2020-12-16: I'll be the speaker in IR-NLP Talk, Fasilkom, Universitas Indonesia

    2020-09-30: Paper accepted to COLING 2020

    2020-09-11: Paper accepted to AACL-IJCNLP 2020

    2020-09-05: Paper accepted to PACLIC 2020

    2019-12-03: I'll be the speaker in NLP Sydney Meetup

    2019-11-20: Paper accepted to ALTA 2019

    2019-07-31: I pass my confirmation, officially a PhD Candidate.

    2018-07-22: I officially started my PhD

Academic History

  • PhD of Computer Science2018 - 2022

    The University of Melbourne

    Fully funded program by Australia Awards Scholarship
    PhD Thesis: "From Discourse and Keyphrases, to Language Modeling in Automatic Summarization"
    Advisor : Prof. Timothy Baldwin and Jey Han Lau, Ph.D.

  • Master of Computer Science2013 - 2014

    Universitas Indonesia

    Graduated with Cum Laude (first class honor)
    Final thesis: "A Comparative Study over Twitter Sentiment Analysis: Which Features are Good?"
    Advisor : Mirna Adriani, Ph.D.

  • Bachelor of Computer Science2009 - 2013

    Universitas Indonesia

    Graduated with Cum Laude (first class honor)
    Final thesis: "Touch Sensor based Keyboard Driver using AVR ATxmega 256 A3BU"
    Advisor : Bob Hardian, Ph.D.


  • 2022: Outstanding paper award of EACL 2023

    2022: Rank 1st of ALTA 2022 shared task

    2022: Best paper award of CSRR 2022 at ACL

    2021: Rank 2nd of ALTA 2021 shared task

    2021: Data Science Indonesia Award: Nominee for Data Researcher

    2021: FEIT Unimelb conference travel scholarship for attending EACL, NAACL, ACL, EMNLP as presenter

    2020: MSE Unimelb conference travel scholarship for attending AACL, COLING as presenter

    2017: Australia Awards Scholarship (out of 5300+ applicants), estimated total of awards for PhD: A$358,000

    2014: Best session presenter at ICACSIS (International Conference on Advanced Computer Science and Information System 2014)

    2014: Cum Laude Award (Top 5), Graduation of Master Degree in Faculty of CS, Universitas Indonesia

    2013: Cum Laude Award (Top 5), Graduation of Bachelor Degree in Facutly of CS, Universitas Indonesia

    2013: Awardee of Japan Student Services Organization (JASSO), Summer research internship at NAIST, Japan

    2012: Awardee of Fast track DIKTI (Minister of Higher Education) scholarship, Bachelor + Master degree at Universitas of Indonesia

    2009: PPKB award for high-achiever high school students to get admitted to Universitas Indonesia without national exam

    2008: Rank 7th (out of 1000+), High School Math competition, West Sumatra province, Indonesia

    2007: Semi-finalist, Junior High School Physic competition, West Sumatra province, Indonesia

    2006: Top 200 (0.5%) National (out of 40,000+), Junior High School Math competition, PASIAD, Indonesia

Working Experience

  • Postdoctoral Research Fellow2022 - present


    Working on Natural Language Processing with Prof. Timothy Baldwin.
    I also collaborate on several projects with Prof. Iryna Gurevych (TU Darmstadt, Germany)

  • Applied Scientist Intern2021 - 2022

    Amazon, AUSTRALIA

    Working on NLP and Computer Vision. Projects: 1) multimodal language generation system; 2) information extraction system.
    Advisors: Prof. Chunhua Shen and Prof. Anton van den Hengel.

  • Tutor2020 - 2021

    School of CIS, University of Melbourne, AUSTRALIA

    a. Natural Language Processing COMP90042 (Semester 1, 2020) - Dr. Jey Han Lau
    b. Natural Language Processing COMP90042 (Semester 1, 2021) - Dr. Jey Han Lau

  • Data Scientist2016 - 2017


    Mainly working on a spam detection system that is integrated in BBM, Vidio, and Liputan6. Another responsibilities include data migration, tracking system, and logs extraction in AWS and GCP.
    Manager: Hafiz Badrie Lubis

  • Research Engineer2014 - 2016

    Samsung Research Institute INDONESIA

    Delivering 3 global (US) and 1 local (ID) patents.
    Manager: Agus Kurniawan

  • Research InternSummer 2013

    Nara Institute of Science and Technology (NAIST), JAPAN

    Working on Speech Technology at AHC Labs , with research topics: 1) Speech Summarization and 2) Quote Detection on Speech
    Advisors : Dr. Sakriani Sakti, Dr. Graham Neubig, Prof. Tomoki Toda, and Prof. Satoshi Nakamura

  • Teaching Assistant2010 - 2013

    Faculty of Computer Science, University of Indonesia

    a. Calculus 1 (Fall 2010) - Dr. Kasiyah
    b. Private Tutor (Spring 2011) - NA
    c. Database (Fall 2011) - Dr. Ika Alfina
    d. Discrete Math 1 (Spring 2012) - Prof. Belawati H. Widjaja, Ph.D
    e. Private Tutor (Spring 2012) - NA
    f. Statistic and Probability (Fall 2012) - Dr. Ika Alfina
    g. Theory of Language and Automata (Spring 2013) - Dr. Dina Cahyati
    h. Statistic and Probability (Fall 2013) - Dr. Ika Alfina

  • Android DeveloperSummer 2012

    PT Astra International, INDONESIA

Invited Talks

  • 10-2022: Binus University (Guest Lecture), "NLP for Indonesian Languages: The Current States and Future Works"

    09-2022: University of Toronto, "Domain-Adaptive Pretraining in Indonesian Languages: The Current State, Challenges, and Opportunities"

    09-2022: University of Queensland, "Can Pretrained Language Models Generate Persuasive, Faithful, and Informative Ad Text for Product Descriptions?"

    05-2022: Keynote Panel of ACL 2022, "Supporting Linguistic Diversity"

    06-2022: Indonesian Association for Computational Linguistics, "Scientific Article Writing"

    11-2021: University of Indonesia (Guest Lecture), "Indonesian NLP with Pretrained Language Models: State of the Art

    08-2021: DaTalk, Jakarta Artificial Intelligence Research, "Natural Language Understanding Benchmark across Languages"

    01-2021: Indonesian Association for Computational Linguistics, "IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP"

    12-2020: University of Indonesia (IR-Lab), "Document Summarization in Indonesian Text: Resources and Benchmark Model"

    10-2020: Data Science Indonesia, "IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP"

    12-2019: NLP MeetUp at Sydney, Australia, "Improved Document Modelling with a Neural Discourse Parser"

    12-2016: Wrangle Conference, Malaysia, "Spam Text and Video Detection System at Indonesian Media Company"

Academic Services

Student Supervision

  • Andrew Shen, BSc Student, 2021: Discourse Analysis - co-supervised with Prof. Timothy Baldwin and Dr. Jey Han Lau. Currently a Master student at CMU.

International Publications

Link: Research Gate and Google Scholar. * indicates equal contribution.