Personal profile
Biography
Dr. Tasnim Mohiuddin is a Scientist in the Arabic Language Technologies (ALT) group at Qatar Computing Research Institute (QCRI), where he works on pioneering initiatives in the development and enhancement of Large Language Models (LLMs). He is a core contributor to the creation of Fanar, QCRI's flagship LLM. Apart from training LLMs, his current research focuses on domain-specific LLM innovations, including specialized areas such as Code LLMs and Multimodal LLMs.
Before joining QCRI, Dr. Tasnim was a researcher at Huawei Research Center in Singapore, where he concentrated on Multimodal Representation Learning, bridging both linguistic and visual modalities. His research contributions have been consistently recognized through publications in leading academic conferences.
Tasnim completed his doctoral studies at the School of Computer Science and Engineering, Nanyang Technological University (NTU), Singapore, receiving his Ph.D. in June 2022 under the mentorship of Professor Shafiq Joty. His doctoral dissertation was honored with the prestigious Outstanding Ph.D. Thesis Award, reflecting the originality and impact of his research. During his doctoral studies, he also gained valuable experience as a research intern at Meta AI Research, working under the guidance of Professor Philipp Koehn.
Research Interests
- Agents, Tools, and Planning: Investigating the interplay of intelligent agents, tool-augmented reasoning, and strategic planning to enhance the problem-solving capabilities of language models in complex, real-world tasks.
- Pre-training and Post-training Strategies for LLMs: Developing innovative methodologies to enhance the efficiency, scalability, and performance of LLMs through advanced pre-training techniques and strategic post-training refinement, aiming to optimize model capabilities, generalizability, and domain-specific adaptability.
- Multimodal Language Models: Advancing the integration of textual, visual, and auditory modalities to develop sophisticated models capable of holistic, context-aware understanding and generation.
- Code Language Models: Designing and refining specialized LLMs for programming languages, focusing on generating accurate, efficient, and optimized code to address computational and algorithmic challenges.
Experience
|
Years |
Position |
Department |
University/Institution |
|
2024 - Present |
Scientist |
Arabic Language Technologies |
Qatar Computing Research Institute |
|
2022 - 2023 |
Scientist |
Poisson Lab |
Huawei Research, Singapore |
|
2021 - 2021 |
Research Intern |
NLLB |
Meta AI Research |
|
2015 – 2017 |
Lecturer |
Computer Science |
United International University |
|
2014 – 2015 |
Software Engineer |
- |
SSD-Tech |
Education
|
2022 |
PhD in Computer Science |
Nanyang Technological University (NTU) |
|
2014 |
BS in Computer Science |
Bangladesh University of Engineering and Technology (BUET) |
Fingerprint
- 1 Similar Profiles
Collaborations and top research areas from the last five years
-
LNMAP: Departures from isomorphic assumption in bilingual lexicon induction through non-linear mapping in latent space
Mohiuddin, T., Bari, M. S. & Joty, S., 2020, EMNLP 2020 - 2020 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 2712-2723 12 p. (EMNLP 2020 - 2020 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
36 Link opens in a new tab Citations (Scopus) -
AUGVIC: Exploiting BiText Vicinity for Low-Resource NMT
Mohiuddin, T., Bari, M. S. & Joty, S., 2021, Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021. Zong, C., Xia, F., Li, W. & Navigli, R. (eds.). Association for Computational Linguistics (ACL), p. 3034-3045 12 p. (Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Open Access6 Link opens in a new tab Citations (Scopus) -
UXLA: A robust unsupervised data augmentation framework for zero-resource cross-lingual NLP
Bari, M. S., Mohiuddin, T. & Joty, S., 2021, ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference. Association for Computational Linguistics (ACL), p. 1978-1992 15 p. (ACL-IJCNLP 2021 - 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing, Proceedings of the Conference; vol. 1).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
Open Access17 Link opens in a new tab Citations (Scopus) -
Revisiting adversarial autoencoder for unsupervised word translation with cycle consistency and improved training
Mohiuddin, T. & Joty, S., 2019, Long and Short Papers. Association for Computational Linguistics (ACL), p. 3857-3867 11 p. (NAACL HLT 2019 - 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference; vol. 1).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
29 Link opens in a new tab Citations (Scopus) -
DM-Codec: Distilling Multimodal Representations for Speech Tokenization
Ahasan, M. M., Fahim, M., Mohiuddin, T., Mahbubur Rahman, A. K. M., Chadha, A., Iqbal, T., Ashraful Amin, M., Islam, M. M. & Ali, A. A., Nov 2025, EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2025. Christodoulopoulos, C., Chakraborty, T., Rose, C. & Peng, V. (eds.). Association for Computational Linguistics (ACL), p. 25580-25602 23 p. (EMNLP 2025 - 2025 Conference on Empirical Methods in Natural Language Processing, Findings of EMNLP 2025).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review