Mr. Yifan Zhang

    Calculated based on number of publications stored in Pure and citations from Scopus
    20142021

    Research activity per year

    Personal profile

    Research Interests

    Software Engineering
    Generative AI, LLM, ASR, TTS

    Experience

    Experience

    Senior Software Engineer, QCRI 2013-present

    Developed QATS speech recognition system, QAT Transcription and Translation system, live speech

    recognition, and video search platform (NodeJS, React, Python, Django, C++, KALDI, MySQL)

    Organize MGB Arabic ASR Challenge – provide technical support for MGB Arabic ASR Challenge; support

    data preparation and test for challenge (Python, KALDI)

    Created QCRI news translation system and led team to win BBC ‘Best in Show’ competition – create the

    idea for QAT Transcribe and Translation system, led the development team for Hackathon (Bash, Python)

    Implemented QCRI Arabic Transcription System with in-house ASR system, worked closely with Al Jazeera

    Evaluate ASR and initiated in-house ASR system development (Python, Django, KALDI)

    Consultant, QCRI 2012-2013

    Consultancy on speech recognition for Al Jazeera – worked on EML speech recognition system, explored

    KALDI for in-house speech recognition system (HTK, Bash, Python, KALDI)

    Consultancy on architecture and design for BORAQ (later Jalees) EReader app (Objective-C)

    Redesign and developed improved EReader app (Objective-C)

    Backend Engineer, Wavii Inc, (a Google company) 2012-2013

    As backend NLP Software Engineer, I work on the information retrieval component of the system, research

    and implement new algorithms, improve system performance: Work on core information extraction

    algorithms (Python, NLP, NLTK, R)

    Work on components integrate backend and frontend information (Python, django, MySQL, Chef)

    Improve overall system performance, implement internal requests (Python, AWS, MySQL, Chef, Pyramid,

    Javascript)

    Senior Research Engineer, SpinVox Ltd. (a Nuance company) 2008-2012

    As a Senior Research Recognition Engineer in SpinVox, I have worked with the team to develop the

    speech recognition from ground to be the core part of the company. I contributed in acoustic training, in

    house development, experiments, and largely decoder development and optimization. Highlights are:

    Analyze data and conduct experiments to improve core speech recognition system (Bash/Python/Perl)

    Analyze and implement state-of-art techniques (vtln, gender, hlda, lexicon tree, combination) integrate

    into current system (C++/C) Optimize both algorithm (Gaussian selection, computation cache, search)

    and code (SIMD, cache, compilation) to achieve better performance (C++/C)

    Design and implement features API, architecture such as shared memory, cached resource, multi-processsharing, exception-safe and etc. for our application (C++) Provide solutions, frameworks and supports for

    system training, experiments, research and development in parallel environment (Bash/Python/Perl)

    R&D Engineer, Virage Softsound (Autonomy, a HP company) 2005-2008

    As Research and Development Engineer, I quickly integrated into a small team and developed Mandarin

    speech recognition system, co-developed other languages such as Romanian, Arabic and Japanese

    system, in house application development and software development, as well as research environment

    management.

    Highlights are: Build speech recognition system acoustic model and language model (Bash/Perl) Analyze

    and evaluate data, models and techniques (Bash/Perl/Python) Develop in-house applications such as

    text segmentation, dictionary generation, normalization and web crawlers (C/Python/Perl) Develop C

    library for error handling, memory, data structure, threading and so on (C) Work on projects including IIVR,

    transcript server (C++/Multi-threaded) Manage research environment of Linux and GridEngine (Bash)

    TianChuang Software 2001-2002

    As Lead Software Engineer, I led the development of bespoke Internet Information Management system

    and mentor other graduate developers. Highlights are: design and develop Internet information

    management system (ASP/SQL)

    mentor teammates and integrate team member’s work into final product (ASP, SQL)

    Education/Academic qualification

    Electronic Engineering, Master, Cardiff University

    22 Sept 200313 Sept 2004

    Award Date: 18 Mar 2005

    Information Engineering, Bachelor, Bachelar of Engineering, Xi'an Jiaotong University

    1 Sept 19971 Jul 2001

    Award Date: 1 Jul 2001

    Fingerprint

    Dive into the research topics where Yifan Zhang is active. These topic labels come from the works of this person. Together they form a unique fingerprint.
    • 1 Similar Profiles
    • A Second Pandemic? Analysis of Fake News About COVID-19 Vaccines in Qatar

      Nakov, P., Alam, F., Shaar, S., da San Martino, G. & Zhang, Y., 2021, International Conference Recent Advances in Natural Language Processing, RANLP 2021: Deep Learning for Natural Language Processing Methods and Applications - Proceedings. Angelova, G., Kunilovskaya, M., Mitkov, R. & Nikolova-Koleva, I. (eds.). Incoma Ltd, p. 1010-1021 12 p. (International Conference Recent Advances in Natural Language Processing, RANLP).

      Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

      Open Access
      14 Citations (Scopus)
    • QAT2 - the QCRI advanced transcription and translation system

      Abdelali, A., Ali, A., Guzmán, F., Stahlberg, F., Vogel, S. & Zhang, Y., 2015, In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. 2015-January, p. 726-727 2 p.

      Research output: Contribution to journalConference articlepeer-review

      2 Citations (Scopus)
    • Prta: A system to support the analysis of propaganda techniques in the news

      da San Martino, G., Shaar, S., Zhang, Y., Yu, S., Barrón-Cedeño, A. & Nakov, P., 2020, ACL 2020 - 58th Annual Meeting of the Association for Computational Linguistics, Proceedings of the System Demonstrations. Association for Computational Linguistics (ACL), p. 287-293 7 p. (Proceedings of the Annual Meeting of the Association for Computational Linguistics).

      Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

      52 Citations (Scopus)
    • COVID-19 in Bulgarian Social Media: Factuality, Harmfulness, Propaganda, and Framing

      Nakov, P., Alam, F., Shaar, S., da San Martino, G. & Zhang, Y., 2021, International Conference Recent Advances in Natural Language Processing, RANLP 2021: Deep Learning for Natural Language Processing Methods and Applications - Proceedings. Angelova, G., Kunilovskaya, M., Mitkov, R. & Nikolova-Koleva, I. (eds.). Incoma Ltd, p. 997-1009 13 p. (International Conference Recent Advances in Natural Language Processing, RANLP).

      Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

      Open Access
      23 Citations (Scopus)
    • Tanbih: Get to know what you are reading

      Zhang, Y., Da San Martino, G., Barrón-Cedeño, A., Romeo, S., An, J., Kwak, H., Staykovski, T., Jaradat, I., Karadzhov, G., Baly, R., Darwish, K., Glass, J. & Nakov, P., 2019, EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations. Association for Computational Linguistics (ACL), p. 223-228 6 p. (EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing, Proceedings of System Demonstrations).

      Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

      Open Access
      26 Citations (Scopus)