• 199 Citations
  • 6 h-Index
1994 …2020

Research output per year

If you made any changes in Pure these will be visible here soon.

Personal profile


Most Significant Research Contributions

My research contribution is centered on the application of theoretical linguistics to specific real-world problems, in particular to automated text-based question answering and summarisation. Since 2009 I have focused on medical text processing.

I joined the University of Zurich and became the principal researcher in the ExtrAns and WebExtrAns projects (from 1996 to 2001). Both projects were based on the development of answer extraction systems. Answer extraction systems locate those sentences in the source text that contain the answer posed by the user. The outcome of the project was a working system that handles questions about 500 Linux/Unix documentation documents ("manpages"). I was the principal designer of the overall system and the integration of all modules. I was also the principal contributor to the design of the logical forms and the question-answering method that used the logical forms. The success of ExtrAns and WebExtrAns is evident from the fact that the system is cited as a pioneering question-answering system, an example of a question answering system of technical domains, and an example of the use of logical forms for question answering.

In Macquarie University I established the AnswerFinder project. AnswerFinder is a question answering system that combines the use of logical information (inspired from ExtrAns and WebExtrAns), state-of-the-art approaches in question answering, and innovative graph-based machine learning methods to find the exact answer to the user question. AnswerFinder has participated in the question answering track of the Text REtrieval conference (TREC), the main international forum for the evaluation of question-answering systems, between 2003 and 2006. AnswerFinder is in fact the only Australian-based question answering system that has participated in the question answering track of TREC.

Since 2009 I have led various projects related to the development and application of text-processing technologies that help the medical doctor find and appraise clinical evidence found in the vast resources of medical publications. I have gathered a summarisation corpus sourced from the Journal of Family Practice, and used the corpus to develop and evaluate techniques for text summarisation, clustering, keyword exraction, and appraisal of the medical evidence.


Fingerprint Dive into the research topics where Diego Molla-Aliod is active. These topic labels come from the works of this person. Together they form a unique fingerprint.

  • 14 Similar Profiles

Network Recent external collaboration on country level. Dive into details by clicking on the dots.


Research Outputs

Classification betters regression in query-based multi-document summarisation techniques for question answering: Macquarie University at BioASQ7b

Mollá, D. & Jones, C., 2020, Machine Learning and Knowledge Discovery in Databases: International Workshops of ECML PKDD 2019, Proceedings. Cellier, P. & Driessens, K. (eds.). Cham, Switzerland: Springer, Springer Nature, p. 624-635 12 p. (Communications in Computer and Information Science; vol. 1168 CCIS).

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

  • Combination of loss functions for deep text classification

    Hajiabadi, H., Molla-Aliod, D., Monsefi, R. & Yazdi, H. S., Apr 2020, In : International Journal of Machine Learning and Cybernetics. 11, 4, p. 751-761 11 p.

    Research output: Contribution to journalArticle

  • Overview of the ALTA 2019 shared task: sarcasm target identification

    Molla, D. & Joshi, A., 2019, Proceedings of the 17th Annual Workshop of the Australasian Language Technology Association. Mistica, M., Piccardi, M. & MacKinlay, A. (eds.). Melbourne, VIC: Australasian Language Technology Association, p. 192–196 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

    Open Access

    Macquarie University at BioASQ 6b: deep learning and deep reinforcement learning for query-based multi-document summarisation

    Molla, D., 2018, The 6th BioASQ Workshop: A challenge on large-scale biomedical semantic indexing and question answering: Proceedings of the Workshop. Stroudsburg: Association for Computational Linguistics, p. 22-29 8 p.

    Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

  • Overview of the 2018 ALTA Shared Task: classifying patent applications

    Molla, D. & Seneviratne, D., Dec 2018, Proceedings of the Australasian Language Technology Association Workshop 2018. Kim, S. M. & Zhang, X. J. (eds.). Association for Computational Linguistics (ACL), p. 84-88 5 p.

    Research output: Chapter in Book/Report/Conference proceedingConference proceeding contribution

    Open Access