Comparative Analysis of Retriever and Reader for Open Domain Questions Answering on BPS Knowledge in Indonesian

Sulisetyo Puji Widodo

doi:10.34123/icdsos.v2023i1.384

Authors

Sulisetyo Puji Widodo, Badan Pusat Statistik

DOI:

https://doi.org/10.34123/icdsos.v2023i1.384

Keywords:

Knowledge, Retriever, Reader, BPS, Open-domain question answering

Abstract

Enumerators from Badan Pusat Statistik (BPS) still often struggle to find solutions to cases they encounter during censuses or surveys. Although knowledge lists have been created and collected in various systems, such as QA and knowledge management systems, enumerators still have to locate the appropriate answer within long and complex knowledge search results. Open-domain Question Answering (OpenQA), by contrast, can identify answers to natural-language questions from large-scale document collections. OpenQA has two main components: a Retriever and a Reader. For the Retriever task, Dense Retrieval (DR) has been reported to outperform traditional sparse retrieval methods such as TF-IDF and BM25, while other research finds that BM25 achieves higher accuracy than DR. In this study, we compare DR and BM25 separately, as well as a DR+BM25 combination, as the Retriever. We also combine and evaluate several enhanced language models as Readers. In this way, the best-performing combination of Retriever and Reader can be identified for implementation in search systems such as QA and knowledge management systems.
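
As a rough illustration of the three Retriever settings compared above (BM25 alone, DR alone, and DR+BM25), the sketch below scores a toy corpus with Okapi BM25, scores it again with cosine similarity over embeddings, and ranks documents by a weighted sum of the two normalized scores. Everything in it is an assumption for illustration only: the abstract does not say how DR and BM25 are combined, so the weighted sum, the `alpha` weight, the `k1` and `b` values, the whitespace tokenizer, the English toy documents, and the hand-written two-dimensional "embeddings" are all hypothetical stand-ins rather than the study's method or data.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer (assumption; a real system would use
    # a tokenizer suited to Indonesian text).
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score every document against the query with Okapi BM25."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(t) for t in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for t in tokenized for term in set(t))
    scores = []
    for tokens in tokenized:
        tf = Counter(tokens)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(tokens) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def dense_scores(query_vec, doc_vecs):
    """Cosine similarity between a query embedding and each document embedding."""
    def cosine(u, v):
        dot = sum(x * y for x, y in zip(u, v))
        nu = math.sqrt(sum(x * x for x in u))
        nv = math.sqrt(sum(y * y for y in v))
        return dot / (nu * nv) if nu and nv else 0.0
    return [cosine(query_vec, v) for v in doc_vecs]


def min_max(values):
    # Rescale scores to [0, 1] so sparse and dense scores are comparable.
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def hybrid_rank(sparse, dense, alpha=0.5):
    """Rank document indices by a weighted sum of normalized scores."""
    s, d = min_max(sparse), min_max(dense)
    combined = [alpha * x + (1 - alpha) * y for x, y in zip(s, d)]
    return sorted(range(len(combined)), key=lambda i: combined[i], reverse=True)


# Toy corpus and query (hypothetical, not BPS data).
docs = [
    "household survey enumeration guide",
    "census questionnaire for household members",
    "price statistics release schedule",
]
query = "household census"

# Hand-written vectors standing in for a dense encoder's output (assumption).
query_vec = [1.0, 0.2]
doc_vecs = [[0.6, 0.8], [0.9, 0.1], [0.0, 1.0]]

sparse = bm25_scores(query, docs)
dense = dense_scores(query_vec, doc_vecs)
ranking = hybrid_rank(sparse, dense, alpha=0.5)
print(ranking)
```

Setting `alpha=1.0` reduces the ranking to BM25 alone and `alpha=0.0` to the dense scores alone, which makes the three settings directly comparable under a single evaluation loop.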

Published

2023-12-29

How to Cite

Widodo, S. P. (2023). Comparative Analysis of Retriever and Reader for Open Domain Questions Answering on BPS Knowledge in Indonesian. Proceedings of The International Conference on Data Science and Official Statistics, 2023(1), 337–343. https://doi.org/10.34123/icdsos.v2023i1.384

Issue

Vol. 2023 No. 1 (2023): Proceedings of 2023 International Conference on Data Science and Official Statistics (ICDSOS)

Section

Data Science
