Research

This page summarizes my research activities, including publications in academic journals and conference papers. More to be added as the time passes by....

This page summarizes my research activities, including publications in academic journals and conference papers. More to be added as the time passes by....


This page overviews my research activities:



Paper

2024

  • Abhishek Singh, Krishnendu Ghosh
    Curriculum-based Question Generation for Mathematics and Science
    ArXiv preprint.
  • Abstract: The purpose of this study is to leverage the use of Natural Language Processing by using Large Language Models (LLMs) so that we can generate questions with respect to the curriculum for Mathematics and Science subjects. To achieve this, we implemented two principal methodologies from the field of Generative-AI: Fine-tuning and Retrieval Augmented Generation (RAG) are two of these approaches. The Fine-tuning approach which we are using is Transfer-Learning using unsloth, which offers two types of Hugging Face's trainers for fine-tuning in its \href{https://github.com/unslothai/unsloth#finetune-mistral-gemma-llama-2-5x-faster-with-70-less-memory}{GitHub repository}: Direct Performance Optimizer (DPO) and Supervised QA Fine-tuning using SFT Trainer. We have used the SFT Trainer, which requires reward modelling or reinforcement learning and works based on labelled data to fine-tune the pre-trained LLM for specific tasks \cite{li2024getting}. To generate curriculum-based questions, we have used a supervised and labelled dataset of Mathematics questions \cite{hendrycksmath2021}. In contrast, for Science, we have used NCERT (National Council of Educational Research and Training) books issued by CBSE (Central Board of Secondary Education), a fine-tuned model for structured and supervised mathematical questions data, while RAG model for books' PDF for generating contextually appropriate science questions due to containing a large number of factual details in a book. This research highlights how these approaches can be used to design highly effective instructional tools compatible with generating curriculum-based questions on required subjects. Thus, using these approaches, there is a possibility of developing educational content which is beneficial for students and teachers.