Research & Publications 

Exploring the frontiers of Quantum Computing, Machine Learning, and Artificial Intelligence through peer-reviewed research.

All Publications

IEEE Xplore and Arxiv Preprint

Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning

Most of the traditional Applicant Tracking Systems (ATS) depend on strict matching using keywords, where candidates that are highly qualified are many times disqualified because of minor semantic differences. In this article, the two-stage process of developing a more comprehensive resume assessment system based on small language model that is trained with fewer than 600M parameters is introduced and fine-tuned by using GRPO with a unique-designed reward function.The initial stage is (SFT) Supervised Fine Tuning, which are use to create a strong base model with the ability to perceive resumes beyond superficial overlap of keywords. This SFT model is further-optimized in the second step with Reinforced Learning (RL) via GRPO with the help of multi-component based rewarding, which will not be considered as a commission of tokens matching.In the initial RL experiments, we found a severe difficulty in the shape of reward hacking: overly aggressive penalty terms resulted in unstable training dynamics and prohibitively negative model behaviour. This was solved by trial and error refinement of the reward, and careful training hyperparameter tuning, which led to a stable and controlled process of gentle polishing.GRPO-refined model shows high real-life performance, as it shows accuracy of 91% on unseen data used for testing. It has a high recall of 0.85 on the SELECTED class with a perfect precision of 1.0, which highlights its high reliability to be used in identifying qualified applicants. These findings demonstrate that an appropriately structured two-step fine-tuning pipeline can effectively be used to transfer a small language model into humanlike candidate evaluation, surpassing shortcoming of both traditional ATS systems and unrefined uses of reinforcement learning.

Reinforcement LearningLLMsGRPOATS Optimization
Shreyansh Jainet al.
January 2026
DOI: 10.1109/ICCCA66364.2025.11325393
Read Paper

S. Jain et al., "Mathematical Framework for Custom Reward Functions in Job Application Evaluation using Reinforcement Learning," 2025 IEEE 7th International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 2025, pp. 1-6, doi: 10.1109/ICCCA66364.2025.11325393.

Book Chapter: IGI Global
Scopus

Quantum-Enhanced Smart Computing Framework for Sustainable Credit Risk Decision Communication

Recent advancements in smart computing and intelligent decision communication systems have enabled new possibilities for sustainable financial technologies. This chapter introduces a hybrid quantum classical model for credit risk prediction that leverages superposition and entanglement to enhance complex financial data processing. Unlike traditional machine learning models that struggle with class imbalance, nonlinear relationships and high dimensional dependencies, the system combines advanced preprocessing, feature selection and quantum kernel computation within a scalable support vector framework. Trained on 33,000 loan records across 12 borrower attributes, it achieves 94.53% accuracy.

Quantum ComputingFinTechRisk PredictionSVM
Shreyansh Jainet al.
January 2026
DOI: 10.4018/979-8-3373-3541-4.ch013
Read Paper

Jain, S., Chaudhary, K., Singh, S., Tundjungsari, V., & Bose, A. (2026). Quantum-Enhanced Smart Computing Framework for Sustainable Credit Risk Decision Communication. In V. Balas, H. Pandey, M. Bin Ali, V. Singh, & A. Kumar (Eds.), Recent Advances in Smart Communication Technologies for a Sustainable Future (pp. 357-384). IGI Global Scientific Publishing. https://doi.org/10.4018/979-8-3373-3541-4.ch013

IEEE ICAECA 2025
Scopus

Hybrid Quantum Model for Digital Media Processing for Metal Surface Defect Detection

Processing Digital media such images is a crucial step for any image classification task such as surface defect detection or any deep learning task which include Digital Media Processing. Ensuring the detail capturing of each and every feature of the input image from each and every pixel is must in order to build a robust system with minimal error or no error. While it may not be possible and causes the struggle for classical computing methods to efficiently capture and process complex patterns and subtle details. Utilising quantum computing for digital image processing gives an advantage over classical methods as it operates on high dimensional space and leverages quantum parallelism to explore numerous possibilities simultaneously. This enable the extraction of more complex features that might be otherwise missed improving the model's capability to flag defects with greater accuracy and efficiency. Quantum computing's potential to enhance feature extraction can significantly boost the performance of classification models, leading to dependable and precise surface defect detection.

Quantum ComputingCNNImage ProcessingMachine Learning
J. Sahaet al.
January 2025
DOI: 10.1109/ICAECA63854.2025.11012389
Read Paper

J. Saha, S. Jain and R. M, "Hybrid Quantum Model for Digital Media Processing for Metal Surface Defect," 2025 3rd International Conference on Advancements in Electrical, Electronics, Communication, Computing and Automation (ICAECA), Coimbatore, India, 2025, pp. 1-7, doi: 10.1109/ICAECA63854.2025.11012389.