[DL] Master 2 Internship on “Framework for Evaluating Reasoning Verification in Large Language Models” at Télécom Paris and BNP Paribas
Mehwish Alam
alammehw at gmail.com
Thu Oct 30 16:51:38 CET 2025
Dear all,
We have a Master 2 internship position open at Télécom Paris in collaboration with BNP Paribas, France. A detailed description of the internship is attached.
Research Groups: Data, Intelligence, and Graph Team, Télécom Paris, France; BNP Paribas, France.
Advisors: Mehwish Alam, Bérénice Jaulmes, Jean-Christophe Arouete.
Scientific Context.
Recent progress in Large Language Models (LLMs) has led to remarkable advances in Chain-of-Thought (CoT) reasoning, the step-by-step generation of intermediate thoughts to reach a final answer. However, the reliability and interpretability of these reasoning chains remain an open challenge. This internship will contribute to the growing scientific effort to verify and evaluate CoT reasoning and its verifiers through the analysis and development of benchmark datasets and evaluation metrics. The goal is to strengthen our understanding of how LLMs reason, identify where they fail, and provide a basis for designing methods that measure reasoning quality more accurately.
The project will explore existing benchmarks that provide large-scale annotated datasets for reasoning verification across domains such as mathematics, physics, and commonsense reasoning. Each benchmark introduces a different verification methodology; for instance, PRM800K uses fine-grained human annotations for every reasoning step, while THINK-Bench introduces precision and recall metrics for key logical steps. The intern will analyze the advantages and drawbacks of these datasets with respect to scalability, annotation reliability, and domain coverage, and investigate how they can be extended or combined for more comprehensive reasoning evaluation.
Candidate Profile.
Currently pursuing an M2 in Artificial Intelligence/Machine Learning.
Good programming skills, for example in Python (including PyTorch).
Knowledge of Large Language Models is a plus but not required; however, an interest in learning and in staying up to date with developments in the field is required.
Good communication skills, especially in English.
Required Documents.
A full CV
A motivation letter expressing your interest in the position and relevant experience
A transcript of records
Contacts.
Please send all required documents to Mehwish Alam (mehwish.alam at telecom-paris.fr), Bérénice Jaulmes (berenice.jaulmes at ip-paris.fr), and Jean-Christophe Arouete (jean-christophe.arouete at bnpparibas.com) in an email whose subject starts with “[M2Internship-CoT]”.

—
Best Regards
Mehwish Alam
Associate Professor
Telecom Paris, Institut Polytechnique de Paris
Department of Informatics and Networks (INFRES)
19 Pl. Marguerite Perey, 91120 Palaiseau, France
Web: https://sites.google.com/view/mehwish-alam/home
-------------- next part --------------
A non-text attachment was scrubbed...
Name: InternshipOffer - TPa - BNP - 2026.pdf
Type: application/pdf
Size: 549150 bytes
Desc: not available
URL: <http://mailman.zih.tu-dresden.de/pipermail/dl/attachments/20251030/6aacdbc1/attachment-0001.pdf>