Hi, I am Saikat, a Senior Researcher at the Research in Software Engineering (RiSE) group at Microsoft Research, working on reliability of large language models for code and post-training. I bring 10 years of experience in training and evaluating code models, with a focus on improving the correctness and fidelity of generated programs under real-world constraints.
My work guides code generation models through static and dynamic correctness signalsβvia tests, program analysis, and verificationβand uses these signals through fine-tuning and reinforcement learning. I view reliability as fundamentally a training problem, driven by structured and composable feedback.
Earlier, I graduated with a Ph.D. in Computer Science from Columbia University, advised by Professor Baishakhi Ray. I wrote my Ph.D. thesis on Learning to Edit Code.
π Website: saikatc.info
- Post-training for code: SFT, RLHF/GRPO, reward modeling, reranking, retrieval-augmented fine-tuning
- Correctness supervision: Reward design using test generation, execution feedback, mutation testing, specification inference, and program analysis
- Agent-driven testing: DeepTest β symbolic analysis + LLMs for testing production code at scale
- Formal verification: DeepProof β post-training models for theorem proving (F*, Rocq, Lean)
- Systems: PyTorch, Megatron-LM, Ray, distributed GPU clusters, Kubernetes
- π ICSE'25 β Neural Synthesis for Proof-Oriented Programming [Distinguished Paper Award]
- π ISSTA'23 β Contrastive Learning for Code Understanding [Distinguished Paper Award]
- π ACL'25 β Teaching an Old LLM Secure Coding via Localized Preference Optimization
- π EMNLP'23 β Ranking LLM-Generated Loop Invariants
- π ICSE'24 β Causal Learning for Code Understanding
- π FSE'22 β NatGen: Semantic Rewriting for Pretraining of Code Models
- π NAACL'21 β Unified Pretraining for Code Understanding and Generation
- Use of LLMs for program synthesis, editing, and verification
- Reinforcement learning with execution and correctness feedback for code
- Formal methods meets machine learning (proof generation, specification mining)





