publications | Harshwardhan Sanjay Fartale

2026

Can Linguistically Related Languages Guide LLM Translation in Low-Resource Settings?

Aishwarya Ramasethu, Rohin Garg, Niyathi Allu, and 2 more authors

2026

Accepted at LoResMT 2026: EACL 2026 Workshop on Technologies for Machine Translation of Low-Resource Languages

Abs arXiv HTML PDF

We examine whether linguistically similar languages can help large language models translate underrepresented languages effectively, using pivot-based prompting with related languages and few-shot examples — without any parameter updates. Results show that pivot-based prompting yields improvements in certain configurations, particularly when the target language is less well represented, though gains are inconsistent for closely related language varieties. The work provides practical guidance on when inference-time prompting serves as a viable lightweight alternative to fine-tuning in low-resource translation.
The Evolution of FlashAttention

Harshwardhan Fartale

2026

Accepted in 2026 ICLR Blog Post Track

HTML Website

2025

Enhancing Logical Consistency in Language Models through Neuro-Symbolic Feedback and Structured Reasoning

Harshwardhan Fartale, Ashish Kattamuri, Rahul Raja, and 3 more authors

2025

Accepted at AAAI 2026 Workshop on Logical and Symbolic Reasoning in Language Models

Abs

We propose a neuro-symbolic feedback framework for improving logical consistency in large language models through structured reasoning. The approach integrates symbolic reasoning constraints with neural model outputs to mitigate logical contradictions in generated text.
Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis

Harshwardhan Fartale, Ashish Kattamuri, Rahul Raja, and 3 more authors

2025

Accepted at AAAI 2026 XAI4Science Workshop

Abs arXiv HTML PDF

This research investigates whether transformer language models use distinct internal mechanisms for fact retrieval and logical inference. Using mechanistic interpretability techniques on models from the Qwen and LLaMA families, we employ activation patching and structured ablations to measure how specific model components contribute to each task. We find that distinct layers and attention heads lead to selective impairments — disabling recall circuits reduces fact accuracy by 15% while preserving reasoning abilities, and vice versa. At the neuron level, task-specific activation patterns are observed, though with less robust effects. The work provides evidence that these two cognitive functions rely on separable but interacting circuits within transformer architectures.
RADAR: Mechanistic Pathways for Detecting Data Contamination in LLM Evaluation

Ashish Kattamuri, Harshwardhan Fartale, Arpita Vats, and 2 more authors

2025

Accepted at NeurIPS 2025 LLM Evaluation Workshop Poster

Abs arXiv HTML PDF

We present RADAR, a framework that uses mechanistic interpretability to identify contaminated evaluation datasets for large language models — distinguishing genuine reasoning from memorized training data. The system extracts 37 features including surface-level confidence trajectories and deep mechanistic properties such as attention specialization, circuit dynamics, and activation flow patterns. An ensemble classifier achieves 93% overall accuracy, perfect accuracy on unambiguous cases, and 76.7% on challenging borderline examples. Rather than relying on traditional surface-level metrics, RADAR demonstrates how deep mechanistic analysis of model activation patterns and circuit dynamics can reveal whether strong performance stems from authentic reasoning or dataset memorization.
Equilibrium Dynamics and Mitigation of Gender Bias in Synthetically Generated Data

Ashish Kattamuri, Arpita Vats, Harshwardhan Fartale, and 3 more authors

2025

Accepted at AAAI 2026 Workshop on Shaping Responsible Synthetic Data in the Era of Foundation Models

Abs arXiv HTML PDF

We examine how gender bias evolves when large language models recursively generate synthetic datasets. Rather than consistent amplification, we discover equilibrium dynamics — biased data tends to converge toward the model’s inherent bias levels. Contrastive augmentation (introducing gender-swapped text variants) achieves substantial fairness improvements in downstream tasks, despite producing higher semantic similarity bias scores. This disconnect suggests that traditional embedding-based metrics may not capture behavioral fairness outcomes, underscoring the importance of multifaceted evaluation approaches in responsible synthetic data creation.

2023

A Survey of Differential Privacy Frameworks

Harshwardhan Fartale

OpenMined Blog, 2023

Abs HTML

A comprehensive survey of differential privacy and major frameworks implementing it, including Google’s DP Libraries, PyTorch Opacus, SecretFlow, IBM Diffprivlib, TensorFlow Privacy, OpenDP, and PyDP. Covers local vs global differential privacy approaches and their trade-offs for machine learning and statistical analysis applications.