Main picture

/ɥín mʷòː ʈʂʰə́n/

Bloomberg LP

Senior Scientist

Email: [email protected]

Bio

I am a Senior Scientist at Bloomberg. At Bloomberg, I have been building an interactive code agent to help analysts retrieve and analyze the various data provided by Bloomberg. This involves implementing various training procedures to improve large language models (LLMs) on capabilities such as tool calling, and training hybrid retrieval models to improve contextual accuracy in code generation.

Previously, I obtained my PhD from Johns Hopkins University, where I was advised by Prof. Benjamin Van Durme. I worked on large language model (LLM) projects including using reinforcement learning (RL) to better orchestrate between LLMs and retrieval systems for code generation [paper], faithfullness evaluation [paper], and preference alignment algorithms [paper, paper]. My research also explored sequence modeling [paper], particularly in the context of streaming, and developed evaluation metrics for code and other structures [paper].

I was awarded outstanding paper awards for Learning to Retrieve Iteratively for In-Context Learning at EMNLP 2024 and Iterative Document-level Information Extraction via Imitation Learning at EACL 2023.

Publications

,
Streaming Sequence Transduction through Dynamic Compression
In International Conference on Spoken Language Translation (IWSLT), 2025.
,
Learning to Retrieve Iteratively for In-Context Learning
In Empirical Methods in Natural Language Processing (EMNLP), 2024.
Outstanding Paper Award
,
FaithScore: Evaluating Hallucinations in Large Vision-Language Models
In Empirical Methods in Natural Language Processing (EMNLP), findings, 2024.
,
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts
In Association for Computational Linguistics (ACL), findings, 2024.
,
Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation
In International Conference on Machine Learning (ICML), 2024.

Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles
In North American Chapter of the Association for Computational Linguistics (NACL), 2024.
,
MultiMUC: Multilingual Template Filling on MUC-4
In European Chapter of the Association for Computational Linguistics (EACL), 2024.
,
A Unified View of Evaluation Metrics for Structured Prediction
In Empirical Methods in Natural Language Processing (EMNLP), 2023.
,
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules
In Empirical Methods in Natural Language Processing (EMNLP), 2023.
,
On Event Individuation for Document-Level Information Extraction
In Empirical Methods in Natural Language Processing (EMNLP), findings, 2023.
,
When Do Decompositions Help for Machine Reading?
In Empirical Methods in Natural Language Processing (EMNLP), 2023.
,
Iterative Document-level Information Extraction via Imitation Learning
In European Chapter of the Association for Computational Linguistics (EACL), 2023.
Outstanding Paper Award
,
Differentiable Tree Operations Promote Compositional Generalization
In International Conference on Machine Learning (ICML), 2023.
,
An Empirical Study on Finding Spans
In Empirical Methods in Natural Language Processing (EMNLP), 2022.
,
Everything Is All It Takes: A Multipronged Strategy for Zero-Shot Cross-Lingual Information Extraction
In Empirical Methods in Natural Language Processing (EMNLP), 2021.
,
LOME: Large Ontology Multilingual Extraction
In Association for Computational Linguistics (ACL), System Demo, 2021.
,
Pattern-aware Data Augmentation for Query Rewriting in Voice Assistant Systems
In DEEP-DIAL at AAAI, 2021.
,
Hierarchical Entity Typing via Multi-level Learning to Rank
In Association for Computational Linguistics (ACL), 2020.
,
Joint Modeling of Arguments for Event Understanding
In Computational Approaches to Discourse (CODI) at EMNLP, 2020.
,
Reading the Manual: Event Extraction as Definition Comprehension
In Structured Prediction for NLP (SPNLP) at EMNLP, 2020.