All Posts

Browse all blog posts by year and month

Filter: Publication Paper Review Project Post

2026 ¹¹

June ²

OSM data analysis for landuse

Published: 10 Jun, 2026
• 20 min read

Analyzing OpenStreetMap keys and tags to surface what's relevant to landuse, from raw counts down to clustered key families.
- Post
You should not take Hugging Face language tags at face value

Published: 2 Jun, 2026
• 3 min read

A short look at why Hugging Face language tags can be useful while still requiring manual investigation.
- Post

May ¹

A small milestone for our empathy and simulation paper

Published: 28 May, 2026
• 1 min read

Our Journal of Simulation paper on LLMs, agent-based models, and empathetic decision-making was selected as the Editor's Pick.
- Publication

April ²

Distilling Agent-Based Models into Textual Explanations via LLMs

Published: 15 Apr, 2026
• 3 min read

A look at our new research on using LLMs to turn complex ABM simulations into clear textual explanations.
- Publication
I am joining the EVERGREEN research team

Published: 4 Apr, 2026
• 5 min read

Next month, I will be joining INRIA in Montpellier, within the EVERGREEN research team as a research engineer.
- Post

March ¹

What Is Design of Experiments? Learning It Through a Better Cup of Chai

Published: 25 Mar, 2026
• 8 min read

Using the perfect cup of chai to understand the fundamentals of Design of Experiments (DOE)
- Post

February ⁴

How2Bench: A Guideline for Benchmark Development

Published: 24 Feb, 2026
• 7 min read

A breakdown of the How2Bench paper, which advocates rigor in benchmark development, with a focus on evaluation reliability and reproducibility.
- Paper Review
An AI survival guide

Published: 16 Feb, 2026
• 25 min read

Some advice and resources I have found helpful so far as a junior AI researcher.
- Post
Two different philosophies of giving an agent hands

Published: 13 Feb, 2026
• 7 min read

A comparison between CLI and MCP approaches for giving AI agents capabilities to interact with systems.
- Post
One place, two views: the core idea behind GeoReasoner

Published: 3 Feb, 2026
• 15 min read

A breakdown of the GeoReasoner paper, which leverages both linguistic and geospatial information to reason on geospatially grounded natural language.
- Paper Review

January ¹

Building FineWeb-Legal: A 10B Token Pilot

Published: 3 Jan, 2026
• 2 min read

How I extracted 67 million words of legal text from 10B tokens of web data using heuristics and classifiers.
- Project

2025 ¹

September ¹

Can we turn agent-based models into empathetic stories (without getting poetic)?

Published: 3 Sep, 2025
• 5 min read

We test whether GPT-4 can translate agents’ simulated lives into readable, empathetic narratives—and show that style transfer beats ‘please be empathetic’ prompts.
- Publication

2024 ¹

October ¹

Can LLMs learn conceptual modeling from slide decks?

Published: 26 Oct, 2024
• 4 min read

Our new study asks if LLMs can learn enough conceptual modeling to pass graduate quizzes by using the same course materials as students.
- Publication

All Posts

2026 11

June 2

OSM data analysis for landuse

You should not take Hugging Face language tags at face value

May 1

A small milestone for our empathy and simulation paper

April 2

Distilling Agent-Based Models into Textual Explanations via LLMs

I am joining the EVERGREEN research team

March 1

What Is Design of Experiments? Learning It Through a Better Cup of Chai

February 4

How2Bench: A Guideline for Benchmark Development

An AI survival guide

Two different philosophies of giving an agent hands

One place, two views: the core idea behind GeoReasoner

January 1

Building FineWeb-Legal: A 10B Token Pilot

2025 1

September 1

Can we turn agent-based models into empathetic stories (without getting poetic)?

2024 1

October 1

Can LLMs learn conceptual modeling from slide decks?

2026 ¹¹

June ²

May ¹

April ²

March ¹

February ⁴

January ¹

2025 ¹

September ¹

2024 ¹

October ¹