Personal page • applied AI for social good

Building data + AI systems for social good.

I build monitoring‑grade pipelines that turn messy signals into traceable evidence, decision‑ready products, and ultimately, actionable insights. The work is systems‑first: research design, deep learning + information retrieval (RAG/KB patterns, AI‑assisted), algorithm design, geo + time analytics, and evaluation harnesses that stay attached in production.

Chief Data Scientist @ PDRI‑DevLab (U. of Penn)
Focus: research + algorithm design • multilingual NLP • deep learning + retrieval • geo/time signals
Open to collaboration • mission‑driven teams • reach out
Blueprint at a glance
A reusable workflow I apply across social‑impact domains.
monitoring‑grade
R&D:    RQ → op-sigs → baselines → fail modes
arch:   conn → parse → schema/patch → lineage
K/R:    embed → vec+KB → hybrid RAG
mdl:    multi-head → cal+unc → decision policy
LLM:    synth rules | assist labels | doc→schema | err triage
ship:   agg → dash/maps → alerts/briefs | API
SRE:    drift | health | audit | ver | rollback

                
                
Research-dirven Evidence‑linked Comparability Continuous eval Audit trails Human review

Design systems that can pivot across contexts. From research to production, build with modularity, transparency, and robustness baked in.

Built for places where the cost of being wrong is real—stress-tested and drift-aware, with explainable outputs that support action.

About

When politics fractures, we stay grounded—and build a better vision.

What I build

End-to-end measurement pipelines for applied research. Built for rigor, reuse, and transparency, so results are reproducible and limitations are clear.

How I work

Designed to stay dependable: clean inputs, sturdy pipelines, and frequent sanity checks, so the output doesn’t fall apart when things shift.

Collaboration

If you’re building mission‑driven products (research labs, NGOs, policy, or industry), I’m happy to talk. Low‑friction contact here.

Deployments

A few papers and notes that reflect the research direction—kept brief and preview-only.

Deployed configurations

Civic space monitoring & forecasts MLP-Civic

Event detection + interpretable early‑warning signals for shifts in protest, restriction, media pressure, and advocacy activity — designed for frequent refreshes and evidence traceability.

Foreign influence & coercive leverage tracking MLP-RAI

Tracks influence patterns across channels (diplomatic / economic / information / cyber) with consistent task definitions, careful normalization, and transparent aggregation.

Climate‑driven disruption & response signals MLEED

Detects environmental shocks and social responses using multilingual event modeling + geo grounding, with downstream time‑series signals for monitoring and analysis.

Subnational disruption monitoring (ADM1) Subnational

Map‑first monitoring at subnational resolution, wired to two‑stage geo reconciliation (country → ADM1), standardized counts, and surge detection for rapid scanning and drill‑down.

These are representative, not exhaustive — the emphasis is on reusable infrastructure that transfers to new domains and stakeholders.

Selected papers & technical notes

Previews only (first 2 pages) — to avoid circulating full drafts. Public versions are linked when available.

Preview of first page: Modular Gated Attention
Modular Gated Attention: Adaptive Architecture for Flexible Sequence Modeling
Preprint • 2025
Preview of first page: Causal inference methods
Benchmarking Causal Inference Methods for ATE Estimation
Methods note • 2025
Preview of first page: Tracking Civic Space
Tracking Civic Space in Developing Countries with a High‑Quality Corpus of Domestic Media and Transformer Models
Preprint • 2025
Preview of first page: Foreign Influence data
Foreign Influence by Authoritarian Governments: Introducing New Data and Evidence
Working paper • 2024
Preview of first page: DataForUkraine
#DataForUkraine: Adapting Social Science Tools for Crisis Response
Reflection • 2022

If you’re exploring collaborations, I can share additional technical notes (evaluation harnesses, sampling QA playbooks, schema/patch patterns).

Impact themes

Typical problem spaces these systems support.

Civic space & democracy Information integrity Foreign influence Climate risk & adaptation Humanitarian response Auditability