Personal page • applied AI for social good

Building data + AI systems for social good.

I build monitoring‑grade pipelines that turn messy signals into traceable evidence, decision‑ready products, and ultimately, actionable insights. The work is systems‑first: research design, deep learning + information retrieval (RAG/KB patterns, AI‑assisted), algorithm design, geo + time analytics, and evaluation harnesses that stay attached in production.

See deployed examples PDRI‑DevLab bio GitHub Contact

Chief Data Scientist @ PDRI‑DevLab (U. of Penn)

Focus: research + algorithm design • multilingual NLP • deep learning + retrieval • geo/time signals

Open to collaboration • mission‑driven teams • reach out

Blueprint at a glance

A reusable workflow I apply across social‑impact domains.

monitoring‑grade

R&D:    RQ → op-sigs → baselines → fail modes
arch:   conn → parse → schema/patch → lineage
K/R:    embed → vec+KB → hybrid RAG
mdl:    multi-head → cal+unc → decision policy
LLM:    synth rules | assist labels | doc→schema | err triage
ship:   agg → dash/maps → alerts/briefs | API
SRE:    drift | health | audit | ver | rollback

Research-dirven Evidence‑linked Comparability Continuous eval Audit trails Human review

Design systems that can pivot across contexts. From research to production, build with modularity, transparency, and robustness baked in.

Built for places where the cost of being wrong is real—stress-tested and drift-aware, with explainable outputs that support action.

About

When politics fractures, we stay grounded—and build a better vision.

What I build

End-to-end measurement pipelines for applied research. Built for rigor, reuse, and transparency, so results are reproducible and limitations are clear.

How I work

Designed to stay dependable: clean inputs, sturdy pipelines, and frequent sanity checks, so the output doesn’t fall apart when things shift.

Collaboration

If you’re building mission‑driven products (research labs, NGOs, policy, or industry), I’m happy to talk. Low‑friction contact here.

Deployments

A few papers and notes that reflect the research direction—kept brief and preview-only.

Deployed configurations

Civic space monitoring & forecasts MLP-Civic

Event detection + interpretable early‑warning signals for shifts in protest, restriction, media pressure, and advocacy activity — designed for frequent refreshes and evidence traceability.

Overview Dashboard Forecasts

Foreign influence & coercive leverage tracking MLP-RAI

Tracks influence patterns across channels (diplomatic / economic / information / cyber) with consistent task definitions, careful normalization, and transparent aggregation.

Overview Dashboard Technical

Climate‑driven disruption & response signals MLEED

Detects environmental shocks and social responses using multilingual event modeling + geo grounding, with downstream time‑series signals for monitoring and analysis.

Overview Dashboard Technical

Subnational disruption monitoring (ADM1) Subnational

Map‑first monitoring at subnational resolution, wired to two‑stage geo reconciliation (country → ADM1), standardized counts, and surge detection for rapid scanning and drill‑down.

Dashboard

These are representative, not exhaustive — the emphasis is on reusable infrastructure that transfers to new domains and stakeholders.

Selected papers & technical notes

Previews only (first 2 pages) — to avoid circulating full drafts. Public versions are linked when available.

Preview of first page: Modular Gated Attention

Modular Gated Attention: Adaptive Architecture for Flexible Sequence Modeling

Preprint • 2025

Preview PDF Code

Preview of first page: Causal inference methods

Benchmarking Causal Inference Methods for ATE Estimation

Methods note • 2025

Preview PDF

Preview of first page: Tracking Civic Space

Tracking Civic Space in Developing Countries with a High‑Quality Corpus of Domestic Media and Transformer Models

Preprint • 2025

Preview PDF OSF

Preview of first page: Foreign Influence data

Foreign Influence by Authoritarian Governments: Introducing New Data and Evidence

Working paper • 2024

Preview PDF

#DataForUkraine: Adapting Social Science Tools for Crisis Response

Reflection • 2022

Preview PDF

If you’re exploring collaborations, I can share additional technical notes (evaluation harnesses, sampling QA playbooks, schema/patch patterns).

Impact themes

Typical problem spaces these systems support.

Civic space & democracy Information integrity Foreign influence Climate risk & adaptation Humanitarian response Auditability

Small things

Useful work still benefits from a little life.

About me

Taiwanese by origin, culturally curious by default. I travel through food and the stories people tell through what they cook. Off the clock: dancing, handcrafts, and graffiti-inspired lettering—creative outlets that keep me energized.

Quick nav

Overview Deployments

Technique palette

Multi-task / multi-head modeling Transformers + LLM integration Retrieval (hybrid search, RAG) Feature / embedding engineering Serving + latency-aware pipelines Observability + drift monitoring

Building data + AI systems for social good.

About

What I build

How I work

Collaboration

Deployments

Deployed configurations

Selected papers & technical notes

Impact themes

Links

Contact

Code & models

Lab

Small things

About me

Quick nav

Technique palette