About

Hi! I'm Jake, a PhD student in the Department of Economics at Harvard University. I graduated from Yale College in 2020, where I pursued a double major in Statistics and Data Science (S&DS) and Ethics, Politics, and Economics (EP&E).

I'm interested in topics at the intersections of economics, computer science, and statistics.

Curriculum Vitae Google Scholar Semantic Scholar

Publications

American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers

Melissa Dell, Jacob Carlson, Tom Bryan, Emily Silcock, Abhishek Arora, Zejiang Shen, Luca D'Amico-Wong, Quan Le, Pablo Querubin, Leander Heldring. "American Stories: A Large-Scale Structured Text Dataset of Historical U.S. Newspapers." NeurIPS D&B (2023).

Paper Dataset

EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge

Tom Bryan, Jacob Carlson, Abhishek Arora, Melissa Dell. "EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge." EMNLP SD (2023).

Paper Python Package

Dyadic Clustering in International Relations

Jacob Carlson, Trevor Incerti, and P. M. Aronow. "Dyadic Clustering in International Relations." Political Analysis (2024).

Paper Replication R Package Stata Command

LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis

Zejiang Shen, Ruochen Zhang, Melissa Dell, Benjamin Lee, Jacob Carlson, and Weining Li. "LayoutParser: A Unified Toolkit for Deep Learning Based Document Image Analysis." ICDAR (2021). Oral presentation.

Paper Website

Working Papers

True and Pseudo-True Parameters

Isaiah Andrews, Harvey Barnhard, and Jacob Carlson. "True and Pseudo-True Parameters." (2024).

Paper

Efficient OCR for Building a Diverse Digital History

Jacob Carlson, Tom Bryan, and Melissa Dell. "Efficient OCR for Building a Diverse Digital History." arXiv preprint arXiv:2304.02737 (2023).

Paper Codebase

Contact

Email: jacob underscore carlson at g dot harvard dot edu

Twitter: @J_S_Carlson