Chris Kennedy

Instructor in psychiatry

Harvard Medical School / Massachusetts General Hospital

Biography

Chris Kennedy is an instructor in psychiatry at Harvard Medical School, and a researcher at Massachusetts General Hospital’s Center for Precision Psychiatry led by Jordan Smoller. Previously he was a postdoctoral fellow in Gabriel Brat’s surgical informatics lab, in the department of biomedical informatics. He has a PhD in biostatistics from UC Berkeley where he worked with Alan Hubbard and Mark van der Laan. He is a research affiliate at Beth Israel Deaconess Medical Center, UC Berkeley’s D-Lab, the Integrative Cancer Research Group, and Kaiser Permanente’s Division of Research.

Chris chaired TextXD: Text Analysis Across Domains in 2018 & 2019, the premier text-focused data science conference at UC Berkeley. He is co-author of the SuperLearner machine learning framework and varimpact R package. Chris is lead author on the hate speech measurement project, is an NIH T-32 biomedical big data trainee, and is a member of the UCSF NLP Meetup.

He provides consulting services in deep/machine learning, data science, & surveying. In 2018 he led data science for Gavin Newsom’s gubernatorial campaign and Katie Porter’s congressional campaign.

View Chris’s CV here.

Interests

Targeted causal inference
Deep learning (NLP, images, video, time series)
Machine learning
Biomedicine & public health
Randomized trials & experimental design
Electronic health records
Item response theory
Survey methods

Education

PhD in biostatistics, 2020
University of California, Berkeley
Masters in public affairs, 2007
The University of Texas at Austin
B.A. in government and economics, 2005
The University of Texas at Austin

Selected Publications

Measuring hate speech via faceted Rasch measurement and multitask deep learning

We develop a method to create debiased, continuous, interval-valued latent variables from human-labeled data by combining faceted Rasch …

PDF Project Slides arXiv

Evaluation of a school-located influenza vaccination campaign

An evaluation of Oakland’s Shoo the Flu program, published in PLOS Medicine.

PDF DOI

Severe maternal morbidity scoring system

Causal variable importance to create a new risk score for complications during pregnancy.

Project DOI

Patient characteristics associated with telemedicine usage

Examining patient characteristics associated with telephone or video telemedicine visits, rather than in-person care.

PDF DOI

#Vape: Measuring e-cigarette influence on Instagram

Computer vision and natural language processing to track how influencers promote vaping on Instagram.

PDF Project DOI

Early immune stimulation and childhood acute lymphoblastic leukemia

Latent class analysis clustering, causal variable importance, and logistic regression to understand childhood leukemia.

Project DOI

Data-adaptive target parameters

Chapter in Targeted Learning in Data Science (2018) covering the varimpact variable importance algorithm.

Project DOI

Projects

Measuring hate speech

Integrate item response theory with deep NLP to enable major new innovations in the measurement of hate speech.

Targeted Exposure Mixtures

Analysis of exposure mixtures as data-adaptive target parameters based on cross-validated targeted learning (CV-TMLE).

Chestpain Risk Score

Development of a risk score for chest pain at Kaiser Permanente using machine learning, generalized low rank models, variable importance, and accumulated local effect plots.

Varimpact: causally motivated variable importance

Ranking the importance of variables based on their estimated treatment effect on an outcome.

Instagram Vaping

Application of deep learning to measure vaping marketing on Instagram.

Recent & Upcoming Talks

Constructing interval latent variables via Rasch measurement and multitask deep learning: a hate speech application

Nov 19, 2020 Society for Computation in Psychology Virtual

Project

Targeted exposure mixtures

Discovering exposure mixtures and ranking variable sets via cross-validated ensemble machine learning

Apr 22, 2020 European Causal Inference Meeting

Project Slides

Measuring hate speech: unifying deep learning with item response theory

Mar 17, 2020 2:00 PM — 4:00 PM Berkeley Evaluation and Assessment Research (BEAR) Seminar Berkeley, CA

Project

Integrating ordinal, multitask deep learning with faceted item response theory: debiased, explainable, interval measurement of hate speech

Feb 27, 2020 3:30 PM — 5:00 PM NLP@UCSF Meetup San Francisco, CA

Project

SuperLearner ensemble machine learning for chest pain prognostic modeling

Feb 27, 2020 CDC R User’s Group Atlanta, Georgia

Project

Machine learning for human rights and hate speech

Applied machine learning workshop, talk on machine learning for human rights, and talk on hate speech measurement

Nov 14, 2019 DataFest Tbilisi 2019 Tbilisi, Republic of Georgia

Project

See all talks

Teaching

Short courses

Supervised learning in R (6-8 hours): Preprocessing, cross-validation, lasso, decision trees, random forest, xgboost, and superlearner ensembles.

Deep learning in R (6-8 hours): Deep learning with Keras - building & training deep networks, image classification, transfer learning, text analysis, and visualization

Unsupervised learning in R (6-8 hours): Clustering (Hdbscan, LCA, Hopach), dimensionality reduction (GLRM, UMAP), and anomaly detection (isolation forests)

Guide to SuperLearner (4-6 hours): Basic ensembles, hyperparameter tuning, nested cross-validation, parallelization, diagnostics, feature selection, and loss customization.

Causal inference with targeted learning (6-8 hours): causal diagrams, regression with SuperLearner, inverse probability of treatment weighting, targeted maximum likelihood estimation, effect modification, causal variable importance, exposure mixture modeling

Feature selection in R (6-8 hours): permutation importance, adaptive elastic net, relief family (relief-f, STIR, multisurf), joint mutual information, and knockoffs

Please feel free to contact me to discuss training for your institution.

Chris Kennedy

Instructor in psychiatry

Harvard Medical School / Massachusetts General Hospital

Biography

Interests

Education

Selected Publications

Measuring hate speech via faceted Rasch measurement and multitask deep learning

Evaluation of a school-located influenza vaccination campaign

Severe maternal morbidity scoring system

Patient characteristics associated with telemedicine usage

#Vape: Measuring e-cigarette influence on Instagram

Early immune stimulation and childhood acute lymphoblastic leukemia

Data-adaptive target parameters

Projects

Measuring hate speech

Targeted Exposure Mixtures

Chestpain Risk Score

Varimpact: causally motivated variable importance

Instagram Vaping

Recent & Upcoming Talks

Constructing interval latent variables via Rasch measurement and multitask deep learning: a hate speech application

Targeted exposure mixtures

Measuring hate speech: unifying deep learning with item response theory

Integrating ordinal, multitask deep learning with faceted item response theory: debiased, explainable, interval measurement of hate speech

SuperLearner ensemble machine learning for chest pain prognostic modeling

Machine learning for human rights and hate speech

Teaching

Short courses

Recent Posts

Installing Stata with Windows on Amazon EC2

Advice for admitted political science PhD students

Contact