
Dredze, Mark

John C. Malone Associate Professor
Computer Science
http://www.dredze.com

Malone Hall 339
(443) 326-4227
mdredze@jhu.edu

News

Researchers look to Twitter to better understand vaccine refusal

February 24, 2015

A Johns Hopkins computer scientist is part of a team of researchers that has developed a new way to understand vaccine refusal by studying an unlikely resource: Twitter. The researchers will combine Twitter analyses with traditional survey techniques to study why people refuse vaccines and how these reasons vary among communities. The focus on vaccination […]

About

Education
  • Ph.D. 2009, University of Pennsylvania
  • Master of Arts 2004, Yeshiva University
  • Bachelor of Science 2003, Northwestern University
Experience
  • 2011 - Present:  Joint, SOM Health Sciences Informatics
Research Areas
  • Artificial intelligence -- Medical applications
  • Machine Learning trends
  • Natural language processing (Computer science)
  • Public Health Informatics methods
  • Social media analysis

Publications

Journal Articles
  • Nobles AL, Leas EC, Latkin CA, Dredze M, Strathdee SA, Ayers JW (2020).  #HIV: Alignment of HIV-Related Visual Content on Instagram with Public Health Priorities in the US.  AIDS and Behavior.  24(7).  2045-2053.
  • Nobles AL, Leas EC, Noar S, Dredze M, Latkin CA, Strathdee SA, Ayers JW (2020).  Automated image analysis of instagram posts: Implications for risk perception and communication in public health using a case study of #HIV.  PLoS ONE.  15(5).
  • Broniatowski DA, Quinn SC, Dredze M, Jamison AM (2020).  Vaccine communication as weaponized identity politics.  American Journal of Public Health.  110(5).  617-618.
  • Hu D, Martin C, Dredze M, Broniatowski DA (2020).  Chinese social media suggest decreased vaccine acceptance in China: An observational study on Weibo following the 2018 Changchun Changsheng vaccine incident.  Vaccine.  38(13).  2764-2770.
  • Jamison AM, Broniatowski DA, Dredze M, Wood-Doughty Z, Khan DA, Quinn SC (2020).  Vaccine-related advertising in the Facebook Ad Archive.  Vaccine.  38(3).  512-520.
  • Wu S, Dredze M (2020).  Beto, Bentz, Becas: The surprising cross-lingual effectiveness of BERT.  EMNLP-IJCNLP 2019 - 2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing, Proceedings of the Conference.  833-844.
  • Wood-Doughty Z, Shpitser I, Dredze M (2020).  Challenges of using text classifiers for causal inference.  Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018.  4586-4598.
  • Nakhasi A, Bell SG, Passarella RJ, Paul MJ, Dredze M, Pronovost PJ (2019).  The Potential of Twitter as a Data Source for Patient Safety.  Journal of Patient Safety.  15(4).  E32-E35.
  • Leas EC, Nobles AL, Caputi TL, Dredze M, Smith DM, Ayers JW (2019).  Trends in Internet Searches for Cannabidiol (CBD) in the United States.  JAMA network open.  2(10).  e1913853.
  • Nobles AL, Dredze M, Ayers JW (2019).  “Repeal and replace”: increased demand for intrauterine devices following the 2016 presidential election.  Contraception.  99(5).  293-295.
  • Chen T, Dredze M, Weiner JP, Kharrazi H (2019).  Identifying vulnerable older adult populations by contextualizing geriatric syndrome information in clinical notes of electronic health records.  Journal of the American Medical Informatics Association.  26(8-9).  787-795.
  • Chen T, Dredze M, Weiner JP, Hernandez L, Kimura J, Kharrazi H (2019).  Extraction of geriatric syndromes from electronic health record clinical notes: Assessment of statistical natural language processing methods.  Journal of Medical Internet Research.  21(3).
  • Kaufman MR, Dey D, Crainiceanu C, Dredze M (2019).  #MeToo and Google Inquiries Into Sexual Violence: A Hashtag Campaign Can Sustain Information Seeking.  Journal of Interpersonal Violence.
  • John W. Ayers, Alicia L. Nobles, Mark Dredze (2019).  Media Trends for the Substance Abuse and Mental Health Services Administration 800-662-HELP Addiction Treatment Referral Services After a Celebrity Overdose.  JAMA Internal Medicine.
  • Xiaolei Huang, Michael C Smith, Amelia M Jamison, David A Broniatowski, Mark Dredze, Sandra Crouse Quinn, Justin Cai, Michael J Paul (2019).  Can online self-reports assist in real-time identification of influenza vaccination uptake? A cross-sectional study of influenza vaccine-related tweets in the USA, 2013-2017.  BMJ Open.  9.  e024018.
  • Huang X, Smith MC, Jamison AM, Broniatowski DA, Dredze M, Quinn SC, Cai J, Paul MJ (2019).  Can online self-reports assist in real-time identification of influenza vaccination uptake? A cross-sectional study of influenza vaccine-related tweets in the USA, 2013-2017.  BMJ Open.  9(1).
  • Ayers JW, Caputi TL, Nebeker C, Dredze M (2018).  Don’t quote me: reverse identification of research participants in social media studies.  npj Digital Medicine.  1(1).
  • Ayers JW, Dredze M, Leas EC, Caputi TL, Allem JP, Cohen JE (2018).  Next generation media monitoring: Global coverage of electronic nicotine delivery systems (electronic cigarettes) on Bing, Google and Twitter, 2013-2018.  PLoS ONE.  13(11).
  • Broniatowski DA, Jamison AM, Qi SH, AlKulaib L, Chen T, Benton A, Quinn SC, Dredze M (2018).  Weaponized health communication: Twitter bots and Russian trolls amplify the vaccine debate.  American Journal of Public Health.  108(10).  1378-1384.
  • Lama Y, Chen T, Dredze M, Jamison A, Quinn SC, Broniatowski DA (2018).  Discordance between human papillomavirus twitter images and disparities in human papillomavirus risk and disease in the United States: Mixed-methods analysis.  Journal of Medical Internet Research.  20(9).
  • Hammond AS, Paul MJ, Hobelmann J, Koratana AR, Dredze M, Chisolm MS (2018).  Perceived attitudes about substance use in anonymous social media posts near college campuses: Observational study.  Journal of Medical Internet Research.  20(8).
  • Noar SM, Leas E, Althouse BM, Dredze M, Kelley D, Ayers JW (2018).  Can a selfie promote public engagement with skin cancer?.  Preventive Medicine.  111.  280-283.
  • Nastasi A, Bryant T, Canner JK, Dredze M, Camp MS, Nagarajan N (2018).  Breast Cancer Screening and Social Media: a Content Analysis of Evidence Use and Guideline Opinions on Twitter.  Journal of Cancer Education.  33(3).  695-702.
  • Caputi TL, Leas EC, Dredze M, Ayers JW (2018).  Online Sales of Marijuana: An Unrecognized Public Health Dilemma.  American Journal of Preventive Medicine.  54(5).  719-721.
  • Chen T, Dredze M (2018).  Vaccine images on twitter: Analysis of what images are shared.  Journal of Medical Internet Research.  20(4).
  • Gao N, Sell G, Oard DW, Dredze M (2018).  Leveraging side information for speaker identification with the Enron conversational telephone speech collection.  2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings.  2018-January.  577-583.
  • Wolfe T, Carrell A, Dredze M, Van Durme B (2018).  Summarizing entities using distantly supervised information extractors.  CEUR Workshop Proceedings.  2127.  51-58.
  • Alexis S. Hammond, Michael J. Paul, J. Gregory Hobelmann, Animesh R. Koratana, Mark Dredze, Margaret S. Chisolm (2018).  Perceived Attitudes About Substance Use in Anonymous Social Media Posts Near College Campuses.  Journal of Medical Internet Research Mental Health (JMIR MH).  5.  e52.
  • Zhou Y, Dredze M, Broniatowski DA, Adler WD (2018).  Gab: The alt-right social media platform.  2018 International Conference on Social Computing, Behavioral-Cultural Modeling, and Prediction and Behavior Representation in Modeling and Simulation, BRiMS 2018.
  • John W Ayers, Mark Dredze, Eric C Leas, Theodore L. Caputi, Jon-Patrick Allem, Joanna E Cohen (2018).  Next generation media monitoring: Global coverage of electronic nicotine delivery systems (electronic cigarettes) on Bing, Google and Twitter, 2013-2018.  PloS one.  Public Library of Science.  13.  e0205822.
  • David A. Broniatowski, Amelia M. Jamison, SiHua Qi, Lulwah AlKulaib, Tao Chen, Adrian Benton, Sandra C. Quinn, Mark Dredze (2018).  Weaponized Health Communication: Twitter Bots and Russian Trolls Amplify the Vaccine Debate.  American Journal of Public Health (AJPH).  108.  1378-1384.
  • John W Ayers, Theodore L. Caputi, Camille Nebeker, Mark Dredze (2018).  Don't quote me: reverse identification of research participants in social media studies.  Nature Digital Medicine.  1.
  • Yuki Lama, Tao Chen, Mark Dredze, Amelia M Jamison, Sandra C Quinn, David A Broniatowski (2018).  Discordance Between Human Papillomavirus Twitter Images and Disparities in Human Papillomavirus Risk and Disease in the United States: Mixed-Methods Analysis.  Journal of Medical Internet Research (JMIR).  20.  e10244.
  • Benton A, Dredze M (2018).  Deep dirichlet multinomial regression.  NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference.  1.  365-374.
  • Theodore L. Caputi, Eric C. Leas, Mark Dredze, John W. Ayers (2018).  Online Sales of Marijuana: An Unrecognized Public Health Dilemma.  American Journal of Preventive Medicine (AJPM).  54.  719-721.
  • Tao Chen, Mark Dredze (2018).  Vaccine Images on Twitter: What is Shared and Why.  Journal of Medical Internet Research (JMIR).  20.  2018.
  • Caputi TL, Leas E, Dredze M, Cohen JE, Ayers JW (2017).  They’re heating up: Internet search query trends reveal significant public interest in heat-not-burn tobacco products.  PLoS ONE.  12(10).
  • Gao N, Dredze M, Oard DW (2017).  Person entity linking in email with NIL detection.  Journal of the Association for Information Science and Technology.  68(10).  2412-2424.
  • Gao N, Oard DW, Dredze M (2017).  Support for interactive identification of mentioned entities in conversational speech.  SIGIR 2017 - Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval.  953-956.
  • Allem JP, Leas EC, Caputi TL, Dredze M, Althouse BM, Noar SM, Ayers JW (2017).  The Charlie Sheen Effect on Rapid In-home Human Immunodeficiency Virus Test Sales.  Prevention Science.  18(5).  541-544.
  • Mark Dredze, Zachary Wood-Doughty, Sandra C Quinn, David A. Broniatowski (2017).  Vaccine opponents' use of Twitter during the 2016 US presidential election: Implications for practice and policy.  Vaccine.  35.  4670-4672.
  • Ayers JW, Leas EC, Allem JP, Benton A, Dredze M, Althouse BM, Cruz TB, Unger JB (2017).  Why do people use electronic nicotine delivery systems (electronic cigarettes)? A content analysis of Twitter, 2012-2015.  PLoS ONE.  12(3).
  • Huang X, Smith MC, Paul MJ, Ryzhkov D, Quinn SC, Broniatowski DA, Dredze M (2017).  Examining patterns of influenza vaccination in social media.  AAAI Workshop - Technical Report.  WS-17-01 - WS-17-15.  542-546.
  • Andrews N, Dredze M, Van Durme B, Eisner J (2017).  Bayesian modeling of lexical resources for low-resource settings.  ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers).  1.  1029-1039.
  • Smith MC, Dredze M, Quinn SC, Broniatowski DA (2017).  Monitoring real-time spatial public health discussions in the context of vaccine hesitancy.  CEUR Workshop Proceedings.  1996.  12-18.
  • Wolfe T, Dredze M, Van Durme B (2017).  Pocket knowledge base population.  ACL 2017 - 55th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Long Papers).  2.  305-310.
  • Seth M. Noar, Eric C Leas, Benjamin M. Althouse, Mark Dredze, Dannielle Kelley, John W. Ayers (2017).  Can a selfie promote public engagement with skin cancer?.  Preventive Medicine.  10.1016/j.ypmed.2017.10.038.
  • Theodore L. Caputi, Eric C Leas, Mark Dredze, Joanna E. Cohen, John W. Ayers (2017).  They're heating up: Internet search query trends reveal significant public interest in heat-not-burn tobacco products.  PLoS ONE.  10.1371/journal.pone.0185735.
  • John W Ayers, Benjamin M. Althouse, Eric C Leas, Mark Dredze, Jon-Patrick Allem (2017).  Internet searches for suicide following the release of 13 Reasons Why.  JAMA Internal Medicine.  57.  238-240.
  • Michael J. Paul, Mark Dredze (2017).  Social Monitoring for Public Health.  Synthesis Lectures on Information Concepts, Retrieval, and Services.  9.  1-183.
  • Jon-Patrick Allem, Eric C. Leas, Theodore L. Caputi, Mark Dredze, Benjamin M. Althouse, Seth M. Noar, John W. Ayers (2017).  The Charlie Sheen Effect on Rapid In-home Human Immunodeficiency Virus Test Sales.  Prevention Science.  18.  541-544.
  • Ning Gao, Mark Dredze, Douglas Oard (2017).  Person Entity Linking in Email with NIL Detection.  Journal of the Association for Information Science and Technology (JAIST).  10.1002/asi.23888.
  • John W Ayers, Eric C Leas, Jon-Patrick Allem, Adrian Benton, Mark Dredze, Benjamin M Althouse, Tess B Cruz, Jennifer B Unger (2017).  Why Do People Use Electronic Nicotine Delivery Systems (Electronic Cigarettes)? A Content Analysis of Twitter, 2012-2015.  PLoS One.  10.1371/journal.pone.0170702.
  • Anthony Nastasi, Tyler Bryant, Joseph K. Canner, Mark Dredze, Melissa S. Camp, Neeraja Nagarajan (2017).  Breast Cancer Screening and Social Media: a Content Analysis of Evidence Use and Guideline Opinions on Twitter.  Journal of Cancer Education.  1-8.
  • Leas EC, Althouse BM, Dredze M, Obradovich N, Fowler JH, Noar SM, Allem JP, Ayers JW (2016).  Big data sensors of organic advocacy: The case of Leonardo DiCaprio and climate change.  PLoS ONE.  11(8).
  • Biggerstaff M, Alper D, Dredze M, Fox S, Fung ICH, Hickmann KS, Lewis B, Rosenfeld R, Shaman J, Tsou MH, Velardi P, Vespignani A, Finelli L, Chandra P, Kaup H, Krishnan R, Madhavan S, Markar A, Pashley B, Paul M, Meyers LA, Eggo R, Henderson J, Ramakrishnan A, Scott J, Singh B, Srinivasan R, Bakach I, Hao Y, Schaible BJ, Sexton JK, Del Valle SY, Deshpande A, Fairchild G, Generous N, Priedhorsky R, Hickman KS, Hyman JM, Brooks L, Farrow D, Hyun S, Tibshirani RJ, Yang W, Allen C, Aslam A, Nagel A, Stilo G, Basagni S, Zhang Q, Perra N, Chakraborty P, Butler P, Khadivi P, Ramakrishnan N, Chen J, Barrett C, Bisset K, Eubank S, Anil Kumar VS, Laskowski K, Lum K, Marathe M, Aman S, Brownstein JS, Goldstein E, Lipsitch M, Mekaru SR, Nsoesie EO, Gesualdo F, Tozzi AE, Broniatowski D, Karspeck A, Tse ZTH, Ying Y, Gambhir M, Scarpino S (2016).  Results from the centers for disease control and prevention's predict the 2013-2014 Influenza Season Challenge.  BMC Infectious Diseases.  16(1).
  • Dredze M, Kambadur P, Kazantsev G, Mann G, Osborne M (2016).  How Twitter is changing the nature of financial news discovery.  Proceedings of the ACM SIGMOD International Conference on Management of Data.  01-July-2016.
  • Broniatowski DA, Hilyard KM, Dredze M (2016).  Effective vaccine communication during the disneyland measles outbreak.  Vaccine.  34(28).  3225-3228.
  • Ayers JW, Althouse BM, Allem JP, Leas EC, Dredze M, Williams RS (2016).  Revisiting the Rise of Electronic Nicotine Delivery Systems Using Search Query Surveillance.  American Journal of Preventive Medicine.  50(6).  e173-e181.
  • John W. Ayers, Benjamin M. Althouse, Jon-Patrick Allem, Eric C. Leas, Mark Dredze, Rebecca Williams (2016).  Revisiting the Rise of Electronic Nicotine Delivery Systems Using Search Query Surveillance.  American Journal of Preventive Medicine (AJPM).  50.  e173-e181.
  • De Choudhury M, Kiciman E, Dredze M, Coppersmith G, Kumar M (2016).  Discovering shifts to suicidal ideation from mental health content in social media.  Conference on Human Factors in Computing Systems - Proceedings.  2098-2110.
  • Dredze M, Broniatowski DA, Smith MC, Hilyard KM (2016).  Understanding Vaccine Refusal: Why We Need Social Media Now.  American Journal of Preventive Medicine.  50(4).  550-552.
  • Paul MJ, Chisolm MS, Johnson MW, Vandrey RG, Dredze M (2016).  Assessing the validity of online drug forums as a source for estimating demographic and temporal trends in drug use.  Journal of Addiction Medicine.  10(5).  324-330.
  • Yu M, Dredze M, Arora R, Gormley MR (2016).  Embedding lexical features via low-rank tensors.  2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference.  1019-1029.
  • Peng N, Dredze M (2016).  Improving named entity recognition for Chinese social media with word segmentation representation learning.  54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Short Papers.  149-155.
  • Koratana A, Dredze M, Chisolm MS, Johnson MW, Paul MJ (2016).  Studying anonymous health issues and substance use on college campuses with Yik Yak.  AAAI Workshop - Technical Report.  WS-16-01 - WS-16-15.  778-782.
  • Dredze M, Osborne M, Kambadur P (2016).  Geolocation for Twitter: Timing matters.  2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2016 - Proceedings of the Conference.  1064-1069.
  • Benton A, Arora R, Dredze M (2016).  Learning multiview embeddings of Twitter users.  54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Short Papers.  14-19.
  • John W. Ayers, J. Lee Westmaas, Eric C. Leas, Adrian Benton, Yunqi Chen, Mark Dredze, Benjamin M. Althouse (2016).  Leveraging Big Data to Improve Health Awareness Campaigns: A Novel Evaluation of the Great American Smokeout.  JMIR Public Health and Surveillance.  2.  e16.
  • Brad J. Bushman, Katherine Newman, Sandra L. Calvert, Geraldine Downey, Mark Dredze, Michael Gottfredson, Nina G. Jablonski, Ann S. Masten, Calvin Morrill, Daniel B. Neill, Daniel Romer, Daniel W. Webster (2016).  Youth Violence: What We Know and What We Need to Know.  American Psychologist.  71.  17-39.
  • John W Ayers, Eric C Leas, Mark Dredze, Jon-Patrick Allem, Jurek G Grabowski, Linda Hill (2016).  Pokémon Go: a new distraction for drivers and pedestrians.  JAMA Internal Medicine.  176.  1865-1866.
  • Eric C Leas, Benjamin M Althouse, Mark Dredze, Nick Obradovich, James H Fowler, Seth M Noar, Jon-Patrick Allem, John W Ayers (2016).  Big data sensors of organic advocacy: The case of Leonardo DiCaprio and Climate Change.  PLoS One.  11.  e0159885.
  • Michael J. Paul, Margaret S. Chisolm, Matthew W. Johnson, Ryan G. Vandrey, Mark Dredze (2016).  Assessing the validity of online drug forums as a source for estimating demographic and temporal trends in drug use.  Journal of Addiction Medicine.  10.  324-330.
  • Matthew Biggerstaff, David Alper, Mark Dredze, Spencer Fox, Isaac Chun-Hai Fung, Kyle S. Hickmann, Bryan Lewis, Roni Rosenfeld, Jeffrey Shaman, Ming-Hsiang Tsou, Paola Velardi, Alessandro Vespignani, Lyn Finelli (2016).  Results from the Centers for Disease Control and Prevention's Predict the 2013-2014 Influenza Season Challenge.  BMC Infectious Diseases.  16.  10.1186/s12879-016-1669-x.
  • Mark Dredze, David A Broniatowski, Karen M Hilyard (2016).  Zika Vaccine Misconceptions: A social media analysis.  Vaccine.  34.  3441-3442.
  • David A Broniatowski, Mark Dredze, Karen M Hilyard (2016).  Effective Vaccine Communication during the Disneyland Measles Outbreak.  Vaccine.  34.  3225-3228.
  • John W. Ayers, Benjamin M. Althouse, Mark Dredze, Eric C. Leas, Seth M. Noar (2016).  News and Internet Searches About Human Immunodeficiency Virus After Charlie Sheen's Disclosure.  JAMA Internal Medicine.  176.  552-554.
  • Atul Nakhasi, Sarah G Bell, Ralph J Passarella, Michael J Paul, Mark Dredze, Peter J Pronovost (2016).  The Potential of Twitter as a Data Source for Patient Safety.  Journal of Patient Safety.  10.1097/PTS.0000000000000253.
  • Benton A, Paul MJ, Hancock B, Dredze M (2016).  Collective supervision of topic models for predicting surveys with social media.  30th AAAI Conference on Artificial Intelligence, AAAI 2016.  2892-2898.
  • Bushman BJ, Newman K, Calvert SL, Downey G, Dredze M, Gottfredson M, Jablonski NG, Masten AS, Morrill C, Neill DB, Romer D, Webster DW (2016).  Youth Violence: What We Know and What We Need to Know.  American Psychologist.  71(1).  17-39.
  • Kumar M, Dredze M, Coppersmith G, De Choudhury M (2015).  Detecting changes in suicide content manifested in social media following celebrity suicides.  HT 2015 - Proceedings of the 26th ACM Conference on Hypertext and Social Media.  85-94.
  • Wolfe T, Dredze M, Van Durme B (2015).  Predicate argument alignment using a global coherence model.  NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  11-20.
  • Gormley MR, Mo Y, Dredze M (2015).  Improved relation extraction with Feature-rich Compositional embedding models.  Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing.  1774-1784.
  • Peng N, Dredze M (2015).  Named entity recognition for Chinese social media with jointly trained embeddings.  Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing.  548-554.
  • Wang H, Hovy E, Dredze M (2015).  The hurricane sandy twitter corpus.  AAAI Workshop - Technical Report.  WS-15-15.  20-24.
  • David A Broniatowski, Mark Dredze, Michael J Paul, Andrea Dugas (2015).  Using Social Media to Perform Local Influenza Surveillance in an Inner-City Hospital.  JMIR Public Health and Surveillance.  1.
  • Shiliang Wang, Michael J Paul, Mark Dredze (2015).  Social Media as a Sensor of Air Quality and Public Response in China.  Journal of Medical Internet Research (JMIR).  17.
  • Michael J Paul, Mark Dredze (2015).  SPRITE: Generalizing Topic Models with Structured Priors.  Transactions of the Association for Computational Linguistics (TACL).  43-58.
  • Mo Yu, Mark Dredze (2015).  Learning Composition Models for Phrase Embeddings.  Transactions of the Association for Computational Linguistics (TACL).  3.  227-242.
  • Matthew R Gormley, Mark Dredze, Jason Eisner (2015).  Approximation-Aware Dependency Parsing by Belief Propagation.  Transactions of the Association for Computational Linguistics (TACL).  3.  489-501.
  • Mauricio Santillana, Andre T. Nguyen, Mark Dredze, Michael J. Paul, Elaine Nsoesie, John S. Brownstein (2015).  Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance.  PLOS Computational Biology.
  • Mark Dredze, David A. Broniatowski, Michael C Smith, Karen M. Hilyard (2015).  Understanding Vaccine Refusal: Why We Need Social Media Now.  American Journal of Preventive Medicine (AJPM).  50.  550-552.
  • Santillana M, Nguyen AT, Dredze M, Paul MJ, Nsoesie EO, Brownstein JS (2015).  Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance.  PLoS Computational Biology.  11(10).
  • Benton A, Dredze M (2015).  Entity linking for spoken language.  NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  225-230.
  • Wang S, Paul MJ, Dredze M (2015).  Social media as a sensor of air quality and public response in China.  Journal of Medical Internet Research.  17(3).
  • Yu M, Gormley MR, Dredze M (2015).  Combining word embeddings and feature embeddings for fine-grained relation extraction.  NAACL HLT 2015 - 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  1374-1379.
  • Pavlick E, Wolfe T, Rastogi P, Callison-Burch C, Dredze M, Van Durme B (2015).  FrameNet+: Fast paraphrastic tripling of framenet.  ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference.  2.  408-413.
  • Paul MJ, Dredze M, Broniatowski DA, Generous N (2015).  Worldwide Influenza Surveillance through Twitter.  AAAI Workshop - Technical Report.  WS-15-15.  6-11.
  • Peng N, Yu M, Dredze M (2015).  An empirical study of Chinese name matching and applications.  ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference.  2.  377-383.
  • Lee JL, DeCamp M, Dredze M, Chisolm MS, Berger ZD (2014).  What are health-related users tweeting? A qualitative content analysis of health-related users and their messages on Twitter.  Journal of Medical Internet Research.  16(10).  e237.
  • Paul MJ, Dredze M (2014).  Discovering health topics in social media using topic models.  PLoS ONE.  9(8).
  • Wallace BC, Paul MJ, Sarkar U, Trikalinos TA, Dredze M (2014).  A large-scale quantitative analysis of latent factors and sentiment in online doctor reviews.  Journal of the American Medical Informatics Association.  21(6).  1098-1103.
  • Althouse BM, Allem JP, Childers MA, Dredze M, Ayers JW (2014).  Population health concerns during the United States' great recession.  American Journal of Preventive Medicine.  46(2).  166-170.
  • Benjamin M Althouse, Jon-Patrick Allem, Matt Childers, Mark Dredze, John W Ayers (2014).  Population Health Concerns During the United States' Great Recession.  American Journal of Preventive Medicine (AJPM).  46.  166-170.
  • David A Broniatowski, Michael J. Paul, Mark Dredze (2014).  Twitter: Big Data Opportunities (Letter).  Science.  345.  148.
  • Michael J Paul, Mark Dredze (2014).  Discovering Health Topics in Social Media Using Topic Models.  PLoS ONE.  9.
  • Abbasi A, Adjeroh D, Dredze M, Paul MJ, Zahedi FM, Zhao H, Walia N, Jain H, Sanvanson P, Shaker R, Huesch MD, Beal R, Zheng W, Abate M, Ross A (2014).  Social media analytics for smart health.  IEEE Intelligent Systems.  29(2).  60-80.
  • Joy L Lee, Matthew DeCamp, Mark Dredze, Margaret S. Chisolm, Zackary D Berger (2014).  What Are Health-related Users Tweeting? A Qualitative Content Analysis of Health-related Users and their Messages on Twitter.  Journal of Medical Internet Research (JMIR).
  • Michael J Paul, Mark Dredze, David A Broniatowski (2014).  Twitter Improves Influenza Forecasting.  PLOS Currents Outbreaks.
  • Dredze M, Cheng R, Paul MJ, Broniatowski D (2014).  HealthTweets.org: A platform for public health surveillance using Twitter.  AAAI Workshop - Technical Report.  WS-14-14.  2-3.
  • Byron C. Wallace, Michael J. Paul, Urmimala Sarkar, Thomas A. Trikalinos, Mark Dredze (2014).  A Large-Scale Quantitative Analysis of Latent Factors and Sentiment in Online Doctor Reviews.  Journal of the American Medical Informatics Association (JAMIA).  21.  1098-1103.
  • Ahmed Abbasi, Donald Adjeroh, Mark Dredze, Michael J. Paul, Fatemeh Mariam Zahedi, Huimin Zhao, Nitin Walia, Hemant Jain, Patrick Sanvanson, Reza Shaker, Marco D. Huesch, Richard Beal, Wanhong Zheng, Marie Abate, Arun Ross (2014).  Social Media Analytics for Smart Health.  IEEE Intelligent Systems.  29.  60-80.
  • Osborne M, Dredze M (2014).  Facebook, twitter and google plus for breaking news: Is there a winner?.  Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014.  611-614.
  • John W. Ayers, Benjamin M. Althouse, Mark Dredze (2014).  Could Behavioral Medicine Lead the Web Data Revolution?.  Journal of the American Medical Association (JAMA).  311.  1399-1400.
  • John W. Ayers, Benjamin M. Althouse, Morgan Johnson, Mark Dredze, Joanna E. Cohen (2014).  What's the Healthiest Day? Circaseptan (Weekly) Rhythms in Healthy Considerations.  American Journal of Preventive Medicine (AJPM).  47.  73-76.
  • Coppersmith G, Harman C, Dredze M (2014).  Measuring post traumatic stress disorder in twitter.  Proceedings of the 8th International Conference on Weblogs and Social Media, ICWSM 2014.  579-582.
  • Peng N, Wang Y, Dredze M (2014).  Learning polylingual topic models from code-switched social media documents.  52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference.  2.  674-679.
  • Wang S, Paul MJ, Dredze M (2014).  Exploring health topics in Chinese social media: An analysis of Sina Weibo.  AAAI Workshop - Technical Report.  WS-14-14.  20-23.
  • Ayers JW, Althouse BM, Johnson M, Dredze M, Cohen JE (2014).  What's the healthiest day?: Circaseptan (weekly) rhythms in healthy considerations.  American Journal of Preventive Medicine.  47(1).  73-76.
  • Andrews N, Eisner J, Dredze M (2014).  Robust entity clustering via phylogenetic inference.  52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference.  1.  775-785.
  • Yu M, Dredze M (2014).  Improving lexical embeddings with semantic knowledge.  52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference.  2.  545-550.
  • Gormley MR, Mitchell M, Van Durme B, Dredze M (2014).  Low-resource semantic role labeling.  52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014 - Proceedings of the Conference.  1.  1177-1187.
  • Broniatowski DA, Paul MJ, Dredze M (2013).  National and local influenza surveillance through twitter: An analysis of the 2012-2013 influenza epidemic.  PLoS ONE.  8(12).
  • Crammer K, Kulesza A, Dredze M (2013).  Adaptive regularization of weight vectors.  Machine Learning.  91(2).  155-187.
  • Paul MJ, Wallace BC, Dredze M (2013).  What affects patient (dis)satisfaction? Analyzing online doctor ratings with a joint topic-sentiment model.  AAAI Workshop - Technical Report.  WS-13-09.  53-58.
  • David A Broniatowski, Michael J. Paul, Mark Dredze (2013).  National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic.  PLOS ONE.  8.
  • Paul MJ, Dredze M (2013).  Drug extraction from the web: Summarizing drug experiences with multi-dimensional topic models.  NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference.  168-178.
  • Lamb A, Paul MJ, Dredze M (2013).  Separating fact from fear: Tracking flu infections on twitter.  NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference.  789-795.
  • Wolfe T, Van Durme B, Dredze M, Andrews N, Beller C, Callison-Burch C, De Young J, Snyder J, Weese J, Xu T, Yao X (2013).  PARMA: A predicate argument aligner.  ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference.  2.  63-68.
  • Joshi M, Dredze M, Cohen WW, Rosé CP (2013).  What's in a domain? multi-domain learning for multi-Attribute data.  NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference.  685-690.
  • Bergsma S, Dredze M, Van Durme B, Wilson T, Yarowsky D (2013).  Broadly improving user classification via communication-based name and location clustering on Twitter.  NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference.  1010-1019.
  • Dredze M, Paul MJ, Bergsma S, Tran H (2013).  Carmen: A Twitter geolocation system with applications to public health.  AAAI Workshop - Technical Report.  WS-13-09.  20-24.
  • Diab M, Dredze M, Harabagiu S, Radev D (2012).  Overview of the special session on semantics and sociolinguistics in social media.  Proceedings - IEEE 6th International Conference on Semantic Computing, ICSC 2012.
  • Rastrow A, Dredze M, Khudanpur S (2012).  Efficient structured language modeling for speech recognition.  13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012.  2.  1658-1661.
  • Lamb A, Paul MJ, Dredze M (2012).  Investigating Twitter as a source for studying behavioral responses to epidemics.  AAAI Fall Symposium - Technical Report.  FS-12-05.  81-82.
  • Rastrow A, Dredze M, Khudanpur S (2012).  Fast syntactic analysis for statistical language modeling via substructure sharing and uptraining.  50th Annual Meeting of the Association for Computational Linguistics, ACL 2012 - Proceedings of the Conference.  1.  175-183.
  • Paul MJ, Dredze M (2012).  Factorial LDA: Sparse multi-dimensional text models.  Advances in Neural Information Processing Systems.  4.  2582-2590.
  • Joshi M, Dredze M, Cohen WW, Rosé CP (2012).  Multi-domain learning: When do domains matter?  EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference.  1302-1312.
  • Paul MJ, Dredze M (2012).  Experimenting with drugs (and topic models): Multi-dimensional exploration of recreational drug discussions.  AAAI Fall Symposium - Technical Report.  FS-12-05.  38-44.
  • Nakhasi A, Passarella RJ, Bell SG, Paul MJ, Dredze M, Pronovost PJ (2012).  Malpractice and malcontent: Analyzing medical complaints in Twitter.  AAAI Fall Symposium - Technical Report.  FS-12-05.  84-85.
  • Andrews N, Eisner J, Dredze M (2012).  Name phylogeny: A generative model of string variation.  EMNLP-CoNLL 2012 - 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Proceedings of the Conference.  344-355.
  • Karakos D, Roark B, Shafran I, Sagae K, Lehr M, Prud'hommeaux E, Xu P, Glenn N, Khudanpur S, Saraçlar M, Bikel D, Dredze M, Callison-Burch C, Cao Y, Hall K, Hasler E, Koehn P, Lopez A, Post M, Riley D (2012).  Deriving conversation-based features from unlabeled speech for discriminative language modeling.  13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012.  1.  202-205.
  • Crammer K, Kulesza A, Dredze M (2012).  New H∞ bounds for the recursive least squares algorithm exploiting input structure.  ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings.  2017-2020.
  • Dredze M (2012).  How social media will change public health.  IEEE Intelligent Systems.  27(4).  81-84.
  • Crammer K, Dredze M, Pereira F (2012).  Confidence-weighted linear classification for text categorization.  Journal of Machine Learning Research.  13.  1891-1926.
  • Gormley MR, Dredze M, Van Durme B, Eisner J (2012).  Shared components topic models.  NAACL HLT 2012 - 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  783-792.
  • Green S, Andrews N, Gormley MR, Dredze M, Manning CD (2012).  Entity clustering across languages.  NAACL HLT 2012 - 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  60-69.
  • Karakos D, Dredze M, Church K, Jansen A, Khudanpur S (2011).  Estimating document frequencies in a speech corpus.  2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings.  407-412.
  • Parada C, Dredze M, Sethy A, Rastrow A (2011).  Learning sub-word units for open vocabulary speech recognition.  ACL-HLT 2011 - Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies.  1.  712-721.
  • Rastrow A, Dredze M, Khudanpur S (2011).  Efficient discriminative training of long-span language models.  2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings.  214-219.
  • Rastrow A, Dredze M, Khudanpur S (2011).  Adapting n-gram maximum entropy language models with conditional entropy regularization.  2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings.  220-225.
  • Parada C, Dredze M, Jelinek F (2011).  OOV sensitive Named-Entity Recognition in speech.  Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH.  2085-2088.
  • Rastrow A, Dreyer M, Sethy A, Khudanpur S, Ramabhadran B, Dredze M (2011).  Hill climbing on speech lattices: A new rescoring framework.  ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings.  5032-5035.
  • Dredze M, McNamee P, Rao D, Gerber A, Finin T (2010).  Entity disambiguation for knowledge base population.  Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference.  2.  277-285.
  • Parada C, Sethy A, Dredze M, Jelinek F (2010).  A spoken term detection framework for recovering out-of-vocabulary words using the web.  Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010.  1269-1272.
  • Ma J, Kulesza A, Dredze M, Crammer K, Saul LK, Pereira F (2010).  Exploiting feature covariance in high-dimensional online learning.  Journal of Machine Learning Research.  9.  493-500.
  • Dredze M, Oates T, Piatko C (2010).  We're not in Kansas anymore: Detecting domain changes in streams.  EMNLP 2010 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference.  585-595.
  • Dredze M, Jansen A, Coppersmith G, Church K (2010).  NLP on spoken documents without ASR.  EMNLP 2010 - Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference.  460-470.
  • Parada C, Dredze M, Filimonov D, Jelinek F (2010).  Contextual information improves OOV detection in speech.  NAACL HLT 2010 - Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Proceedings of the Main Conference.  216-224.
  • Rao D, McNamee P, Dredze M (2010).  Streaming cross document entity coreference resolution.  Coling 2010 - 23rd International Conference on Computational Linguistics, Proceedings of the Conference.  2.  1050-1058.
  • Dredze M, Kulesza A, Crammer K (2010).  Multi-domain learning by confidence-weighted parameter combination.  Machine Learning.  79(1-2).  123-149.
  • Crammer K, Dredze M, Pereira F (2009).  Exact convex confidence-weighted learning.  Advances in Neural Information Processing Systems 21 - Proceedings of the 2008 Conference.  345-352.
  • Crammer K, Kulesza A, Dredze M (2009).  Adaptive regularization of weight vectors.  Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference.  414-422.
  • Anand SS, Bunescu R, Carvalho V, Chomicki J, Conitzer V, Cox MT, Dignum V, Dodds Z, Dredze M, Furcy D, Gabrilovich E, Göker MH, Guesgen H, Hirsh H, Jannach D, Junker U, Ketter W, Kobsa A, Koenig S, Lau T, Lewis L, Matson E, Metzler T, Mihalcea R, Mobasher B, Pineau J, Poupart P, Raja A, Ruml W, Sadeh N, Shani G, Shapiro D, Smith T, Taylor ME, Wagstaff K, Walsh W, Zhou R (2009).  AAAI 2008 workshop reports.  AI Magazine.  30(1).  108-118.
  • Crammer K, Dredze M, Kulesza A (2009).  Multi-class confidence weighted algorithms.  EMNLP 2009 - Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: A Meeting of SIGDAT, a Special Interest Group of ACL, Held in Conjunction with ACL-IJCNLP 2009.  496-504.
  • Dredze M, Schilit BN, Norvig P (2009).  Suggesting email view filters for triage and search.  IJCAI International Joint Conference on Artificial Intelligence.  1414-1419.
  • Dredze M, Wallach HM, Puller D, Brooks T, Carroll J, Magarick J, Blitzer J, Pereira F (2008).  Intelligent email: Aiding users with AI.  Proceedings of the National Conference on Artificial Intelligence.  3.  1524-1527.
  • Dredze M, Wallach HM, Puller D, Pereira F (2008).  Generating summary keywords for emails using topics.  International Conference on Intelligent User Interfaces, Proceedings IUI.  199-206.
  • Dredze M, Brooks T, Carroll J, Magarick J, Blitzer J, Pereira F (2008).  Intelligent email: Reply and attachment prediction.  International Conference on Intelligent User Interfaces, Proceedings IUI.  321-324.
  • Lerman K, Gilder A, Dredze M, Pereira F (2008).  Reading the markets: Forecasting public opinion of political candidates by news analysis.  Coling 2008 - 22nd International Conference on Computational Linguistics, Proceedings of the Conference.  1.  473-480.
  • Dredze M, Carvalho VR, Lau T (2008).  AAAI Workshop - Technical Report: Preface.  AAAI Workshop - Technical Report.  WS-08-04.
  • Dredze M, Crammer K (2008).  Online methods for multi-domain learning and adaptation.  EMNLP 2008 - 2008 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference: A Meeting of SIGDAT, a Special Interest Group of the ACL.  689-697.
  • Dredze M, Wallenberg J (2008).  Icelandic data driven part of speech tagging.  ACL-08: HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  33-36.
  • Dredze M, Crammer K (2008).  Active learning with confidence.  ACL-08: HLT - 46th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference.  233-236.
  • Dredze M, Crammer K, Pereira F (2008).  Confidence-weighted linear classification.  Proceedings of the 25th International Conference on Machine Learning.  264-271.
  • Dredze M, Blitzer J, Talukdar PP, Ganchev K, Graça JV, Pereira F (2007).  Frustratingly hard domain adaptation for dependency parsing.  EMNLP-CoNLL 2007 - Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning.  1051-1055.
  • Blitzer J, Dredze M, Pereira F (2007).  Biographies, Bollywood, boom-boxes and blenders: Domain adaptation for sentiment classification.  ACL 2007 - Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics.  440-447.
  • Crammer K, Dredze M, Ganchev K, Talukdar PP, Carroll S (2007).  Automatic code assignment to medical text.  ACL 2007 - Proceedings of the Workshop on BioNLP 2007: Biological, Translational, and Clinical Language Processing.  129-136.
  • Dredze M, Gevaryahu R, Elias-Bachrach A (2007).  Learning fast classifiers for image spam.  4th Conference on Email and Anti-Spam, CEAS 2007.
  • Kushmerick N, Lau T, Dredze M, Khoussainov R (2006).  Activity-centric email? A machine learning approach.  Proceedings of the National Conference on Artificial Intelligence.  2.  1634-1637.
  • Dredze M, Lau T, Kushmerick N (2006).  Automatically classifying emails into activities.  International Conference on Intelligent User Interfaces, Proceedings IUI.  2006.  70-77.
  • Dredze M, Blitzer J, Pereira F (2006).  "Sorry, I forgot the attachment:" Email attachment prediction.  3rd Conference on Email and Anti-Spam - Proceedings, CEAS 2006.
  • Ando RK, Dredze M, Zhang T (2005).  TREC 2005 genomics track experiments at IBM Watson.  NIST Special Publication.
  • Danis C, Kellogg WA, Lau T, Stylos J, Dredze M, Kushmerick N (2005).  Managers' email: Beyond tasks and to-dos.  Conference on Human Factors in Computing Systems - Proceedings.  1324-1327.
  • Dredze M, Blitzer J, Pereira F (2005).  Reply expectation prediction for email management.  2nd Conference on Email and Anti-Spam.
  • Livingston K, Dredze M, Hammond K, Birnbaum L (2003).  Beyond broadcast: A demo.  International Conference on Intelligent User Interfaces, Proceedings IUI.  325.
  • Livingston K, Dredze M, Hammond K, Birnbaum L (2003).  Beyond broadcast.  International Conference on Intelligent User Interfaces, Proceedings IUI.  260-262.
Book Chapters
  • Delip Rao, Paul McNamee, Mark Dredze (2013).  Entity Linking: Finding Extracted Entities in a Knowledge Base.  Multi-source, Multi-lingual Information Extraction and Summarization.  Springer Berlin Heidelberg.  93-115.
Other Publications
  • Damianos Karakos, Mark Dredze, Sanjeev Khudanpur (2013).  Estimating Confusions in the ASR Channel for Improved Topic-based Language Model Adaptation.
  • Carolina Parada, Mark Dredze, Abhinav Sethy, Ariya Rastrow (2013).  Sub-Lexical and Contextual Modeling of Out-of-Vocabulary Words in Speech Recognition.
  • Spence Green, Nicholas Andrews, Matthew R. Gormley, Mark Dredze, Christopher D Manning (2011).  Cross-lingual Coreference Resolution: A New Task for Multilingual Comparable Corpora.
  • Michael J. Paul, Mark Dredze (2011).  A Model for Mining Public Health Topics from Twitter.
  • Mark Dredze, Joel Wallenberg (2008).  Further Results and Analysis of Icelandic Part of Speech Tagging.
  • Neal Parikh, Mark Dredze (2007).  Graphical Models for Primarily Unsupervised Sequence Labeling.
Conference Proceedings
  • Ran Zhao, Yuntian Deng, Mark Dredze, Arun Verma, David Rosenberg, Amanda Stent (2019).  Visual Attention Model for Cross-sectional Stock Return Prediction and End-to-End Multimodal Market Representation Learning.  The Florida Artificial Intelligence Research Society (FLAIRS).
  • Joshua Dredze, Lisi Dredze, Mark Dredze (2019).  Measuring Online Information Seeking for Stimulants from Google Search Queries.  American Psychological Association (APA).
  • Elliot Schumacher, Mark Dredze (2019).  Discriminative Candidate Generation for Medical Concept Linking.  Automated Knowledge Base Construction (AKBC).
  • Zachary Wood-Doughty, Praateek Mahajan, Mark Dredze (2018).  Johns Hopkins or johnny-hopkins: Classifying Individuals versus Organizations on Twitter.  NAACL Workshop on Computational Modeling of People's Opinions, Personality, and Emotions in Social Media.  56-61.
  • Zachary Wood-Doughty, Nicholas Andrews, Rebecca Marvin, Mark Dredze (2018).  Predicting Twitter User Demographics from Names Alone.  NAACL Workshop on Computational Modeling of People's Opinions, Personality, and Emotions in Social Media.  105-111.
  • Vedran Sekara, Alex Rutherford, Gideon Mann, Mark Dredze, Natalia Adler, Manuel García-Herranz (2018).  Trends in the Adoption of Corporate Child Labor Policies: An Analysis with Bloomberg Terminal ESG Data.  Bloomberg Data for Good Exchange.
  • Masoud Rouhizadeh, Elham Hatef, Mark Dredze, Christopher Chute, Hadi Kharrazi (2018).  Identifying Social Determinants of Health from Clinical Notes: A Rule-Based Approach.  AMIA Natural Language Processing Working Group Pre-Symposium.
  • Adrian Benton, Mark Dredze (2018).  Using Author Embeddings to Improve Tweet Stance Classification.  EMNLP Workshop on Noisy User-generated Text (W-NUT).  184-194.
  • Adrian Benton, Mark Dredze (2018).  Deep Dirichlet Multinomial Regression.  North American Chapter of the Association for Computational Linguistics (NAACL).  365-374.
  • Zachary Wood-Doughty, Nicholas Andrews, Mark Dredze (2018).  Convolutions Are All You Need (For Classifying Character Sequences).  EMNLP Workshop on Noisy User-generated Text (W-NUT).  208-213.
  • Yuchen Zhou, Mark Dredze, David A Broniatowski, William Adler (2018).  Gab: The Alt-Right Social Media Platform.  International Conference on Social Computing, Behavioral-Cultural Modeling & Prediction and Behavior Representation in Modeling and Simulation (SBP-BRiMS).
  • Zachary Wood-Doughty, Ilya Shpitser, Mark Dredze (2018).  Challenges of Using Text Classifiers for Causal Inference.  Empirical Methods in Natural Language Processing (EMNLP).  4586-4598.
  • Travis Wolfe, Annabelle Carrell, Mark Dredze, Benjamin Van Durme (2018).  Summarizing Entities using Distantly Supervised Information Extractors.  SIGIR Workshop on Knowledge Graphs and Semantics for Text Retrieval, Analysis, and Understanding (KG4IR).  51-58.
  • Katherine Smith, Caitlin Weiger, Errol Fields, Joanna E Cohen, Meghan Moran, Mark Dredze (2018).  Conducting public health surveillance research on consumer product websites.  American Public Health Association (APHA).
  • Neeraja Nagarajan, Husain Alshaikh, Anthony Nastasi, Blair J Smart, Zackary D Berger, Eric B. Schneider, Mark Dredze, Joseph K. Canner, Nita Ahuja (2017).  The Utility of Twitter in Generating High-Quality Conversations about Surgical Care.  Academic Surgical Congress.
  • Xiaolei Huang, Michael C. Smith, Michael J Paul, Dmytro Ryzhkov, Sandra C Quinn, David A Broniatowski, Mark Dredze (2017).  Examining Patterns of Influenza Vaccination in Social Media.  AAAI Joint Workshop on Health Intelligence (W3PHIAI).  542-546.
  • Zachary Wood-Doughty, Michael C Smith, David A Broniatowski, Mark Dredze (2017).  How Does Twitter User Behavior Vary Across Demographic Groups?  ACL Workshop on Natural Language Processing and Computational Social Science.  83-89.
  • Adrian Benton, Glen A Coppersmith, Mark Dredze (2017).  Ethical Research Protocols for Social Media Health Research.  EACL Workshop on Ethics in Natural Language Processing.  94-102.
  • Nicholas Andrews, Benjamin Van Durme, Mark Dredze, Jason Eisner (2017).  Bayesian Modeling of Lexical Resources for Low-Resource Settings.  The Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL).
  • Ning Gao, Mark Dredze, Douglas Oard (2017).  Enhancing Scientific Collaboration Through Knowledge Base Population and Linking for Meetings.  Hawaii International Conference on System Sciences (HICSS).  10.24251/HICSS.2018.076.
  • Travis Wolfe, Mark Dredze, Benjamin Van Durme (2017).  Pocket Knowledge Base Population.  The Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL).
  • Michael C. Smith, Mark Dredze, Sandra C Quinn, David A. Broniatowski (2017).  Monitoring Real-time Spatial Public Health Discussions in the Context of Vaccine Hesitancy.  AMIA Workshop on Social Media Mining for Health Applications.
  • Van Durme, Lippincott, Duh, Burchfield, Poliak, Costello, Finin, Miller, Mayfield, Koehn, Harman, Lawrie, May, Thomas, Chaloux, Carrell, Chen, Comerford, Dredze, Glass, Hao, Martin, Sankepally, Rastogi, Wolfe, Tran, Zhang (2017).  CADET: Computer Assisted Discovery Extraction and Translation.  The Proceedings of the 8th International Conference on Natural Language Processing (IJCNLP): System Demonstrations.
  • Anietie Andy, Mark Dredze, Mugizi Rwebangira, Chris Callison-Burch (2017).  Constructing an Alias List for Named Entities during an Event.  EMNLP Workshop on Noisy User-generated Text (W-NUT).  40-44.
  • Ning Gao, Douglas Oard, Mark Dredze (2017).  Support for Interactive Identification of Mentioned Entities in Conversational Speech.  International Conference on Research and Development in Information Retrieval (SIGIR) (short paper).  953-956.
  • Nanyun Peng, Mark Dredze (2017).  Multi-task Domain Adaptation for Sequence Tagging.  ACL Workshop on Representation Learning for NLP (RepL4NLP).  91-100.
  • Ning Gao, Gregory Sell, Douglas Oard, Mark Dredze (2017).  Leveraging Side Information for Speaker Identification with the Enron Conversational Telephone Speech Collection.  IEEE Automatic Speech Recognition and Understanding Workshop (ASRU).
  • Neeraja Nagarajan, Blair J Smart, Anthony Nastasi, Zoya J. Effendi, Sruthi Murali, Zackary D Berger, Eric B Schneider, Mark Dredze, Joseph K. Canner (2016).  An Analysis of Twitter Conversations on Global Surgical Care.  Annual CUGH Global Health Conference.
  • Michael C Smith, David A. Broniatowski, Michael J. Paul, Mark Dredze (2016).  Towards Real-Time Measurement of Public Epidemic Awareness: Monitoring Influenza Awareness through Twitter.  AAAI Spring Symposium on Observational Studies through Social Media and Other Human-Generated Content.
  • Adrian Benton, Michael J. Paul, Braden Hancock, Mark Dredze (2016).  Collective Supervision of Topic Models for Predicting Surveys with Social Media.  Association for the Advancement of Artificial Intelligence (AAAI).  2892-2898.
  • Animesh R Koratana, Mark Dredze, Margaret S Chisolm, Matthew W Johnson, Michael J. Paul (2016).  Studying Anonymous Health Issues and Substance Use on College Campuses with Yik Yak.  AAAI Workshop on the World Wide Web and Public Health Intelligence.  778-782.
  • Blair J. Smart, Neeraja Nagarajan, Joseph K. Canner, Mark Dredze, Eric B. Schneider, Minh Luu, Zackary D Berger, Jonathan A. Myers (2016).  The Use of Social Media in Surgical Education: An Analysis of Twitter.  Annual Academic Surgical Congress.
  • Neeraja Nagarajan, Blair J. Smart, Mark Dredze, Joy L. Lee, James Taylor, Jonathan A. Myers, Eric B. Schneider, Zackary D. Berger, Joseph K. Canner (2016).  How do Surgical Providers use Social Media? A Mixed-Methods Analysis using Twitter.  Annual Academic Surgical Congress.
  • Munmun De Choudhury, Emre Kiciman, Mark Dredze, Glen A Coppersmith, Mrinal Kumar (2016).  Discovering Shifts to Suicidal Ideation from Mental Health Content in Social Media.  Conference on Human Factors in Computing Systems (CHI).  2098-2110.
  • Mo Yu, Mark Dredze, Raman Arora, Matthew R. Gormley (2016).  Embedding Lexical Features via Low-rank Tensors.  North American Chapter of the Association for Computational Linguistics (NAACL).  1019-1029.
  • Mark Dredze, Miles Osborne, Prabhanjan Kambadur (2016).  Geolocation for Twitter: Timing Matters.  North American Chapter of the Association for Computational Linguistics (NAACL) (short paper).  1064-1069.
  • Mark Dredze, Prabhanjan Kambadur, Gary Kazantsev, Gideon Mann, Miles Osborne (2016).  How Twitter is Changing the Nature of Financial News Discovery.  SIGMOD Workshop on Data Science for Macro-Modeling with Financial and Economic Datasets.  10.1145/2951894.2951903.
  • Mark Dredze, Nicholas Andrews, Jay DeYoung (2016).  Twitter at the Grammys: A Social Media Corpus for Entity Linking and Disambiguation.  EMNLP Workshop on Natural Language Processing for Social Media.  20-25.
  • Adrian Benton, Braden Hancock, Glen A Coppersmith, John W Ayers, Mark Dredze (2016).  After Sandy Hook Elementary: A Year in the Gun Control Debate on Twitter.  Bloomberg Data for Good Exchange.
  • Travis Wolfe, Mark Dredze, Benjamin Van Durme (2016).  A Study of Imitation Learning Methods for Semantic Role Labeling.  Empirical Methods in Natural Language Processing (EMNLP), Workshop on Structured Prediction for NLP.
  • David A Broniatowski, Mark Dredze, Karen M Hilyard, Maeghan Dessecker, Sandra C Quinn, Amelia M Jamison, Michael J. Paul, Michael C. Smith (2016).  Both Mirror and Complement: A Comparison of Social Media Data and Survey Data about Flu Vaccination.  American Public Health Association.
  • Mark Dredze, Manuel García-Herranz, Alex Rutherford, Gideon Mann (2016).  Twitter as a Source of Global Mobility Patterns for Social Good.  ICML Workshop on #Data4Good: Machine Learning in Social Good Applications.
  • Nanyun Peng, Mark Dredze (2016).  Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning.  Association for Computational Linguistics (ACL) (short paper).  149-155.
  • Anietie Andy, Satoshi Sekine, Mugizi Rwebangira, Mark Dredze (2016).  Name Variation in Community Question Answering Systems.  COLING Workshop on Noisy User-generated Text.  51-60.
  • Adrian Benton, Raman Arora, Mark Dredze (2016).  Learning Multiview Embeddings of Twitter Users.  Association for Computational Linguistics (ACL) (short paper).  14-19.
  • Rebecca Knowles, Josh Carroll, Mark Dredze (2016).  Demographer: Extremely Simple Name Demographics.  EMNLP Workshop on Natural Language Processing and Computational Social Science.  108-113.
  • Ning Gao, Mark Dredze, Douglas Oard (2016).  Knowledge Base Population for Organization Mentions in Email.  NAACL Workshop on Automated Knowledge Base Construction (AKBC).  24-28.
  • Michael C Smith, David A. Broniatowski, Mark Dredze (2016).  Using Twitter to Examine Social Rationales for Vaccine Refusal.  International Engineering Systems Symposium (CESUN).
  • John W Ayers, Benjamin M. Althouse, Eric C Leas, Ted Alcorn, Mark Dredze (2016).  Big Media Data Can Inform Gun Violence Prevention.  Bloomberg Data for Good Exchange.
  • Nanyun Peng, Francis Ferraro, Mo Yu, Nicholas Andrews, Jay DeYoung, Max Thomas, Matthew R. Gormley, Travis Wolfe, Craig Harman, Benjamin Van Durme, Mark Dredze (2015).  A Chinese Concrete NLP Pipeline.  North American Chapter of the Association for Computational Linguistics (NAACL) (Demo Paper).  86-90.
  • Yu Wang, Eugene Agichtein, Tom Clark, Mark Dredze, Jeffrey Staton (2015).  Inferring latent user characteristics for analyzing political discussions in social media.  Atlanta Computational Social Science Workshop.
  • Ellie Pavlick, Travis Wolfe, Pushpendre Rastogi, Chris Callison-Burch, Mark Dredze, Benjamin Van Durme (2015).  FrameNet+: Fast Paraphrastic Tripling of FrameNet.  The Annual Meeting of the Association for Computational Linguistics (ACL).
  • Travis Wolfe, Mark Dredze, Benjamin Van Durme (2015).  Predicate Argument Alignment using a Global Coherence Model.  The Annual Meeting of the North American Association of Computational Linguistics (NAACL).
  • Joanna E Cohen, John W Ayers, Mark Dredze (2015).  Tobacco Watcher: Real-time Global Surveillance for Tobacco Control.  World Conference on Tobacco or Health (WCTOH).
  • Haoyu Wang, Eduard Hovy, Mark Dredze (2015).  The Hurricane Sandy Twitter Corpus.  AAAI Workshop on the World Wide Web and Public Health Intelligence.  20-24.
  • Mo Yu, Matthew R. Gormley, Mark Dredze (2015).  Combining Word Embeddings and Feature Embeddings for Fine-grained Relation Extraction.  North American Chapter of the Association for Computational Linguistics (NAACL) (short paper).  1374-1379.
  • Matthew R. Gormley, Mo Yu, Mark Dredze (2015).  Improved Relation Extraction with Feature-Rich Compositional Embedding Models.  Empirical Methods in Natural Language Processing (EMNLP).  1774-1784.
  • Matthew Biggerstaff, David Alper, Mark Dredze, Spencer Fox, Isaac Chun-Hai Fung, Kyle S. Hickmann, Bryan Lewis, Roni Rosenfeld, Jeffrey Shaman, Ming-Hsiang Tsou, Paola Velardi, Alessandro Vespignani, Lyn Finelli (2015).  Results from the Centers for Disease Control and Prevention's Predict the 2013-2014 Influenza Season Challenge.  International Conference on Emerging Infectious Diseases.
  • Nanyun Peng, Mo Yu, Mark Dredze (2015).  An Empirical Study of Chinese Name Matching and Applications.  Association for Computational Linguistics (ACL) (short paper).  377-383.
  • David A Broniatowski, Mark Dredze, Karen M Hilyard (2015).  News Articles are More Likely to be Shared if they Combine Statistics with Explanation.  Conference of the Society for Medical Decision Making.
  • Mrinal Kumar, Mark Dredze, Glen A Coppersmith, Munmun De Choudhury (2015).  Detecting Changes in Suicide Content Manifested in Social Media Following Celebrity Suicides.  Conference on Hypertext and Social Media.  85-94.
  • Michael C Smith, David A Broniatowski, Michael J Paul, Mark Dredze (2015).  Tracking Public Awareness of Influenza through Twitter.  3rd International Conference on Digital Disease Detection (DDD).
  • Nanyun Peng, Mark Dredze (2015).  Named Entity Recognition for Chinese Social Media with Jointly Trained Embeddings.  Empirical Methods in Natural Language Processing (EMNLP) (short paper).  548-554.
  • Joanna E. Cohen, Rebecca Shillenn, Mark Dredze, John W. Ayers (2015).  Tobacco Watcher: Real-Time Global Tobacco Surveillance Using Online News Media.  Annual Meeting of the Society for Research on Nicotine and Tobacco.
  • Michael J Paul, Mark Dredze, David A Broniatowski, Nicholas Generous (2015).  Worldwide Influenza Surveillance through Twitter.  AAAI Workshop on the World Wide Web and Public Health Intelligence.
  • J. Lee Westmaas, John W. Ayers, Mark Dredze, Benjamin M. Althouse (2015).  Evaluation of the Great American Smokeout by Digital Surveillance.  Society of Behavioral Medicine.
  • Glen A Coppersmith, Mark Dredze, Craig Harman, Kristy Hollingshead, Margaret Mitchell (2015).  CLPsych 2015 Shared Task: Depression and PTSD on Twitter.  NAACL Workshop on Computational Linguistics and Clinical Psychology.  31-39.
  • Glen A Coppersmith, Mark Dredze, Craig Harman, Kristy Hollingshead (2015).  From ADHD to SAD: analyzing the language of mental health on Twitter through self-reported diagnoses.  NAACL Workshop on Computational Linguistics and Clinical Psychology.  1-10.
  • Adrian Benton, Mark Dredze (2015).  Entity Linking for Spoken Language.  North American Chapter of the Association for Computational Linguistics (NAACL) (short paper).  225-230.
  • Rebecca Knowles, Mark Dredze, Kathleen Evans, Elyse Lasser, Tom Richards, Jonathan Weiner, Hadi Kharrazi (2014).  High Risk Pregnancy Prediction from Clinical Text.  NIPS Workshop on Machine Learning for Clinical Data Analysis.
  • Adrian Benton, Jay DeYoung, Adam Teichert, Mark Dredze, Benjamin Van Durme, Stephen Mayhew, Max Thomas (2014).  Faster (and Better) Entity Linking with Cascades.  NIPS Workshop on Automated Knowledge Base Construction (AKBC).
  • Matthew R. Gormley, Margaret Mitchell, Benjamin Van Durme, Mark Dredze (2014).  Low-Resource Semantic Role Labeling.  The Annual Meeting of the Association for Computational Linguistics (ACL).
  • Ning Gao, Douglas Oard, Mark Dredze (2014).  A Test Collection for Email Entity Linking.  NIPS Workshop on Automated Knowledge Base Construction.
  • Mark Dredze, Renyuan Cheng, Michael J Paul, David A Broniatowski (2014).  HealthTweets.org: A Platform for Public Health Surveillance using Twitter.  AAAI Workshop on the World Wide Web and Public Health Intelligence.  2-3.
  • Michael J Paul, Mark Dredze, David A Broniatowski (2014).  Challenges in Influenza Forecasting and Opportunities for Social Media.  AAAI Workshop on the World Wide Web and Public Health Intelligence.
  • Shiliang Wang, Michael J Paul, Mark Dredze (2014).  Exploring Health Topics in Chinese Social Media: An Analysis of Sina Weibo.  AAAI Workshop on the World Wide Web and Public Health Intelligence.  20-23.
  • Mo Yu, Mark Dredze (2014).  Improving Lexical Embeddings with Semantic Knowledge.  Association for Computational Linguistics (ACL) (short paper).  545-550.
  • Miles Osborne, Mark Dredze (2014).  Facebook, Twitter and Google Plus for Breaking News: Is there a winner?  International Conference on Weblogs and Social Media (ICWSM).  611-614.
  • Nanyun Peng, Yiming Wang, Mark Dredze (2014).  Learning Polylingual Topic Models from Code-Switched Social Media Documents.  Association for Computational Linguistics (ACL) (short paper).  674-679.
  • Glen A Coppersmith, Mark Dredze, Craig Harman (2014).  Quantifying Mental Health Signals in Twitter.  ACL Workshop on Computational Linguistics and Clinical Psychology.  51-60.
  • Glen A Coppersmith, Craig Harman, Mark Dredze (2014).  Measuring Post Traumatic Stress Disorder in Twitter.  International Conference on Weblogs and Social Media (ICWSM).  579-582.
  • Nicholas Andrews, Jason Eisner, Mark Dredze (2014).  Robust Entity Clustering via Phylogenetic Inference.  Association for Computational Linguistics (ACL).  775-785.
  • Mo Yu, Matthew R Gormley, Mark Dredze (2014).  Factor-based Compositional Embedding Models.  NIPS Workshop on Learning Semantics.
  • Mark Dredze, Michael J Paul, Shane Bergsma, Hieu Tran (2013).  Carmen: A Twitter Geolocation System with Applications to Public Health.  AAAI Workshop on Expanding the Boundaries of Health Informatics Using AI (HIAI).
  • Michael J Paul, Byron C Wallace, Mark Dredze (2013).  What Affects Patient (Dis)satisfaction? Analyzing Online Doctor Ratings with a Joint Topic-Sentiment Model.  AAAI Workshop on Expanding the Boundaries of Health Informatics Using AI (HIAI).
  • Justin Snyder, Rebecca Knowles, Mark Dredze, Matthew R. Gormley, Travis Wolfe (2013).  Topic Models and Metadata for Visualizing Text Corpora.  North American Chapter of the Association for Computational Linguistics (NAACL) (Demo Paper).  5-9.
  • Mahesh Joshi, Mark Dredze, William W. Cohen, Carolyn P. Rose (2013).  What's in a Domain? Multi-Domain Learning for Multi-Attribute Data.  North American Chapter of the Association for Computational Linguistics (NAACL) (short paper).  685-690.
  • Alex Lamb, Michael J. Paul, Mark Dredze (2013).  Separating Fact from Fear: Tracking Flu Infections on Twitter.  North American Chapter of the Association for Computational Linguistics (NAACL) (short paper).  789-795.
  • Michael J Paul, Mark Dredze (2013).  Drug Extraction from the Web: Summarizing Drug Experiences with Multi-Dimensional Topic Models.  North American Chapter of the Association for Computational Linguistics (NAACL).  168-178.
  • Shane Bergsma, Mark Dredze, Benjamin Van Durme, Theresa Wilson, David Yarowsky (2013).  Broadly Improving User Classification via Communication-Based Name and Location Clustering on Twitter.  The Annual Meeting of the North American Association of Computational Linguistics (NAACL).
  • Travis Wolfe, Benjamin Van Durme, Mark Dredze, Nicholas Andrews, Charley Beller, Chris Callison-Burch, Jay DeYoung, Justin Snyder, Jonathan Weese, Tan Xu, Xuchen Yao (2013).  PARMA: A Predicate Argument Aligner.  The Annual Meeting of the Association for Computational Linguistics (ACL).
  • Mahesh Joshi, Mark Dredze, William W Cohen, Carolyn P Rose (2012).  Multi-Domain Learning: When Do Domains Matter?  Empirical Methods in Natural Language Processing (EMNLP).  1302-1312.
  • Nicholas Andrews, Jason Eisner, Mark Dredze (2012).  Name Phylogeny: A Generative Model of String Variation.  Empirical Methods in Natural Language Processing (EMNLP).  344-355.
  • Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur (2012).  Efficient Structured Language Modeling for Speech Recognition.  International Speech Communication Association (INTERSPEECH).
  • Michael J. Paul, Mark Dredze (2012).  Factorial LDA: Sparse Multi-Dimensional Text Models.  Neural Information Processing Systems (NIPS).
  • Damianos Karakos, Brian Roark, Izhak Shafran, Kenji Sagae, Maider Lehr, Emily Prud'hommeaux, Puyang Xu, Nathan Glenn, Sanjeev Khudanpur, Murat Saraclar, Dan Bikel, Mark Dredze, Chris Callison-Burch, Yuan Cao, Keith Hall, Eva Hasler, Philipp Koehn, Adam Lopez, Matt Post, Darcey Riley (2012).  Deriving conversation-based features from unlabeled speech for discriminative language modeling.  International Speech Communication Association (INTERSPEECH).
  • Ralph J Passarella, Atul Nakhasi, Sarah G Bell, Michael J. Paul, Peter J Pronovost, Mark Dredze (2012).  Twitter as a Source for Learning about Patient Safety Events.  Annual Symposium of the American Medical Informatics Association (AMIA).
  • Michael J. Paul, Mark Dredze (2012).  Experimenting with Drugs (and Topic Models): Multi-Dimensional Exploration of Recreational Drug Discussions.  AAAI Fall Symposium on Information Retrieval and Knowledge Discovery in Biomedical Text.
  • Atul Nakhasi, Ralph J Passarella, Sarah G Bell, Michael J Paul, Mark Dredze, Peter J Pronovost (2012).  Malpractice and Malcontent: Analyzing Medical Complaints in Twitter.  AAAI Fall Symposium on Information Retrieval and Knowledge Discovery in Biomedical Text.
  • Matthew R. Gormley, Mark Dredze, Benjamin Van Durme, Jason Eisner (2012).  Shared Components Topic Models.  The Annual Meeting of the North American Association of Computational Linguistics (NAACL).
  • Ariya Rastrow, Sanjeev Khudanpur, Mark Dredze (2012).  Revisiting the Case for Explicit Syntactic Information in Language Models.  NAACL Workshop on the Future of Language Modeling for HLT.  50-58.
  • Alex Lamb, Michael J. Paul, Mark Dredze (2012).  Investigating Twitter as a Source for Studying Behavioral Responses to Epidemics.  AAAI Fall Symposium on Information Retrieval and Knowledge Discovery in Biomedical Text.
  • Spence Green, Nicholas Andrews, Matthew R Gormley, Mark Dredze, Christopher D. Manning (2012).  Entity Clustering Across Languages.  North American Chapter of the Association for Computational Linguistics (NAACL).  60-69.
  • Koby Crammer, Alex Kulesza, Mark Dredze (2012).  New H-∞ Bounds for the Recursive Least Squares Algorithm Exploiting Input Structure.  International Conference on Acoustics, Speech, and Signal Processing (ICASSP).  2017-2020.
  • Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur (2012).  Fast Syntactic Analysis for Statistical Language Modeling via Substructure Sharing and Uptraining.  Association for Computational Linguistics (ACL).  175-183.
  • Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur (2011).  Adapting N-Gram Maximum Entropy Language Models with Conditional Entropy Regularization.  IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
  • Ariya Rastrow, Mark Dredze, Sanjeev Khudanpur (2011).  Efficient Discriminative Training of Long-Span Language Models.  IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
  • Ann Irvine, Mark Dredze, Geraldine Legendre, Paul Smolensky (2011).  Optimality Theory Syntax Learnability: An Empirical Exploration of the Perceptron and GLA.  CogSci Workshop on OT as a General Cognitive Architecture.
  • Carolina Parada, Mark Dredze, Frederick Jelinek (2011).  OOV Sensitive Named-Entity Recognition in Speech.  International Speech Communication Association (INTERSPEECH).
  • Michael J. Paul, Mark Dredze (2011).  You Are What You Tweet: Analyzing Twitter for Public Health.  International Conference on Weblogs and Social Media (ICWSM).  265-272.
  • Carolina Parada, Mark Dredze, Abhinav Sethy, Ariya Rastrow (2011).  Learning Sub-Word Units for Open Vocabulary Speech Recognition.  Association for Computational Linguistics (ACL).  712-721.
  • Ariya Rastrow, Markus Dreyer, Abhinav Sethy, Sanjeev Khudanpur, Bhuvana Ramabhadran, Mark Dredze (2011).  Hill Climbing on Speech Lattices: A New Rescoring Framework.  International Conference on Acoustics, Speech and Signal Processing (ICASSP).  5032-5035.
  • Damianos Karakos, Mark Dredze, Kenneth Church, Aren Jansen, Sanjeev Khudanpur (2011).  Estimating Document Frequencies in a Speech Corpus.  IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).
  • Matthew R. Gormley, Mark Dredze, Benjamin Van Durme, Jason Eisner (2011).  Shared Components Topic Models with Application to Selectional Preference.  NIPS Workshop: Learning Semantics.
  • Joshua T. Vogelstein, William R. Gray, Jason G. Martin, Glen A. Coppersmith, Mark Dredze, J. Bogovic, J. L. Prince, S. M. Resnick, Carey E. Priebe, R. Jacob Vogelstein (2011).  Connectome Classification using Statistical Graph Theory and Machine Learning.  Society for Neuroscience (Poster).
  • Mark Dredze, Tim Oates, Christine Piatko (2010).  We're Not in Kansas Anymore: Detecting Domain Changes in Streams.  Empirical Methods in Natural Language Processing (EMNLP).  585-595.
  • Mark Dredze, Aren Jansen, Glen A Coppersmith, Kenneth Church (2010).  NLP on Spoken Documents without ASR.  Empirical Methods in Natural Language Processing (EMNLP).  460-470.
  • Matthew R. Gormley, Adam Gerber, Mary Harper, Mark Dredze (2010).  Non-Expert Correction of Automatically Generated Relation Annotations.  NAACL-HLT Workshop on Creating Speech and Language Data With Mechanical Turk.  204-207.
  • Chris Callison-Burch, Mark Dredze (2010).  Creating Speech and Language Data With Amazon's Mechanical Turk.  NAACL-HLT Workshop on Creating Speech and Language Data With Mechanical Turk.  1-12.
  • Courtney Napoles, Mark Dredze (2010).  Learning Simple Wikipedia: A Cogitation in Ascertaining Abecedarian Language.  NAACL-HLT Workshop on Computational Linguistics and Writing: Writing Processes and Authoring Aids.  42-50.
  • Carolina Parada, Abhinav Sethy, Mark Dredze, Frederick Jelinek (2010).  A Spoken Term Detection Framework for Recovering Out-of-Vocabulary Words Using the Web.  International Speech Communication Association (INTERSPEECH).
  • Justin Ma, Alex Kulesza, Koby Crammer, Mark Dredze, Lawrence Saul, Fernando Pereira (2010).  Exploiting Feature Covariance in High-Dimensional Online Learning.  AIStats.  493-500.
  • Carolina Parada, Mark Dredze, Denis Filimonov, Frederick Jelinek (2010).  Contextual Information Improves OOV Detection in Speech.  North American Chapter of the Association for Computational Linguistics (NAACL).  216-224.
  • Delip Rao, Paul McNamee, Mark Dredze (2010).  Streaming Cross Document Entity Coreference Resolution.  Conference on Computational Linguistics (Coling).  1050-1058.
  • Tim Finin, William Murnane, Anand Karandikar, Nicholas Keller, Justin Martineau, Mark Dredze (2010).  Annotating named entities in Twitter data with crowdsourcing.  NAACL-HLT Workshop on Creating Speech and Language Data With Mechanical Turk.  80-88.
  • Mark Dredze, Paul McNamee, Delip Rao, Adam Gerber, Tim Finin (2010).  Entity Disambiguation for Knowledge Base Population.  Conference on Computational Linguistics (Coling).  277-285.
  • Mark Dredze, Bill Schilit, Peter Norvig (2009).  Suggesting Email View Filters for Triage and Search.  International Joint Conference on Artificial Intelligence (IJCAI).  1414-1419.
  • Koby Crammer, Mark Dredze, Alex Kulesza (2009).  Multi-Class Confidence Weighted Algorithms.  Empirical Methods in Natural Language Processing (EMNLP).  496-504.
  • Koby Crammer, Alex Kulesza, Mark Dredze (2009).  Adaptive Regularization of Weight Vectors.  Advances in Neural Information Processing Systems (NIPS).
  • Paul McNamee, Mark Dredze, Adam Gerber, Nikesh Garera, Tim Finin, James Mayfield, Christine Piatko, Delip Rao, David Yarowsky, Markus Dreyer (2009).  HLTCOE Approaches to Knowledge Base Population at TAC 2009.  Text Analysis Conference (TAC).
  • Mark Dredze, Partha Pratim Talukdar, Koby Crammer (2009).  Sequence Learning from Data with Multiple Labels.  ECML/PKDD Workshop on Learning from Multi-Label Data.
  • Mark Dredze, Joel Wallenberg (2008).  Icelandic Data-Driven Part of Speech Tagging.  Association for Computational Linguistics (ACL) (short paper).  33-36.
  • Kevin Lerman, Ari Gilder, Mark Dredze, Fernando Pereira (2008).  Reading the Markets: Forecasting Public Opinion of Political Candidates by News Analysis.  Conference on Computational Linguistics (Coling).  473-480.
  • Mark Dredze, Tova Brooks, Josh Carroll, Joshua Magarick, John Blitzer, Fernando Pereira (2008).  Intelligent Email: Reply and Attachment Prediction.  Intelligent User Interfaces (IUI).  321-324.
  • Kuzman Ganchev, Mark Dredze (2008).  Small Statistical Models by Random Feature Mixing.  ACL Workshop on Mobile NLP.  19-20.
  • Mark Dredze, Koby Crammer (2008).  Active Learning with Confidence.  Association for Computational Linguistics (ACL) (short paper).  233-236.
  • Mark Dredze, Hanna Wallach (2008).  User Models for Email Activity Management.  IUI Workshop on Ubiquitous User Modeling.
  • Koby Crammer, Mark Dredze, Fernando Pereira (2008).  Exact Convex Confidence-Weighted Learning.  Advances in Neural Information Processing Systems (NIPS).
  • Mark Dredze, Koby Crammer, Fernando Pereira (2008).  Confidence-Weighted Linear Classification.  International Conference on Machine Learning (ICML).  264-271.
  • Mark Dredze, Hanna Wallach, Danny Puller, Fernando Pereira (2008).  Generating Summary Keywords for Emails Using Topics.  Intelligent User Interfaces (IUI).  199-206.
  • Mark Dredze, Koby Crammer (2008).  Online Methods for Multi-Domain Learning and Adaptation.  Empirical Methods in Natural Language Processing (EMNLP).  689-697.
  • Mark Dredze, Hanna Wallach, Danny Puller, Tova Brooks, Josh Carroll, Joshua Magarick, John Blitzer, Fernando Pereira (2008).  Intelligent Email: Aiding Users with AI.  American National Conference on Artificial Intelligence (AAAI) (Nectar).
  • John Blitzer, Mark Dredze, Fernando Pereira (2007).  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification.  Association for Computational Linguistics (ACL).  440-447.
  • Mark Dredze, John Blitzer, Partha Pratim Talukdar, Kuzman Ganchev, Joao Graca, Fernando Pereira (2007).  Frustratingly Hard Domain Adaptation for Dependency Parsing.  Shared Task - Conference on Natural Language Learning - CoNLL 2007 shared task.  1051-1055.
  • Mark Dredze, Hanna M. Wallach (2007).  Email Keyword Summarization and Visualization with Topic Models.  North East Student Colloquium on Artificial Intelligence (NESCAI).
  • Koby Crammer, Mark Dredze, Kuzman Ganchev, Partha Pratim Talukdar, Steven Carroll (2007).  Automatic Code Assignment to Medical Text.  BioNLP Workshop at ACL.  129-136.
  • John Blitzer, Mark Dredze, Fernando Pereira (2007).  Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification.  North East Student Colloquium on Artificial Intelligence (NESCAI).
  • Mark Dredze, Reuven Gevaryahu, Ari Elias-Bachrach (2007).  Learning Fast Classifiers for Image Spam.  Conference on Email and Anti-Spam (CEAS).
  • Kedar Bellare, Partha Pratim Talukdar, Giridhar Kumaran, Fernando Pereira, Mark Liberman, Andrew McCallum, Mark Dredze (2007).  Lightly-Supervised Attribute Extraction for Web Search.  NIPS Workshop on Machine Learning for Web Search.
  • Mark Dredze, Krzysztof Czuba (2007).  Learning to Admit You're Wrong: Statistical Tools for Evaluating Web QA.  NIPS Workshop on Machine Learning for Web Search.
  • Koby Crammer, Mark Dredze, John Blitzer, Fernando Pereira (2007).  Batch Performance for an Online Price.  NIPS Workshop on Efficient Machine Learning.
  • Danny Puller, Hanna Wallach, Mark Dredze, Fernando Pereira (2007).  Generating Summary Keywords for Emails Using Topics.  Women in Machine Learning Workshop (WiML) at Grace Hopper.
  • Nicholas Kushmerick, Tessa Lau, Mark Dredze, Rinat Khoussainov (2006).  Activity-Centric Email: A Machine Learning Approach.  American National Conference on Artificial Intelligence (AAAI) (Nectar).  1634-1637.
  • Mark Dredze, Tessa Lau, Nicholas Kushmerick (2006).  Automatically classifying emails into activities.  Intelligent User Interfaces (IUI).  70-77.
  • Mark Dredze, John Blitzer, Fernando Pereira (2006).  "Sorry, I Forgot the Attachment:" Email Attachment Prediction.  Conference on Email and Anti-Spam (CEAS).
  • Mark Dredze, John Blitzer, Koby Crammer, Fernando Pereira (2006).  Feature Design for Transfer Learning.  North East Student Colloquium on Artificial Intelligence (NESCAI).
  • Rie Kubota Ando, Mark Dredze, Tong Zhang (2005).  TREC 2005 Genomics Track Experiments at IBM Watson.  Text REtrieval Conference (TREC).
  • Catalina Danis, Wendy Kellogg, Tessa Lau, Mark Dredze, Jeffrey Stylos, Nicholas Kushmerick (2005).  Managers' Email: Beyond Tasks and To-Dos.  Conference on Human Factors in Computing Systems (CHI) (Extended Abstracts).  1324-1327.
  • Mark Dredze, John Blitzer, Fernando Pereira (2005).  Reply Expectation Prediction for Email Management.  Conference on Email and Anti-Spam (CEAS).
  • Kevin Livingston, Mark Dredze, Kristian Hammond, Larry Birnbaum (2003).  Beyond Broadcast.  International Conference on Intelligent User Interfaces (IUI).  260-262.
Patents
  • "Facet suggestion for search query augmentation", 2013.
  • "Request initiated collateral content offering", 2012.