If you made any changes in Pure these will be visible here soon.

Personal profile

Research interests

[Note: This profile is incomplete, especially with regard to my publications. See http://shomir.net  for many more.]

My research brings together natural language processing (NLP), privacy, and artificial intelligence.

I am interested in solving problems to enable computers to do meaningful work with large volumes of natural language text. My lab develops new methods for NLP and applies them to a variety of domains, including privacy, online social networks, web science, and digital libraries. I am particularly interested in breaking down technology's "walls of text", i.e., situations where a human user or decision-maker is expected to consume a large quantity of text to take action while lacking sufficient resources (time, expertise) to properly understand what they have been given. I have applied this paradigm to privacy policies, scholarly manuscripts, documents from the world wide web, and historical texts, and I am always interested in new domains to work with.

Personal profile

I am an Assistant Professor in the College of Information Sciences and Technology at Penn State, where I lead the Human Language Technologies Lab. I am also a Faculty Affiliate of Penn State's Institute for CyberScience and a member of the Social Data Analytics graduate faculty.

From 2016 until 2018 I was an Assistant Professor in the EECS Department at the University of Cincinnati. Prior to that I was a postdoc and a lecturer in Carnegie Mellon University's School of Computer Science and an NSF International Research Fellow in the University of Edinburgh's School of Informatics. I received my PhD in Computer Science from the University of Maryland in 2011.

Education/Academic qualification

Computer Science, PhD, University of Maryland

Award Date: May 1 2011

Computer Science, M.S., University of Maryland

Award Date: May 1 2008

Computer Science, B.S., Virginia Tech

Award Date: May 1 2005

Mathematics, B.S, Virginia Tech

Award Date: May 1 2005

Philosophy, B.A., Virginia Tech

Award Date: May 1 2005

Researcher Defined Keywords

  • natural language processing
  • computational linguistics
  • privacy
  • artificial intelligence

Fingerprint Dive into the research topics where Shomir Wilson is active. These topic labels come from the works of this person. Together they form a unique fingerprint.

  • 8 Similar Profiles

Network Recent external collaboration on country level. Dive into details by clicking on the dots.

Research Output

Supervised and unsupervised methods for robust separation of section titles and prose text in web documents

Gopinath, A. A. M., Wilson, S. & Sadeh, N., Jan 1 2020, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018. Riloff, E., Chiang, D., Hockenmaier, J. & Tsujii, J. (eds.). Association for Computational Linguistics, p. 850-855 6 p. (Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, EMNLP 2018).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • Privacy-enhancing artificial intelligence and language technologies (PAL 2019): Preface to the proceedings

    Wilson, S., Ghanavati, S., Ghazinour, K. & Sadeh, N., Jan 1 2019, In : CEUR Workshop Proceedings. 2335

    Research output: Contribution to journalEditorial

    Reports of the AAAI 2019 spring symposium series

    Baldini, I., Barrett, C., Chella, A., Cinelli, C., Gamez, D., Gilpin, L. H., Hinkelmann, K., Holmes, D., Kido, T., Kocaoglu, M., Lawless, W. F., Lomuscio, A., Macbeth, J. C., Martin, A., Mittu, R., Patterson, E., Sofge, D., Tadepalli, P., Takadama, K. & Wilson, S., Jan 1 2019, In : AI Magazine. 40, 3, p. 59-66 8 p.

    Research output: Contribution to journalArticle

  • Vaccine:: Obfuscating Access Pattern Against File-Injection Attacks

    Liu, H., Wang, B., Niu, N., Wilson, S. & Wei, X., Jun 2019, 2019 IEEE Conference on Communications and Network Security, CNS 2019. Institute of Electrical and Electronics Engineers Inc., p. 109-117 9 p. 8802803. (2019 IEEE Conference on Communications and Network Security, CNS 2019).

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

  • 1 Scopus citations

    Analyzing privacy policies at scale: From crowdsourcing to automated annotations

    Wilson, S., Schaub, F., Liu, F., Sathyendra, K. M., Smullen, D., Zimmeck, S., Ramanath, R., Story, P., Liu, F., Sadeh, N. & Smith, N. A., Dec 1 2018, In : ACM Transactions on the Web. 13, 1, 1.

    Research output: Contribution to journalArticle

  • 4 Scopus citations