Research Interests


Data Mining, Healthcare Data Analysis, Social Network Data Analysis, Data-Intensive High Performance Computing.


Project Website


  • Allergy
  • Cancer
  • Flu
  • Pulse Of The Tweeters

  • Publications


  • Kathy Lee, Ankit Agrawal, and Alok Choudhary: Real-Time Disease Surveillance using Twitter Data: Demonstration on Flu and Cancer. To appear in the 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), August 2013.
  • Kathy Lee, Ankit Agrawal, and Alok Choudhary: Real-Time Digital Flu Surveillance using Twitter Data. DMMH workshop at SDM 2013(SDM-DMMH), May 2013. (pdf)
  • Diana Palsetia, Md Mostofa Ali Patwary, Kunpeng Zhang, Kathy Lee, Christopher Moran, Yusheng Xie, Daniel Honbo, Ankit Agrawal, Wei-keng Liao, and Alok Choudhary. User-Interest Based Community Extraction in Social Networks. In the Workshop on Social Network Mining and Analysis, held in conjunction with the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, August 2012. (pdf)
  • Alok Choudhary, William Hendrix, Kathy Lee, Diana Palsetia, and Wei-keng Liao. Social Media Evolution of The Egyptian Revolution. Communications of the ACM, 55(5):74-80, ACM, New York, NY, USA, May 2012. (pdf)
  • Hongyu Gao, Yan Chen, Kathy Lee, Diana Palsetia, and Alok Choudhary. Towards Online Spam Filtering in Social Networks. In the Network and Distributed System Security Symposium, February 2012. (pdf)
  • Kathy Lee, Diana Palsetia, Md Mostofa Ali Patwary, Ankit Agrawal, Alok Choudhary, and Ramanathan Narayanan. Twitter Trending Topic Classification. In the Workshop on Optimization Based Methods for Emerging Data Mining Problems, held in conjunction with the IEEE International Conference on Data Mining, December 2011. (pdf)
  • Kunpeng Zhang, Yu Cheng, Yusheng Xie, Ankit Agrawal, Diana Palsetia, Kathy Lee, Wei-keng Liao, and Alok Choudhary. SES: Sentiment Elicitation System for Social Media Data. In the Workshop on Sentiment Elicitation from Natural Text for Information Retrieval and Extraction, held in conjunction with the IEEE International Conference on Data Mining, December 2011. (pdf)