My research interests include artificial intelligence, data mining, data integration, information extraction , and health informatics. Information resources distributed across the Internet are hard to process automatically for knowledge acquisition and knowledge sharing because their data are structurally and semantically heterogeneous. Also, most current AI or machine learning systems are hard to scale to efficiently process large size of data and knowledge. My research combines knowledge-driven and data-driven approaches to address three critical challenges in processing and managing real world data and knowledge: heterogeneity, reusability, and scalability.
I lead the Advanced Integration and Mining (AIM) Laboratory and direct the NSF IUCRC Center for Big Learning (CBL). I work with my students and collaborators on several research projects funded by the National Science Foundation and the National Institutes of Health. Recently, my research has primarily focused on the following research projects: