Indiana University Bloomington

School of Informatics and Computing


Computer Science Program







 Home

 Contacts

 Courses

 Academics

 Careers

 Research

 People

 Calendar
   Events
   Colloquia
   Activity Photos

 Resources

 Facilities

Departmental Colloquia
(2004-2005)

Computer Science Department and School of Informatics, Indiana University


April 22, 2005
4-5:00, Informatics 107

CiteSeer, the Next Generation

Dr. C. Lee Giles

School of Information Sciences and Technology, Computer Science and Engineering, and Supply Chain and Information Systems, The Pennsylvania State University

Abstract:
CiteSeer, a computer and information science search engine and digital library, was a radical departure for scientific document access and analysis. CiteSeer, now hosted at Penn State, has over 700,000 documents and, with over a million page views a day, it has become one of the most popular academic document access engines in science. CiteSeer is also portable and was recently extended to academic business documents (SMEALSearch). CiteSeer is based on two features: actively acquiring new documents and automatic intelligent tagging, and linking of metadata information inherent in an academic document's syntactic structure, such as citations. The new Google Scholar does a similar type of indexing and linking. Why has CiteSeer been so popular and what should it do next? We discuss this and the Next Generation CiteSeer, exploring intelligent methods and algorithms for providing improved and new indexes such as tagged metadata from institutions and acknowledgements, collaboratories for focused collective research, new data resources and web services, a new services oriented architecture, and issues in the automation of processes and services.

Biography:
Since 2000, Dr. C. Lee Giles has been the David Reese Professor of Information Sciences and Technology, Professor of Computer Science and Engineering, and Professor of Supply Chain and Information Systems at the Pennsylvania State University, University Park, PA. His research interests are in web and internet data mining, computational and systems issues in search engines and digital libraries, computational models of e-business, and intelligent information processing; he has over 200 publications in these areas. He was the cocreator of the specialty search engines, CiteSeer, CiteSeer.IST, SMEALSearch and eBizSearch. He is a Fellow of the IEEE and INNS, and a member of AAAI, ACM, and AAAS. His previous positions include that of program manager for the Air Force Office of Scientific Research, Washington, DC and Research Scientist for NEC Research Institute, Princeton, NJ. His graduate degrees are from the U. of Michigan and U. of Arizona.








Valid HTML 4.01!