
|
|
Departmental Colloquia (2004-2005)
April 22, 2005 4-5:00, Informatics 107
CiteSeer, the Next Generation
Dr. C. Lee Giles
School of Information Sciences and Technology, Computer Science and Engineering, and
Supply Chain and Information Systems, The Pennsylvania State University
Abstract:
CiteSeer, a computer and information science search engine and digital library,
was a radical departure for scientific document access and analysis. CiteSeer, now hosted
at Penn State, has over 700,000 documents and, with over a million page views a day,
it has become one of the most popular academic document access engines in science.
CiteSeer is also portable and was recently extended to academic business documents
(SMEALSearch). CiteSeer is based on two features: actively acquiring new documents and
automatic intelligent tagging, and linking of metadata information inherent in an academic
document's syntactic structure, such as citations. The new Google Scholar does a similar
type of indexing and linking. Why has CiteSeer been so popular and what should it do next?
We discuss this and the Next Generation CiteSeer, exploring intelligent methods and
algorithms for providing improved and new indexes such as tagged metadata from institutions
and acknowledgements, collaboratories for focused collective research, new data resources
and web services, a new services oriented architecture, and issues in the automation
of processes and services.
Biography:
Since 2000, Dr. C. Lee Giles has been the David Reese Professor of Information Sciences
and Technology, Professor of Computer Science and Engineering, and Professor of Supply
Chain and Information Systems at the Pennsylvania State University, University Park, PA.
His research interests are in web and internet data mining, computational and systems
issues in search engines and digital libraries, computational models of e-business,
and intelligent information processing; he has over 200 publications in these areas.
He was the cocreator of the specialty search engines, CiteSeer, CiteSeer.IST, SMEALSearch
and eBizSearch. He is a Fellow of the IEEE and INNS, and a member of AAAI, ACM, and AAAS.
His previous positions include that of program manager for the Air Force Office of Scientific
Research, Washington, DC and Research Scientist for NEC Research Institute, Princeton, NJ.
His graduate degrees are from the U. of Michigan and U. of Arizona.
|