This site is dated. My current academic site is at Wright State University, but it
has mostly academic information.
Courses I have taught in the last four semesters:
Other duties:
My primary research interest in structured documents grew out
of the inspiration provided by
Prof.
Dirk Van Gucht. The present research aims to design a
powerful yet elegant query language for retrieving information from
document databases encoded in SGML format. This query language would
form the base for a complete database system for structured documents.
You can look at the final version of my proposal in
HTML Format, or
download the Postscript
version (638K) or the
DVI version (68K). Note that the DVI file refers to two
postscript figures which you can download separately:
Figure 1 (240K) and
Figure 2 (16K). A
short summary of the research
project is also available.
If you would like to see my other recent publications, please
look at my publication section.
We have recently obtained the PAT software from OpenText and are
presently in the process of using it to query the English Poetry
Database. I worked on a
query interface
to PAT for the Chadwyck-Healey English Poetry
Database, which eventually resulted in DocBase, a system for
posing queries on structured documents. Note that if you are
accessing this from a location outside
IU, you will not be able to see the results of your query, since the
results come from a copyrighted database. A similar interface can
be found at
The UVirginia Pat gateway.
You can also obtain more information on other currently installed SGML
tools in the department.
My latest area of research is in applying database technology in
problem-solving using Case-Based Reasoning (CBR). This is joint
work with Prof. David
Leake and David
Wilson. For more information on this project, and a demonstration of
the system that we are building, please look at the CBDB page.
This is a part of my thesis project, which involves building an
interface for SGML databases. This project is currently in its infancy,
but some information on the interface and related information about
Java can be found at
my java
project page. This project has been replaced by the DocBase - QBT system.
This project is a part of the coursework for the C690 course on management of multimedia data. The projects in this
course are part of the Media Library Kiosk project.
Sandeep Purao, Veda Storey, Arijit Sengupta and Melody
Moore. "Reconciling and Cleansing: An Approach to Inducing Domain
Models" Accepted for publication at the Workshop on Information
Systems and Technologies (WITS 2000), Brisbane, Australia, Dec 9-10,
2000.

Arijit Sengupta and Sandeep Purao. "Transitioning Existing
Content: Inferring Organization Specific Documents" Australian
Journal of Information Systems 8:1, September 2000. p 91-99 (Extended
version of the paper presented at XML Meets Business, 2000, below.)

Arijit Sengupta. "Database Concepts for marked-up textual
documents" in Amita Chin, editor. Text Databases and Document
Management: Theory and Practice. Idea Group Publishing. July 2000.

Arijit Sengupta and Peng Xu. "An Approach for a Reuseable
Electronic Commerce System Model" in Proceedings of the
4th Multiconference on Systemics, Cybernetics and Informatics (SCI-2000).
Orlando, Florida, July 23-26, 2000.

Arijit Sengupta and Sandeep Purao. "Transitioning Existing
Content: Inferring Organization-Specific Document Structures" in
Proceedings of the conference on XML Meets Business, Heidelberg,
Germany, 4th May 2000. Selected for Best Paper Award.

Arijit Sengupta. "The compleat closure: toward a unified view
of structured document database objects" in Proceedings
of the Fifth International Conference on Information Systems Analysis
and Synthesis (ISAS '99). Volume 5, pp 269-273. Orlando, Florida. July
31-August 4, 1999.

Arijit Sengupta, David C. Wilson and David
B. Leake. "Constructing and Transforming CBR Implementations:
Techniques for Corporate Memory Management." Proceedings of the
Workshop on Practical Case-Based Reasoning Strategies for Building and
Maintaining Corporate Memories, Third International Conference on
Case-Based Reasoning, Seeon, Germany, 1999. pp 9-18. (This is a
revised version of the paper below.) In press.

Arijit Sengupta, David C. Wilson and David B. Leake. "On
Constructing the Right Sort of CBR Implementation."
Proceedings of the IJCAI-99 Workshop on Automating the Construction
of Case Based Reasoners, Stockholm, Sweden, 1999. 5 pages. In press.

Jing Ma, Arijit Sengupta and David Wilson. "A framework for
automated construction and transformation of case-based reasoning
systems" Technical Report no. 525, Department of Computer
Science, Indiana University. July 1999, 25 pages.

Arijit Sengupta and Dirk Van Gucht. "A Low-complexity query
language for marked-up documents." In preparation, May 1999.

Arijit Sengupta. "Toward the union of databases and document
management: The design of DocBase." Accepted for publication in
Proceedings: Conference on Management of Data (COMAD'98),
Hyderabad, India, December 17-19 1998.
Available in postscript [548K].
(Text Abstract)

Dennis Groth and Arijit Sengupta. "Introduction to database
applications: a problem-solving approach." In progress. To be
published by McGraw-Hill Publishing Company, December 1998.

Arijit Sengupta. "DocBase - A database environment for
structured documents." Ph.D. Thesis, Computer Science
Department, Indiana University, December 1997.
Available in postscript [4.5M].
(Text Abstract)

Arijit Sengupta and Andrew Dillon. "Extending SGML to Accommodate
Database Functions: A Methodological Overview." Journal of the
American Society for Information Science (JASIS), special issue on
structured information/standards for document architectures. pages
629-637, July, 1997.
Available in postscript [744K].
(Text Abstract)

Arijit Sengupta and Andrew Dillon. "Query By Templates: A
Generalized Approach for Visual Query Formulation for Text Dominated
Databases." in Proceedings: Conference on Advanced
Digital Libraries (ADL'97), Library of Congress, Washington,
D.C. pages 36-47. May 7-9 1997.
Available in postscript [776K].
(Text Abstract)

Arijit Sengupta. "Standardizing the Querying Process with SGML: The
SQL DTD." In Tommie Usdin and Debbie Lapeyre, editors, Proceedings
of the SGML'96 Conference. Graphic Communications Association,
pages 323-337, November, 1996.
Available in SGML, also available in postscript [800K].
(Text Abstract)

Arijit Sengupta. "Demand More from Your SGML Database! Bringing
SQL Under the SGML Limelight." <TAG>, 9(4), pages 1-7,
April 1996.
Available in postscript [352K].
(Text Abstract)

Pradip Bose, Santanu Chaudhuri, and Arijit
Sengupta. "Automated Verification of Printed Circuit Boards:
A Fast AI Approach." In Vijay P. Bhatkar and Kiran M. Rege,
editors, Frontiers in Knowledge Based Computing, Proceedings
of the International Conference on Knowledge Based Computing
Systems (KBCS90), pages 155-164, December 1990.

I started this just as an experiment with the Pat Project above,
but it turned out to be the start of a few major projects that I
did. Anyhow, without further ado, here are the projects (note that
some of them are restricted, so don't blame me if you can't get access):

As discussed above, it
is a query interface to the Chadwyck-Healey English Poetry Database,
built as a gateway to the PAT software.

Again, a
full-text implementation of the
International Center's
Career and Employment newsletter, with full support for on-line posting
and retrieval of newsletters with or without complex search
conditions.

This is an on-line student database for
Project ASPIRE. Features a Sybase database with on-line
data entry, edit, delete, and search. Also features an auto-addition
scheme which allows bulk data to be added to the database
automatically. Note that this is only meant for students from
the Southeast Asian countries.

This is also a
Sybase database, for speakers participating in the
Global Speakers
Service program, and features various types of data
in multiple forms for viewing and retrieval, and auto-report generation
on-line.

A Sybase database from the
International Research and Development (IRD) group of the Office of
International Programs in Indiana University, featuring on-line posting,
viewing, editing, searching, and generation of Grants and Fellowships
postings.

A searchable repository of Fellowship
applications submitted by applicants, with a Sybase backend that
indexes primary parts of the applications.

If you are an employer, and you are interested in my research area
(document database systems - see the research section in my home page), I
would like you to look at my resumé - which is available in the
following formats:

Arijit
Sengupta
Director of Educational Development
Computer Science, Indiana University
Courses and Academic duties
Research and Projects
Publications
WWW Database Projects
My Resumé
Non-Research :-)
Table Tennis is my favorite game! If you play, and have some
time - LET'S PLAY!
Places to go from here
Temporary things
Last modified: Wed Oct 11 17:44:17 EST 2000
asengupt@indiana.edu