call for papers
committees
topics of interest
submissions
important dates
conference rates
hotel / tour info
sponsors
location

 

Conference Tutorial:

Finding scholarly information through the Internet and the WWW:
state of the art and hopes for the future

by Professor Paul Nieuwenhuysen, Vrije Universiteit Brussel,

B-1050 Brussel,Belgium

 

Abstract:

This tutorial gives an overview of secondary information sources and services that allow us to find and locate required primary sources, with an emphasis on open access systems and academic, scholarly, scientific and technical information.
Primary information sources include these days
- billions of public access WWW pages that include text as well as images and multimedia
- thousands of discussion groups based on electronic mail, Usenet, the WWW or combinations of these
- millions of books
- thousands of electronic journals with millions of articles
- open archives/repositories set up and maintained by scientific organizations
Pointing out a selection of interesting ones for a general audience is not feasible or meaningful, in view of the huge volume. This brings us to the secondary sources that help us by creating some order in the expanding information landscape. The secondary sources include
- some general, horizontal subject directories guiding us to WWW sites
- more specialized, vertical subject directories guiding us to WWW sites in some specific subject area
- general, horizontal search engines that lead us to WWW pages, such as Google Web Search, MSN Web Search, Search Yahoo, Ask; some offer categorization and clustering of results on similar subjects, to cope with the classical problem in information retrieval of ambiguity of meaning of words in a query, such as Mooter and Wisenut
- several more specialized systems to search through open archives/repositories, such as Scirus, OAIster and more recently Google Scholar
- meta-WWW-search engines that rely on existing general WWW search engines and that combine and merge search results; remarkable among these is Vivisimo which offers categorization of results to cope with the classical problem in information retrieval of ambiguity of meaning of words in a query
- several systems to find information, which offer categorization plus graphical visualization of search results, such as Kartoo and Grokker
- several databases of booksellers that offer bibliographic descriptions of books and related information, such as Amazon, and of course the book databases/catalogues of many libraries, such as the Library of Congress and the British Library
- several open access meta-search systems that allow us to search through several bookshop-databases in one action
- a few databases that allow even full-text searches of the contents of a selection of books, such as Amazon and Google Book Search
- a few general, horizontal open access databases with search engines that allow us to find article titles in most areas of science and technology, such as Infotrieve, Ingenta, Scirus, and most recently Google Scholar
- many vertical search engines / databases that allow us to find articles and other documents in some specific domain of science, such as the open access systems Medline for biomedical science and related areas, and Eric for educational science and for library and information science
- more specialised search engines to find images on the WWW, such as the open access Google Image Search
- directories and databases of scholarly open access journals and articles, for instance the DOAJ
- a database and search engine to find Usenet newsgroups and articles, Google Groups,
- current awareness services that alert users when a new document has become available that corresponds well with the user profile stored with the system; for public access WWW pages, Google Alert offers this service
- a few systems that allow citation searching besides subject searching as an alternative method to find information, such a the Web of Science, Scopus and the open access Google Scholar
 
Looking into the future we can hope for systems that offer
- better guidance in formulating queries, in particular to cope with the problems associated with using human natural language to express information needs
- better automatic subject categorization of database contents and search results
- more and better visualization of database contents and search results, to guide searching
- better and more standardized APIs of search systems, so that they can be incorporated more efficiently in federated search systems
 
In view of the limited time that is available, participants can express their personal interest, so that the presentation can focus on a selection of the topics mentioned above.
 
More information is available through the WWW:
about the author:
http://www.vub.ac.be/BIBLIO/nieuwenhuysen/professional/index.html
tutorial materials already available:
http://www.vub.ac.be/BIBLIO/nieuwenhuysen/courses/chapters/

 

Back