Jump to content

Gerard Salton

Gerard Salton (1927-1995) was a German-American computer scientist and information retrieval specialist.

Life

Gerard A. "Gerry" Salton was born on March 8, 1927 in Nuremberg, Germany and named Gerhard Anton Sahlmann. He came to the United States in 1947 and was naturalized in 1952. He received Bachelor's (1950) and Master's (1952) degrees in mathematics from Brooklyn College in New York city. He earned a Ph.D. from Harvard in applied mathematics in 1958 and taught there until 1965 when he joined Cornell University and co-founded its department of Computer Science. He died on August 28, 1995 in Ithaca, NY.

Contributions

Salton's team at Cornell developed the influential SMART (System for the Mechanical Analysis and Retrieval of Text) which he had initiated when he was at Harvard. SMART was an environment for evaluating retrieval systems.

SMART popularized the vector space model for weighted coordinate indexing using the words in the text of each document in a collection, excluding a stop list of very common or insignificant words. Each word is treated as a dimension a very highly dimensional graph. Each document in the collection is a treated as a vector with its position on each word dimension determined by the relative frequency of that word in that document. A query is expressed as a comparable graph and then each document's relevance to it is assessed by its graph's proximity to the query graph. [1]

Relative frequency was commonly refined by using term frequency–inverse document frequency (td-idf) whereby the relative frequency of a word in a document is weighted by the inverse relative frequency its presence in the collection as a whole so that rare words count for more than common words.[2]

Salton was editor-in-chief of the Communications of the ACM and the Journal of the ACM (1969 – 1972). He was an associate editor of the ACM Transactions on Information Systems and chaired the ACM Special Interest Group on Information Retrieval (SIGIR).

Publications

Many publications by Salton are listed by DBLP [3] and Google scholar [4].

  • Automatic Information Organization and Retrieval. New York: McGraw-Hill, [1968].
  • The SMART retrieval system; experiments in automatic document processing. Comp. by G. Salton. Englewood Cliffs, N.J., Prentice-Hall [1971].
  • "A vector space model for automatic indexing." With Anita Wong & Chung-Shu Yang. Communications of the ACM 18, no. 11 (1975): 613-620. [5]
  • "Toward a dynamic library." In: F. Wilfrid Lancaster, ed. The Role of the Library in an Electronic Society: Clinic on Library Applications of Data Processing. Urbana-Champaign, IL: University of Illinois Graduate School of Library Science, 1980.
  • "Extended boolean information retrieval." With E. A. Fox & H. Wu. Communications of the ACM 26, no. 11 (1983): 1022-1036. [6]
  • Introduction to modern information retrieval. With Michael J. McGill. New York: McGraw-Hill, 1983.
  • "Term-weighting approaches in automatic text retrieval." With C. Buckley. Information processing & management 24, no. 5 (1988): 513-523. [7]
  • Automatic text processing: the transformation, analysis, and retrieval of information by computer. Reading, MA: Addison-Wesley Publishing Company, 1989.
  • "Improving retrieval performance by relevance feedback." With Chris Buckley. Journal of the American society for information science 41, no. 4 (1990): 288-297. [8]

Offices

Awards

  • Guggenheim Fellowship, 1962.
  • American Society for Information Science. Best JASIS Paper Award, 1970; best book award, 1975; Award of Merit, 1989.
  • Alexander von Humboldt Foundation. Senior Science Award, 1988.
  • Association for Computing Machinery. Fellow, 1995.
  • The Association for Computing Machinery Special Interest Group on Information Retrieval (ACM SIGIR) confers the Gerard Salton Award every three years for contributions to information retrieval". Salton ws the first recipient, 1983.

Further reading

  • "Gerard Salton." Wikipedia [9]
  • Dubin, David. "The Most Influential Paper Gerard Salton Never Wrote." Library Trends 52, no 4 (Spring 2004): 748–764. [10] Revisionist history of the vector space model and Salton's work.
  • "The father of information retrieval." p 25. [11]
  • Evslin, Tom. "Search Down Memory Lane." Fractals of change (Jan 19, 2006) [12]
  • Stock, Wolfgang G. & Mechtild Stock. In Handbook of information science. Berlin ; Boston : De Gruyter Saur, [2013], pp 289-300, "E.2 Vector Space Model."

Papers

  • Cornell University Library. Division of Rare and Manuscript Collections: 16-13-2908. Gerard Salton papers. [13]