FREE BOOKS

Author's List




PREV.   NEXT  
|<   47   48   49   50   51   52   53   54   55   56   57   58   59   60   61   >>  
kinds of reasons, the U.S. Government has over the past five years been funding research in MT, SUM, and IR (information retrieval), and is interested in starting a new program of research in Multilingual IR. This way you will be able to one day open Netscape or Explorer or the like, type in your query in (say) English, and have the engine return texts in *all* the languages of the world. You will have them clustered by subarea, summarized by cluster, and the foreign summaries translated, all the kinds of things that you would like to have." Eduard Hovy added in August 1999: "Over the past 12 months I have been contacted by a surprising number of new information technology (IT) companies and startups. Most of them plan to offer some variant of electronic commerce (online shopping, bartering, information gathering, etc.). Given the rather poor performance of current non-research level natural language processing technology (when is the last time you actually easily and accurately found a correct answer to a question to the web, without having to spend too much time sifting through irrelevant information?), this is a bit surprising. But I think everyone feels that the new developments in automated text summarization, question analysis, and so on, are going to make a significant difference. I hope so!--but the level of performance is not available yet. It seems to me that we will not get a big breakthrough, but we will get a somewhat acceptable level of performance, and then see slow but sure incremental improvement. The reason is that it is very hard to make your computer really 'understand' what you mean -- this requires us to build into the computer a network of 'concepts' and their interrelationships that (at some level) mirror those in your own mind, at least in the subjects areas of interest. The surface (word) level is not adequate -- when you type in 'capital of Switzerland', current systems have no way of knowing whether you mean 'capital city' or 'financial capital'. Yet the vast majority of people would choose the former reading, based on phrasing and on knowledge about what kinds of things one is likely to ask the web, and in what way. Several projects are now building, or proposing to build, such large 'concept' networks. This is not something one can do in two years, and not something that has a correct result. We have to develop both the network and the techniques for building it semi-automatically and se
PREV.   NEXT  
|<   47   48   49   50   51   52   53   54   55   56   57   58   59   60   61   >>  



Top keywords:

information

 

capital

 
research
 

performance

 
surprising
 

technology

 

current

 

things

 

computer

 

network


building

 

question

 

correct

 

interrelationships

 

Government

 

concepts

 

breakthrough

 

mirror

 

interest

 

surface


subjects

 

requires

 

incremental

 

improvement

 
funding
 
acceptable
 

reason

 

understand

 

concept

 

networks


reasons

 

projects

 

proposing

 

automatically

 
techniques
 
result
 

develop

 

Several

 

financial

 
knowing

Switzerland
 

systems

 
majority
 
phrasing
 
knowledge
 
reading
 

people

 

choose

 

adequate

 
retrieval