W. Lam
Massive amount of information is stored in the form of texts. They can be in the form of unrestricted natural language and in different domains. Some texts are in semi-structured form such as Web pages. This project aims at developing new models for discovering new, previously unknown information that is useful for human or for further construction of intelligent systems. Techniques drawn from machine learning, natural language processing, and information retrieval are investigated.