As we have seen, mining for information is an important part of discovering new scientific and medical knowledge. This can take the form of data mining, which finds patterns in already processed, structured data, or text mining, which extracts information from unstructured text. Text mining can seem intimidating at times, so it was good to see the process broken down into steps. Working through the steps in order (lexical, syntactic, semantic, discourse) makes the process seem much more manageable. Still, text mining poses many challenges, such as validating the patterns that are found and processing an overwhelming amount of information. Even when patterns are found, proper validation and analysis are needed to confirm that a finding is legitimate.
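As a rough illustration of the first (lexical) step only, here is a minimal Python sketch. It assumes a very simple regex tokenizer and a made-up sample sentence; the function names `tokenize` and `term_counts` are hypothetical, and real text mining tools handle this step in far more sophisticated ways.

```python
import re
from collections import Counter


def tokenize(text):
    """Lexical step (simplified): lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def term_counts(text):
    """Count how often each token appears in the text."""
    return Counter(tokenize(text))


# Made-up example sentence, used purely for illustration.
sample = "Aspirin reduces fever. Aspirin also reduces pain."
counts = term_counts(sample)
print(counts.most_common(2))
```

Even a count like this is only a candidate pattern. It says nothing about whether the pattern is meaningful, which is why the validation step matters.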
Posted by Annie
Friday, October 30, 2009