Extracting information from content not as easy for computers.
Making sense of Web content is mostly easy for humans but rarely easy for computers. Part of the issue is that recognizing the “interesting” parts of online content involves what is mostly unstructured data, making the task very difficult.
For example, consider text that you, a human, have no problem understanding: “The man, who is 42, was charged with arson”. You immediately understand what the age of the arsonist is. Not so for computer programs which would need a huge number of rules to interpret that sentence and discover the man’s age. Even then anyone of the thousands of possible minor variants such as “The man, 42, was charged with arson” would most likely cause the same program to make a mistake.
Continues @ http://www.networkworld.com
Related articles by Zemanta
- Document and Records Management: Controlling Information Risk and Aiding Compliance (digitalassetmanagement.org.uk)
- Information Management: The Tides They Are A-Turnin’? (digitalassetmanagement.org.uk)