CIL2008: Text Mining and Visualization of Open Sources

Text Mining and Visualization of Open Sources
Patrice Slert
We’re talking about structured data from open sources (Web of Science, Dialog, Silobreaker, the Internet), not necessarily free sources. This is in contrast to intelligence data, where a lot of the technologies have applications, as well.
Visualization can mislead you in terms of cause and effect. It can also lead to false similarities (such as New England and England being presented as the same place).
Open Source Information (OSI) is growing. Intelligence community is recognizing the value of librarians in searching the open source information space.
ISI Web of Knowledge includes visualization and text mining capabilities. However, limited to databases provided through ISI. To mix and match with data available through other vendors, need to use other products, such as VantagePoint. VantagePoint allows you to create filters for importing data from various sources.
SiloBreaker — a news analysis tool, commercially available. It lets you mine for information via word searches, visual searches, people, organizations, industries — ways of pulling together relationships among these facets. It pulls out networks of people, as reported in news reports. It’s provides a way to look at the news and see who is appearing in news articles about the subject. You can expand your search — or refocus it — by diving deeper into related people, organizations, companies, etc.