File type detection
NOTE: Since this article was written, updates have been made to the MagicMimeTypeIdentifier in the Aperture Framework. Read more... Problem Description: My current project, vyasa, is a digital library...
Character encoding detection
Problem: Documents that are stored as plain text, such as XML or XHTML, often have a particular character encoding, also known as a character set or codepage. This character encoding allows applications...
Information Scarcity to Information Overload
"The focus in enterprise content management (ECM) is shifting from ending information scarcity to dealing with information overload. This dynamic explains why the disparate technologies of search,...
Hakia: A New Google?
"...the triumph of a semantic search technology over the irrelevance of syntactic search illustrates why companies like Hakia will garner more attention in 2007." An article on Line56.com discusses the...
Intellisophic Achieves Patent Milestone on Document Indexing System & Methods
Intellisophic Inc., a leading provider of information products to the search and text mining industry, announced that it has been granted allowance by the U.S. Patent and Trademark Office on the...
File type detection follow up
earl.strain.at ran the test files used in my file type detection article against file, "the open source implementation of the file(1) command that ships with every free operating system." Read about his...
Centiare on the Heels of Wikipedia
"semantic web" technology is installed on Centiare. This means registered users can perform amazing searches on Centiare that just wouldn’t be possible on Google, MySpace, or Wikipedia. Imagine...
File Type Metadata Discovery, Part 1: Audio
In a previous article, I evaluated various libraries to determine which most accurately identified a file's type. This article represents part one in a series of articles that explore how to discover...
Flexible Taxonomies
In Web Analytics And Content Group Management, Gary Angel writes about the use of taxonomies in web analytics, stating that "no single taxonomy is likely to support a very wide range of analytic...
Free Music Identification and Metadata Service
Yesterday I briefly mentioned that there were services that could help you identify non-technical audio metadata. I just became aware of a press release from MusicIP that announces their free music...
The Future of Enterprise Search
Dana Gardner of Interarbor Solutions recently interviewed members of FAST Search & Transfer. Their discussion brought up several interesting topics about semantic, search-centered applications. Dr....
Wikiseek
A newly launched service called Wikiseek focuses on complementing Wikipedia by restricting search results to articles and references in the encyclopedia. Wikiseek's about page claims that this method...
The Future of Semantic Search
Steven Arnold recently stated some facts that are very closely related to my project and research interests: ...what will carry us into 2007 is a collection of technologies we think of as text mining,...
File Type Metadata Discovery, Part 2: Images
In a previous article, I evaluated various libraries to determine which most accurately identified a file's type. This article represents part two in a...
Bauhaus-Universität Weimar
I recently stumbled upon Bauhaus-Universität Weimar (English), a university for creative studies in Weimar, Germany. The university conducts research in Web Technology and Information Systems. The site...
Visualization Links
Swivel is a Web site for curious people to explore data. They "use farms of powerful computers and algorithms ... to transform a lonely grid of numbers and letters into hundreds - sometimes thousands -...
Microsoft Photo Info
Microsoft Photo Info is a new software add-in for Microsoft Windows that allows photographers to add, change and delete common "metadata" properties for digital photographs from inside Windows Explorer.
MagicMimeTypeIdentifier update
MagicMimeTypeIdentifier, which scored so highly on my comparison of Java file type detectors, has been updated. Here is the email I received from Christian Fluit: I have just updated Aperture's...