Selling and overselling the Semantic Web
We got on the train together. I had just finished a four-day training/consulting session with a company doing information integration for international security. She was doing a master's degree, with a thesis about Ontologies. Like a good grad student, she was a voratious reader. She had read white papers, research papers, books, web pages, magazine articles, and anything else she could get her eyes on. The more she read, the more confused she became.
It is hard to be surprised at this. It seem that just about everyone is jumping on the Semantics bandwagon. Is an Ontology a top-down way to organize all human knowledge? Or just a glorified ERD? Or a controlled vocabulary? Will it take an Ontologist to make them? Or will they be something that everyone can do, like a web page? Will Ontologies make the web come alive as a sentient, intelligent being? You can find someone who has seriously puported variants of all of these, all using the name "Ontology".
So I just sorted it all out for her before we got to Elephant and Castle.
Well, not really. There are just too many contradictions. Is the Semantic Web about a top-down organization of everything? Or a wooley free-for-all? Are vocabularies controlled or not? Is content authored or automatically generated?
But here's what I was able to offer. One story, about a web of information. A story about information sharing. A story that builds on the success of things like Wikipedia and the World Wide Web.
In my story there is no need for natural language processing. Inference plays a key role, but not an analytic one; it is just a way to connect information together. Upper ontologies are largely irrelevant, but reusable ontologies are not.
Will this technology story solve every problem? No. It will not diagnose diseases, it will not automatically index your library. It will not make your search engines obsolete.
But no technology story can do all that - the best we can hope for is a story that is coherent (it actually makes sense), feasible (it can be done with extant technology), and, perhaps most importantly, valuable (it provides some real business value). I think I have such a story - and in that story, there happens to be no need for natural language processing, upper ontologies, or highly sophisticated inferencing.
So, what is that story? I did finish the story before we got to Elephant and Castle, but that's a 45 minute ride. I can't fit the whole thing into a blog entry. But I can fit it into a book.
Yes, this whole entry was a troll for the book. Check it out.
