Unstructued data – how to benefit or what has “plunge and squish” got to do with it all anyway?

Back in 1992 David Gelernter wrote a wonderful book entitled “Mirrors Worlds”. Gelernter’s work influenced the Sun team and the creation of JavaSpaces (technology that I based my first start-up on).

In this book Gelernter talks about “plunge and squish”. Imagine you have a huge vat of data that from a distance appears homogeneous but upon close inspection is a massively heterogeneous, some structured, some unstructured. You wish to know what data in this vat has a strong relationship to something or other, a particular thought, question or subject that you have in mind for instance. How do you find this related data? Gelernter imagined a system that allowed you to plunge your “something” into the vat of data and see the related data attracted or squished towards your “something” allowing you then to extract that related data to find the answer to your question. This is all very theoretical but still important in that to find a solution to a problem it helps to imagine what your solution looks or feels like. Gelernter didn’t stop there but went on to create Scopeware which was the equivalent of Google Desktop however launched well before Google’s product. Scopeware then disappeared from view (purchased by some group and the technology integrated into their product set) being faced with the competition from Google’s and Microsoft’s products in this area.

Scopeware was a first step in the road-map to solving the problem of data mining in that data was pre-searched during computer downtime and the user then used keyword searches to search across all document types desired. This is a first generation solution with the pre-search facility clearly helping in speeding up the searches. A second generation solution to this problem has not yet, as far as I am aware, been launched but I believe it is now worth investigating. BTW if anyone feels that second gen solutions are here already then do let me know.

One area that I am interested in and have been brainstorming with a friend of mine is in the area of knowledge management. How sensible is it to spend time researching a topic when someone maybe only two-steps down the office has done this research and has done so from within a similar context to your own? It is only sensible if the person is not able to be found by you simply and rapidly. If an enterprise system could quickly and efficiently find the person within the enterprise with the some or all of the knowledge that you are seeking to find this would potentially be of significant value. The identification of the person is the key to finding the information. Sometimes simple answers to simple questions are the most tricky to solve.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: