The semantic web could be the key to unlocking scientific data that's sequestered by disparate applications' formats and organisational limitations, and could allow scientists to harness computation’s full power, World Wide Web inventor Tim Berners-Lee said Tuesday.
The semantic web "will give scientists and other users unexpected help and serendipitous added value from others' data," Berners-Lee, director of the World Wide Web Consortium (W3C), said at the Fourth Annual Bio-IT World Conference and Expo in Boston.
The semantic web seeks to make it easier for data on the web to be shared and reused by people and applications.
The semantic web is based on the W3C's Resource Description Framework (RDF), which describes data as statements that can be written in XML (Extensible Markup Language) and exchanged between applications. Documents and information in databases on the semantic web have to be published in a machine-processable form, creating a kind of global database.
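As a rough illustration of what "machine-processable" data forming a "global database" can mean, the sketch below models RDF-style statements as subject-predicate-object triples in plain Python. It is not taken from the talk and does not use a real RDF library; the `example.org` URIs, the two "lab" datasets, and the helper names `match` and `diseases_for_compound` are all hypothetical.

```python
from typing import Iterator, Optional, Set, Tuple

# An RDF-style statement: (subject, predicate, object), each named by a URI.
Triple = Tuple[str, str, str]

# Hypothetical namespace; real data would use the publishers' own URIs.
EX = "http://example.org/"

# Two independently published datasets (both invented for illustration).
lab_a: Set[Triple] = {
    (EX + "compound/42", EX + "inhibits", EX + "protein/P1"),
}
lab_b: Set[Triple] = {
    (EX + "protein/P1", EX + "associatedWith", EX + "disease/D7"),
}


def match(graph: Set[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> Iterator[Triple]:
    """Yield every triple that fits the pattern; None acts as a wildcard."""
    for triple in graph:
        if ((s is None or triple[0] == s)
                and (p is None or triple[1] == p)
                and (o is None or triple[2] == o)):
            yield triple


def diseases_for_compound(graph: Set[Triple], compound: str) -> Set[str]:
    """Follow compound -> protein -> disease links across the graph."""
    found: Set[str] = set()
    for _, _, protein in match(graph, s=compound, p=EX + "inhibits"):
        for _, _, disease in match(graph, s=protein, p=EX + "associatedWith"):
            found.add(disease)
    return found


# Because both sources name the protein with the same URI, merging them
# is a plain set union, and the combined graph answers a question that
# neither source can answer alone.
merged = lab_a | lab_b
```

Calling `diseases_for_compound(merged, EX + "compound/42")` finds the disease link, while the same call against `lab_a` alone finds nothing. That is one small way to picture the added value from others' data that Berners-Lee describes.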
Life scientists in particular could find the semantic web a useful tool, and in so doing, "provide leadership to lots of other fields" in implementing this next-generation web technology, Berners-Lee said.
"At the moment, I see a huge amount of energy from people in life sciences, getting excited by the semantic web and what it can do to solve the big-idea problems," he said.
Berners-Lee, who invented key components of the World Wide Web such as HTTP (Hypertext Transfer Protocol) and HTML (Hypertext Markup Language) in the late 1980s, has long envisioned an extension of the organic, unstructured web. The W3C launched its first semantic web projects in the late 1990s, adding metadata to web pages.
Berners-Lee hopes that life sciences will drive adoption of the semantic web, just as high-energy physics drove the early web.
"Maybe we will meet a critical mass in a certain area. The web, for example, took off in high-energy physics. When we got six high-energy physics websites, then it got interesting for physicists to be onboard," he said.
"Similarly, if we get a half a dozen or a dozen set of ontologies, the core ones for drug discovery out there, then suddenly the semantic web within life sciences would have a critical mass. It’ll snowball much more rapidly and it will be copied. Other areas will realise: Oh it’s worth investing in this," Berners-Lee said.
Life sciences are particularly suitable for pioneering the semantic web, Berners-Lee said. For example, within drug discovery, many databases and information systems used by drug researchers are already in, or are ready to be transformed to, machine-readable formats.
Berners-Lee does not promise a quick return on investment for those formatting their data to suit the semantic web, and he admits that the concept is “quite difficult to explain.” However, he experienced the same problem trying to explain the World Wide Web 15 years ago.
"'Hypertext pages; big deal!' people said. They couldn’t realise how they would be able to link to potentially anything and what that would mean," he said.
Asked when the semantic web will take off, Berners-Lee said: "You tell me. I spend all my energy just telling people what I would like to see happen. What I think will happen is much more dangerous."