TrendWatch Blog

JBoss DNA: using JCR to make metadata behave

22-Mar-2008

The JBoss folks have launched an ambitious new open-source project that is so breathtaking in scope that it defies easy categorization, even though (ironically) it is largely about categorization.

JBoss DNA is (according to the project website) "a repository and set of tools that make it easy to capture, version, analyze, and understand the fundamental building blocks of information." Notably, the key enabling technology for the project is the Java Content Repository specification (JSR-283).

The project description goes on to say: "As models, service and process definitions, schemas, source code, and other artifacts are added to the repository, JBoss DNA 'sequences' the makeup of these components and extracts their structure and interdependencies. Users can then search, analyze, visualize, report, and modify the repository's content using the terminology and structures they are familiar with. Such domain-specific solutions can be created with little or no programming. Sharing this information is possible through Eclipse plugins, web applications, and REST servers."

If you're still not getting it, there is a useful slide show on the JBoss Labs site. The system is transactional, event-driven, and rules-based, and (of course) it leverages a long list of well-known open-source building blocks and industry standards. Basically, what it does (if I understand it right) is allow you to discover and manage dependencies and semantic relationships between bits of info that most of us would otherwise call metadata.

A core primitive in the JBoss DNA system is the Sequencer, which is essentially a custom event handler that fires when you insert a content item into the repository. It executes rules (which you write in a domain-specific rules language) against the item in question, to extract atomic bits of information about it. In other words, a sequencer does autoextraction of metadata. (Why don't they just say that? Why the cutesy bioengineering lingo?) According to the project's leaders, sequencers are planned for .zip archives, Java bytecode, WSDL, UML, and database DDLs, among other targets.
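To make the sequencer idea concrete, here is a minimal sketch in Python of an insert-triggered metadata extractor for .zip archives. Everything in it is an assumption made for illustration: `ToyRepository`, `register`, `insert`, and `zip_sequencer` are hypothetical names, not the JBoss DNA API, and the extraction logic is a plain function rather than rules written in a rules language.

```python
import io
import zipfile
from dataclasses import dataclass, field
from typing import Callable, Dict

# Hypothetical type: a sequencer takes raw content and returns metadata.
Sequencer = Callable[[bytes], Dict[str, object]]


@dataclass
class ToyRepository:
    """In-memory stand-in for a content repository (illustration only)."""
    sequencers: Dict[str, Sequencer] = field(default_factory=dict)
    nodes: Dict[str, Dict[str, object]] = field(default_factory=dict)

    def register(self, suffix: str, sequencer: Sequencer) -> None:
        """Associate a sequencer with a file-name suffix such as '.zip'."""
        self.sequencers[suffix] = sequencer

    def insert(self, path: str, content: bytes) -> None:
        """Run every sequencer whose suffix matches, then store the item."""
        metadata: Dict[str, object] = {}
        for suffix, sequencer in self.sequencers.items():
            if path.endswith(suffix):
                metadata.update(sequencer(content))
        self.nodes[path] = {"content": content, "metadata": metadata}


def zip_sequencer(content: bytes) -> Dict[str, object]:
    """Extract entry names and total uncompressed size from a .zip archive."""
    with zipfile.ZipFile(io.BytesIO(content)) as archive:
        infos = archive.infolist()
        return {
            "entries": [info.filename for info in infos],
            "uncompressed_bytes": sum(info.file_size for info in infos),
        }


def make_sample_zip() -> bytes:
    """Build a small archive in memory so the example needs no files on disk."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("README.txt", "hello")
        archive.writestr("src/Main.java", "class Main {}")
    return buffer.getvalue()


repo = ToyRepository()
repo.register(".zip", zip_sequencer)
repo.insert("/uploads/sample.zip", make_sample_zip())
print(repo.nodes["/uploads/sample.zip"]["metadata"])
```

The sketch dispatches on file suffix only to stay short. A real system would more plausibly match on content type or on configurable path rules, and would write the extracted values back into the repository as nodes and properties.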

Autoextraction of metadata is a noble goal, of course. In fact it is becoming a key capability in many corners of the content-management world (DAM in particular). But there are problems with the JBoss DNA vision, not least of which is the fact that metadata extraction is a notoriously tricky business (and inferring taxonomic relationships gets even trickier). Visualizing this kind of information is also challenging (ask any Edward Tufte fan), and it is a subject on which JBoss DNA is silent.

But the greater issue with the JBoss DNA project is that the problem space, as envisioned by the project's creators, is hopelessly broad (King Kong could not get his arms around it). The DNA reference architecture is also bewilderingly baroque, encompassing federation of repositories, a Publishing Server that implements the Atom Publishing Protocol, WebDAV support, pluggable analytics, connectors of all kinds, and scads more. The number of moving parts is large and the footprint will doubtless be massive. The factoring is distinctly J2EE circa 2005, in the most obnoxious sense. If there is one thing IT departments don't need at this point, it's yet another kitchen-sink Java EE architecture to deal with.

Earlier, I referred to JBoss DNA as ambitious. Perhaps audacious is a better word. To be sure, many sublime achievements in this world began as audacious dreams. But it is true, also, that audacious endeavors sometimes (maybe most of the time) end up as giant, smoke-filled craters; and JBoss DNA, for all its noble goals, already seems in danger of following that trajectory.

- Submitted by: Kas Thomas, Analyst
