
IBM to Enhance Open Source Knowledge Discovery


IBM today announced it has made new open source technology available that seeks to enhance knowledge discovery capabilities across multiple industries and applications and provide developers with tools to support a new breed of software for the analysis of information.

The following details were provided to ebizQ:

The company has completed the first step of making the Unstructured Information Management Architecture (UIMA) available to the open source community by publishing the UIMA source code to SourceForge.net, the world’s largest open source development site.

UIMA is an open software framework already in use by industry and academia to collaborate on the creation, development and deployment of technologies for discovering the vital knowledge present in the fastest growing sources of information today – unstructured content in the enterprise and across the Web, including documents, images, comment and note fields, e-mail and even rich media like video and audio. New technologies built using UIMA will help unlock the value in organizations’ content assets. Later this year, IBM intends to move this project to a full open source community development model.

“Companies want to get value from all of their information, but no single vendor can address all of the search, text analytics and business insight needs across all types of information and for all industries,” said Nelson Mattos, vice president, Information and Interaction, IBM Research. “We are making UIMA available to the open source community to encourage innovation and allow analytics software tools from multiple sources to work together and build upon each other.”

Since IBM unveiled UIMA in December 2004, an active ecosystem of partners, customers and open source developers has accelerated innovation and solution delivery around UIMA.

The Mayo Clinic also adopted the UIMA framework early in its development cycle as part of its broader collaboration with IBM in the area of unstructured text processing. Mayo Clinic used UIMA as the basis for implementing a system to extract knowledge from its approximately 20 million clinical notes. This provided the flexibility to combine a series of annotators from Mayo Clinic, IBM and the open source community in a plug-and-play fashion to rapidly create a powerful analytic solution with advanced capabilities.

Memorial Sloan-Kettering Cancer Center is working with IBM to develop a Web-accessible data warehouse that will conform to HIPAA requirements. This data warehouse will enable clinicians and researchers from Memorial Sloan-Kettering Cancer Center to use data efficiently, facilitating research on a new cancer taxonomy. An important aspect of the data warehouse is the inclusion of searchable concepts from Memorial Sloan-Kettering Cancer Center's text-based pathology reports. These concepts are automatically extracted by an IBM text analytics solution built on the UIMA framework.

Adding to the growing UIMA ecosystem, the General Architecture for Text Engineering (GATE - gate.ac.uk) team at the University of Sheffield recently announced the delivery of an interoperability layer with UIMA. This new layer gives GATE users access to UIMA’s flexible deployment options and gives UIMA users access to the many useful plug-ins already available in GATE for text mining, information extraction and natural language processing for research and commercial use.

UIMA has also received significant support from the Defense Advanced Research Projects Agency (DARPA) and is currently in use as part of DARPA’s new human language technology research and development program called GALE (Global Autonomous Language Exploitation). The GALE Program is a five-year project involving industry leaders and 24 universities with the goal of developing and applying software technologies to absorb, analyze and interpret huge volumes of speech and text in multiple languages. UIMA has been adopted as the underlying integrating architecture for building large-scale multimodal unstructured information management applications.

In addition, several of the software vendors that previously announced plans to support UIMA have already made their first UIMA-compliant solutions available, including ClearForest, Cognos, Factiva and Nstein.
