< Return to Big Data Analytics page

Enables continuous and extremely fast analysis of massive volumes of information-in-motion to help improve business insights and decision making.

 

  • Highlights
  • What's New
  • Resources

IBM Streams is an advanced analytic platform that allows user-developed applications to quickly ingest, analyze and correlate information as it arrives from thousands of data stream sources. The solution can handle very high data throughput rates, up to millions of events or messages per second.

IBM Streams helps you:

  • Analyze data in motion - provides sub-millisecond response times, allowing you to view information and events as they unfold.
    • Supports analysis of continuous data including text, images, audio, voice, video, web traffic, email, GPS data, financial transactions, satellite data and sensor logs.
    • Includes toolkits and accelerators for advanced analytics, including a telco event data accelerator that analyzes large volumes of streaming data from telecommunications systems in near real time and a social data accelerator for analyzing social media data.
    • Distributes portions of programs over one or more nodes of the runtime computing cluster to help achieve volumes in the millions of messages per second with velocities of under a millisecond.
    • Allows you to filter and extract only relevant data from unimportant volumes of information to help reduce data storage costs.
    • Scales from a single server to thousands of computer nodes based on data volumes or analytics complexity.
    • Provides security features and confidentiality for shared information.
  • Simplify development of streaming applications - uses an Eclipse-based integrated development environment (IDE).
    • Allows you to build applications with drag operators, and dynamically add new views to running applications using data visualization capabilites such as charts and graphs.
    • Enables you to create, edit, visualize, test, debug and run Streams Processing Language (SPL) applications.
    • Provides composites capability to increase application modularity and support large or distributed application development teams.
    • Allows you to nest and aggregate data types within a single stream definition.
    • Enables applications to be built on a development cluster and moved into production without recompiling.
  • Extend the value of existing systems - integrates with your applications, and supports both structured and unstructured data sources.
    • Adapts to rapidly changing data forms and types.
    • Allows you to quickly develop new applications that can be mapped to a variety of hardware configurations.
    • Supports reuse of existing Java or C++ code, as well as Predictive Model Markup Language (PMML) models.
    • Includes a limited license for IBM BigInsights, a Hadoop-based offering for analyzing large volumes of unstructured data at rest.
    • Integrates with IBM DB2, IBM Informix, IBM PureData System Oracle, Microsoft SQLServer and MySQL, and more.

Related Products

InfoSphere BigInsights   

An enterprise-ready, Apache Hadoop-based solution for managing and analyzing massive volumes of structured and unstructured data.

IBM InfoSphere Warehouse 

Provides a comprehensive data warehouse platform that delivers access to structured and unstructured information in real time.

        

IBM PureData powered by Netezza technology

Simplifies and optimizes performance of data services for analytic applications, enabling very complex algorithms to run in minutes not days.



Next steps

Contact us today to learn how IBM InfoSphere Streams can help your company to maximize performance - you can complete the form or call us at 877-454-4898, and we would be delighted to consult with you and make specific recommendations.

IBM Streams 4.2

With IBM Streams, organizations can see events and trends as they are happening rather than react to them after they have passed. Thanks to its ability to bring meaning to unwieldy bodies of fast-moving data streams, the technology has already achieved success across a wide spectrum of industries. Any organization that requires immediate, accurate analysis and business decisions based on up-tothe-minute information can benefit from Streams.

IBM Streams V4.2 is poised for continued growth with new Python and Rules developer support, Internet of Things (IoT) Edge analytics, and Hyperstate Accelerator. Streams V4.2 offers the following benefits:

  • Developers and data scientists can use Python to call the extensive Streams analytic libraries along with thousands of Python libraries to create real-time analytic processing applications that run on Streams.
  • Based on industry-leading capabilities from IBM Operational Decision Manager, Streams developers can now create rules that are compiled to run natively on Streams for superior runtime performance.
  • New support for Apache Edgent enables Streams developers to create federated applications to optimize computing for IoT applications with Edgent at the edge and Streams for central analytics. Watson IoT Platform on Bluemix can be used as a management service.
  • With a high-speed, Hyperstate Accelerator capability, Streams natively supports state management and Consistent Region support for processing at very high speed. Consistent Region is a Streams capability that assures processing at very high speed.
  • The speech-to-text toolkit enables developers to create applications that ingest voice, convert it to text, and then with Text Toolkit perform natural language processing applications.
  • Interhost encryption enables you to automatically encrypt all data between hosts for additional security for streaming applications
  • Nested parallelism enhances the existing ability to create parallel processing by simply adding an annotation to the code. Developers can now add a nested region inside of another nested region.
  • Administrative capabilities for serviceability improvements include single sign-on and Kerberos authentication.
  • Submission time fusion and asynchronous check pointing help further extend Streams performance advantages over alternative stream computing platforms.

Enterprise integration: Extending the power of Streams

Integration with other products gives organizations access to all data and systems. These products offer the following benefits:

  • Apache Edgent (previously called Apache Quarks): Create federated analytics between Streams and devices and gateways at the edge of networks running Apache Edgent. Communications can be performed natively with Streams or using a management service like Watson IoT Platform on Bluemix.
  • Watson IoT Platform: Use IoT device management capabilities of the IoT platform, and subscribe to device data to provide advanced analytics using Streams.
  • BigInsights: Store streaming data in an enterprise-class Hadoop environment for additional analysis or historical retention. Streams can land data into BigInsights in many formats, including BigSQL, HDFS, Hive, HBase, Parquet, and Avro.
  • IBM InfoSphere Governance Catalog: Speed development by dragging and dropping data definitions onto Streams Studio to generate schema. Streams can also feed lineage data into Governance Catalog
  • IBM Watson Explorer: Visualize Streams data in the Watson Explorer CXO dashboard and add streaming data to the Watson Explorer index.
  • Microsoft Excel: Analyze and visualize streaming data in Microsoft Excel worksheets. Business users can display streaming data directly in Excel by dragging and dropping onto a worksheet. The streaming data continually updates in the worksheet and can then be analyzed and visually represented through available features in Excel. Streams performance advantages over alternative stream computing platforms.

Product Positioning

IBM Streams offers an ultra-high performance analytics solution for real-time analytic processing (RTAP). RTAP extends online analytic processing (OLAP) by analyzing data-in-motion instead of data-at-rest. This enables higher volume and lower-latency analysis. Streams represents a new evolution in the business intelligence and complex event processing (CEP) markets. It is characterized by incredible throughput rates of disparate data types, including structured and unstructured information that must be processed with millisecond or microsecond latencies using complex correlations and powerful analytics.

A key difference between Streams and other business intelligence offerings, such as OLAP, is that the latter requires data to be at rest before running analytics. However, Streams analyzes data in-motion, a faster process, because disk storage is not required. This can lead to ultra-low latencies as compared to existing technologies. Streams can manage data or data subsets as the business demands.

In addition, while OLAP offerings are traditionally limited to supporting only structured data, Streams technology can support both structured and unstructured data. While being radically different from existing business intelligence solutions, Streams can efficiently extend and add value to existing business intelligence offerings. For example, Streams can reuse and continually refine existing analytics expressed in Predictive Model Markup Language (PMML) standards.

Streams comes standard with several real-time analytic toolkits to help provide quicker time to value. These include telecommunications event data, time series, text, messaging, database, geospatial, and more. Many of these toolkits are part of the Streams Open Source Project. Streams serves the real-time analytics market. It can read in and process much higher discrete event throughput rates compared to other market offerings. Streams analyzes continuous streams of information such as audio feeds from hydrophones or video feeds from television. Furthermore, Streams can process unstructured data such as audio, video, text, and EKG/EEG waveforms.

Next steps

Contact us today to learn how IBM InfoSphere Streams can help your company to maximize performance - you can complete the form or call us at 877-454-4898, and we would be delighted to consult with you and make specific recommendations.