
Enables continuous and extremely fast analysis of massive volumes of information-in-motion to help improve business insights and decision making.

 


IBM Streams is an advanced analytic platform that allows user-developed applications to quickly ingest, analyze and correlate information as it arrives from thousands of data stream sources. The solution can handle very high data throughput rates, up to millions of events or messages per second.

IBM Streams helps you:

  • Analyze data in motion - provides sub-millisecond response times, allowing you to view information and events as they unfold.
    • Supports analysis of continuous data including text, images, audio, voice, video, web traffic, email, GPS data, financial transactions, satellite data and sensor logs.
    • Includes toolkits and accelerators for advanced analytics, including a telco event data accelerator that analyzes large volumes of streaming data from telecommunications systems in near real time and a social data accelerator for analyzing social media data.
    • Distributes portions of programs over one or more nodes of the runtime computing cluster to help achieve throughput in the millions of messages per second with latencies under a millisecond.
    • Allows you to filter large volumes of information and keep only the relevant data, helping to reduce data storage costs.
    • Scales from a single server to thousands of computer nodes based on data volumes or analytics complexity.
    • Provides security features and confidentiality for shared information.
  • Simplify development of streaming applications - uses an Eclipse-based integrated development environment (IDE).
    • Allows you to build applications with drag-and-drop operators, and dynamically add new views to running applications using data visualization capabilities such as charts and graphs.
    • Enables you to create, edit, visualize, test, debug and run Streams Processing Language (SPL) applications.
    • Provides composites capability to increase application modularity and support large or distributed application development teams.
    • Allows you to nest and aggregate data types within a single stream definition.
    • Enables applications to be built on a development cluster and moved into production without recompiling.
  • Extend the value of existing systems - integrates with your applications, and supports both structured and unstructured data sources.
    • Adapts to rapidly changing data forms and types.
    • Allows you to quickly develop new applications that can be mapped to a variety of hardware configurations.
    • Supports reuse of existing Java or C++ code, as well as Predictive Model Markup Language (PMML) models.
    • Includes a limited license for IBM BigInsights, a Hadoop-based offering for analyzing large volumes of unstructured data at rest.
    • Integrates with IBM DB2, IBM Informix, IBM PureData System, Oracle, Microsoft SQL Server, MySQL and more.

Related Products

InfoSphere BigInsights   

An enterprise-ready, Apache Hadoop-based solution for managing and analyzing massive volumes of structured and unstructured data.

IBM InfoSphere Warehouse 

Provides a comprehensive data warehouse platform that delivers access to structured and unstructured information in real time.

        

IBM PureData powered by Netezza technology

Simplifies and optimizes performance of data services for analytic applications, enabling very complex algorithms to run in minutes not days.



Next steps

Contact us today to learn how IBM InfoSphere Streams can help your company to maximize performance - you can complete the form or call us at 877-454-4898, and we would be delighted to consult with you and make specific recommendations.

IBM Streams 4.1

IBM Streams v4.1 includes new features designed to address the important issues companies are facing. It integrates with open source technologies such as Spark and reduces time-to-value by supporting the Java programming language. It also addresses corporate governance concerns through data lineage and data governance.

With IBM Streams v4.1, you can:

  • Easily integrate data with Apache Spark applications and analytics
  • Create data lineage and use flexible schemas for easy ingestion of data
  • Take advantage of automatic schema discovery and mapping via integration with IBM InfoSphere Data Governance Catalog
  • Create IBM Streams applications in Java in addition to IBM Streams Processing Language

Details of the new capabilities in IBM Streams

Faster streaming application delivery

To speed things up, IBM Streams v4.1 has added the ability for developers to create Streams applications in Java. Since Java has widespread usage, the ability to use it to create applications helps reduce the learning curve for many developers and thus accelerate the rate at which applications can be produced. In fact, a developer with no prior knowledge of Streams can create Streams applications in under an hour using Java APIs for streaming analytic libraries such as natural language processing, spatial, temporal, acoustic, image recognition and more.
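As a rough illustration of the kind of per-event logic a Java developer would bring to a streaming application, the sketch below filters a batch of events and keeps only the relevant ones. It uses only the standard Java library, not the IBM Streams Java API; the `Reading` type, the `keepRelevant` method and the threshold value are hypothetical names chosen for this example.

```java
import java.util.List;
import java.util.stream.Collectors;

public class FilterSketch {

    // Hypothetical event type: one sensor reading (not an IBM Streams class).
    static final class Reading {
        final String sensorId;
        final double value;

        Reading(String sensorId, double value) {
            this.sensorId = sensorId;
            this.value = value;
        }

        @Override
        public String toString() {
            return sensorId + "=" + value;
        }
    }

    // Keep only the readings whose value is strictly above the threshold.
    static List<Reading> keepRelevant(List<Reading> incoming, double threshold) {
        return incoming.stream()
                .filter(r -> r.value > threshold)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Assumed sample data, for illustration only.
        List<Reading> incoming = List.of(
                new Reading("a", 12.0),
                new Reading("b", 97.5),
                new Reading("c", 40.1));

        // Print the readings that pass the filter.
        keepRelevant(incoming, 50.0).forEach(System.out::println);
    }
}
```

In a real streaming application the same predicate would be applied to each event as it arrives rather than to a finished list; the list here simply stands in for the incoming data.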

More intelligent applications

IBM Streams v4.1 increases application intelligence by integrating open source technologies such as Spark and Hadoop through Java APIs. This means that data streams are captured efficiently and used alongside data at rest (Hadoop, databases, and more). In addition, Spark and IBM Streams complement each other: Spark works well for data at rest, while IBM Streams excels at event-driven, low-latency applications. IBM Streams also offers the broadest range of machine learning options, adding Spark and MLlib to the existing Streams Native Machine Learning, SPSS, R and PMML support, making even more sophisticated analytics possible.

More confidence in data stream insights

Many executives still rely on their gut, which reveals a lack of confidence in data-driven insights. Strong governance can counter these doubts and help meet corporate mandates. IBM Streams v4.1 introduces the creation of data lineage and the use of flexible schemas for easier data ingestion. It also enables automatic schema discovery and mapping through integration with IBM InfoSphere Data Governance Catalog. The added assurance of data quality and reliability can be the difference between a gut-check decision and one backed by defensible insight.
