Riak claims thousands of users, including Comcast, Yammer, Voxer, Boeing, Joyent, DotCloud, GitHub and the Danish Government. The website boasts "Riak is the most powerful open-source, distributed database you'll ever put into production." New users should check out the Riak Fast Track for an introduction and quick installation guide. Operating System: Linux, OS X.
Capable of scaling from a single server to thousands of machines, BigData is a high-performance RDF database with high availability and high concurrency. Commercial support and licenses are available. Operating System: OS Independent.
Formerly a Hadoop sub-project, Hive is a data warehouse designed for easy data summarization, ad hoc queries, and the analysis of large datasets. It uses a SQL-like language known as HiveQL. Operating System: OS Independent.
Ideal for data volumes up to 50TB, InfoBright Community Edition (a.k.a. ICE) offers fast response times for ad hoc queries and industry-leading data compression. You can find commercial products based on the open source edition at InfoBright.com. Operating System: Windows, Linux.
Java-based JMagallanes offers OLAP and dynamic reporting from a variety of data sources, including SQL, Excel, XML, and others. Commercial support is available on a per-incident basis, and you can also purchase two related products: JMagallanes Datawarehouse and JMagallanes Web. Operating System: OS Independent.
Business Intelligence and Reporting Tools, a.k.a. BIRT, can add reporting features to any Java/Java EE application. Actuate leads development of BIRT and also offers commercial products based on the open source project. Operating System: OS Independent.
This Web-based reporting interface works with a variety of reporting engines, including JasperReports, JFreeReport, JXLS, and BIRT. The paid professional version adds OLAP support, dashboards, conditional scheduling and some other advanced features. Operating System: OS Independent.
Short for "Konstanz Information Miner," KNIME describes itself as "a user-friendly and comprehensive open-source data integration, processing, analysis, and exploration platform." Gartner named KNIME a "Cool Vendor" in analytics, business intelligence and performance management in 2010. The Desktop version is open source; the Professional, Team Space, Server and Cluster Execution editions require a paid subscription. Operating System: Windows, Linux, OS X.
Owned by Rapid-I, RapidMiner is the self-proclaimed "world-leading open-source system for data and text mining." It's available as a standalone solution, as a data mining engine for use with other applications, or as part of the RapidAnalytics server suite. Paid enterprise versions of the software are available. Operating System: OS Independent.
This "fruitful and fun" project aims to offer data visualization and analysis capabilities that can be used by both experienced professionals and novices. Add-ons are available for bioinformatics and text mining. Operating System: Windows, Linux, OS X.
Java-based jHepWork, or jWork, is a platform for analysis of large volumes of numbers, data mining, statistical analysis and mathematics. It includes libraries for creating data visualizations, as well as libraries for data structures and data manipulation. Operating System: OS Independent.