Sunday, May 9, 2021

Supersize Me: Hadoop Upgrade Will Handle Even Bigger Data

InfoWorld: According to Hortonworks, the next version of the Apache Foundation’s Hadoop big data processing tool will support even more data and offer faster performance. Version 0.23 of the open source tool, currently an alpha release, will eventually run across 6,000-machine clusters, each with 16 or more cores, and process 10,000 concurrent jobs. The next version of Hadoop will also offer federation and high availability for the Hadoop Distributed File System (HDFS). In addition, it will incorporate the “Yarn” upgrades for MapReduce.

The Hadoop update should become available as a general release later this year.

Similar articles

Latest Articles

Top 10 Professional Services...

These are some of the best PSA tools for organizations of all sizes. What Is Professional Services Automation Software? Professional services automation (PSA) software aims to...

What is Data Aggregation?

Data aggregation is the process where raw data is gathered and presented in a summarized format for statistical analysis. The data may be gathered...

Dell APEX: Our...

One of the missteps IBM made last century was collapsing their sales model, which was services based, to generate a short-term revenue spike. Up...

Companies that Scaled Technology...

NEW YORK — Companies that “doubled down” on their investment in mostly data-heavy technology during the COVID-19 pandemic have seen their revenue grow five...