2805 Bowers Ave, Santa Clara, CA 95051 | 408-730-2275
sales@colfax-intl.com
My Colfax  


Colfax Hadoop Cluster for Next-Gen Analytics

 
 

Resources

Download White Paper: Deploying Apache™ Hadoop® with Colfax and Mellanox VPI Solutions

Download White Paper: Extract, Transform, and Load Big Data with Apache Hadoop

Download White Paper: Using Big Data Predictive Analytics to Optimize Sales

Enormous data stores have become a fact of life for organizations of all types and sizes. The ability to manipulate, transform, and drive benefit from that data—which is often unstructured big data—is quickly becoming the norm, and the tools and techniques for doing so are becoming more commonplace.

Hadoop is an open-source software framework written in Java* and based on Google’s MapReduce* and distributed file system work. It is built to support distributed applications by analyzing very large bodies of data using clusters of servers, transforming it into a form that is more usable by those applications. Hadoop is designed to be deployed on commonly available, general-purpose infrastructure. Tasks that the framework is particularly well suited for include indexing and sorting large data sets, data mining, log analytics, and image manipulation.

Colfax's ClusterEdge H2200x offers a complete turnkey solution for getting started with Apache Hadoop. The ClusterEdge H2200x is a highly scalable, balanced and easy to deploy platform bundled with Intel® Distribution for Apache Hadoop* software.

 

An Enterprise-Ready Big Data Solution

The ClusterEdge H2200x delivers real-time big data processing and analytics for enterprise customers, with an integrated software environment that is optimized to deliver superior performance, security, and manageability on servers powered by Intel Xeon processors. The bundled Intel® Distribution for Apache Hadoop* software package contains core components of the Apache Hadoop framework, including MapReduce*, Apache Hadoop Distributed File System* (HDFS*), Apache Hive* data warehouse infrastructure, Apache Pig* data flow language, and Apache HBase* database. It also includes Intel® Manager for Apache Hadoop software to simplify deployment and management.

 


Performance

The ability to store and analyze huge amounts of unstructured data promises ongoing opportunities for businesses, academic institutions, and government organizations. The ClusterEdge H2000x offers significant performance through a balanced infrastructure based on well-selected hardware components and the use of the Intel® Distribution for Apache Hadoop* software.

 


Management

The ClusterEdge H2000x includes Intel manager simplifies the deployment, configuration, tuning, monitoring, and security of your Hadoop deployment. Intel Manager takes the delay and complexity out of your deployment by automatically installing, configuring, and optimizing the Hadoop cluster. Intel Manager locates server nodes, installs Hadoop software components, assigns roles to nodes, and presents you with a web-based console for operational control. The configurable dashboard provides complete system visibility by tracking resource utilization metrics.



ClusterEdge H2200x Building Blocks

The ClusterEdge H2200x solution includes:

  • Wide range of servers based on Intel® Xeon® Processor E5-2600 V2 series
  • 24 DIMMs for up to 768GB and 4x 3.5" HDDs or 8x 2.5" HDDs / SSDs per Name Node
  • 24 DIMMs for up to 768GB and 12x 3.5" HDDs or 24x 2.5" HDDs / SSDs per Data Node
  • Platinum Level Efficiency Redundant Power Supplies
  • Remote Systems and Cluster Management with Onboard IPMI
  • InfiniBand or 10 Gigabit Ethernet Interconnect

 

For additional details on the ClusterEdge H2200x Hadoop Solution, contact us at sales@colfax-intl.com or 408-730-2275.


Download White Paper: Deploying Apache™ Hadoop® with Colfax and Mellanox VPI Solutions