hadoop-emr

Hadoop Migration to Amazon Elastic Map Reduce PoC

Many data-driven businesses possess large-scale on-premise Hadoop clusters, but they are costly and difficult to scale. They are attracted to Amazon Elastic Map Reduce, a managed Hadoop framework, which offers scaling and price / performance characteristics impossible to achieve on-premise. Our PoC engagement jumpstarts customers with data migration to S3 and aggregation for analytics. PoC includes:

  • Establishing S3 as the “landing zone” for data previously stored in “always on” HDFS
  • Demonstration of process to subset data for map reduce analysis
  • Demonstration of use of “transient” EMR clusters that can be commissioned and de-commissioned as needed
  • A roadmap and prioritized recommendations for migration activities beyond the PoC

Contact us today to discuss your needs