Ask questions, get answers, and share ideas or best practices for our cloud-based solutions (ETL on EC2, Hadoop EMR for AWS, Docker).

IronCluster on EMR 1.0.3

I am currently using EMR with Hadoop ver 1.0.3 on a persistent cluster. I'll be leveraging ETL/ELT on HBase extensively and would like to try Syncsort.

Can I use IronCluster on (EMR) Hadoop 1.0.3?

If not, are there any other options from Syncsort? Can I install DMExpress-H on EC2 directly, and will this support (EMR) Hadoop 1.0.3?

Replies

  • Hi Ram-

    Our integration with Hadoop and EMR, for Ironcluster and DMX-h, requires Hadoop 2. You can read more about the integration on our web site, and there is also an interesting blog post here: http://blog.syncsort.com/2013/02/hadoop-mapreduce-to-sort-or-not-to...

     

    The only Hadoop distribution that supports the "backport" for MapReduce 1 at this time is Cloudera (MapR is coming soon). We can't support Amazon EMR with Hadoop 1.

     

    A couple of options:

    • Use EMR Hadoop 2
    • Try it out with the latest releases of Cloudera or Hortonworks
    • Download a "Test Drive" of our product in one of two ways: 1) download the Test Drive that is a full VM with the Cloudera distribution, or 2) download the Hortonworks Sandbox and install our Test Drive on that VM. Both are available here: www.syncsort.com/try

    --Keith Kohl

    kkohl@syncsort.com
