pentaho - ETL in Amazon EC2 and utilization doubts -
i using pentaho sometime. have basic question on etl infrastructure. need run job on remote ec2 instance extract data multiple database around 2000. need have machine capable doing in ec2. etl ec2 serving process point , storage in host.now need know instance should go in amazon.
these etl jobs have select query , put in table output. no complex transformation , no sorting. etl processes cpu intensive or memory intensive?. how decide whether etl process cpu or memory intensive or i/o intensive?
i upto you, using m3.medium instance according data in database , fine, if have no problem amount of time take execute transformation choose small size instance or go higher instance.
Comments
Post a Comment