a framework for developing and managing scalable scientific data-processing pipelines targeted at cloud or cluster computing