"The data scientist just got a personal assistant," Dziekan says. "This Data Science Pack features tools data scientists are familiar with already and we're now operationalizing them."
The Pentaho 5.1 platform also adds full YARN integration, making it much simpler for developers working with Pentaho Data Integration to exploit the computational power of Hadoop without having to write complex MapReduce code. Dziekan says the YARN support allows PDI jobs to make elastic use of Hadoop resources, expanding and contracting as data volumes and processing requirements change. He notes that YARN's advanced resource management capabilities support mixed workload scenarios where continuous data transformation and analysis is required.
Sign up for Computerworld eNewsletters.