Abstract: A system and method for predicting the amount of time and/or resources required to execute a job on a big data set, and/or a system and method for automatically providing one or more suitable commands to a user for constructing a job for manipulating a big data set. The system and method are optionally and preferably implemented with regard to Hadoop.