Could you tell me the name of opensource tools and web-based for the analysis of big data volumes? (For the prediction, classification, clustering, etc).
That would depend a lot on what language you use and your performance requirements. If you are just getting started in this space and are comfortable with Python I recommend scikit-learn: http://scikit-learn.org/
It packs a lot of power into a very intuitive API and it's documentation is top notch. Critically, the doc's explain how the methods work in addition to providing implementation examples.
We are using Apache Flink (http://flink.apache.org/) for teaching students in analysis of big data volumes. Apache Flink is an open source platform for distributed stream and batch data processing.