There is a new online course from EdX on Introduction to Big Data with Apache Spark starting on 1st June. You can take the course online for free, or pay a small fee ($50) for the "verified certificate" option.
Apache Spark is a "fast and general engine for large-scale data processing", which is gaining a lot of interest for Big Data applications. It can run on an existing Hadoop YARN cluster, an Apache Mesos cluster, on a stand-alone cluster, or you can even run it on your local PC e.g. for ad hoc data exploration. It includes powerful tools for reading and processing data from different sources e.g. CSV files, HDFS, databases etc, as well as specific libraries for machine learning and stream processing. Spark provides APIs for Scala, Python and Java. There are interactive shells for Scala and Python, and the Python API can also be used interactively via the IPython Notebook.
Spark was a very popular topic for dicussion at the recent Strata Hadoop World conference in London, so this could be your opportunity to find out more about this great new tool for Big Data applications.