Big data (and the associated tools, such as Hadoop, Spark, Cassandra, HDInsight, MongoDB, Hive, HBase, and CouchDB) brings incredible opportunity to data science and analytics, but also introduces new challenges when it comes to digital forensic investigations.
Properly collecting and analyzing evidence in big data environments requires a thoughtful approach so that accurate, repeatable results can be produced for study or for use in legal proceedings.
Big data forensics can be broken into six phases: