Shuttl provides data archive management for Splunk. It supports backend storage solutions such as: ApacheHDFS, Amazon S3, or NFS attached storage. Shuttl works on the bucket level, and leverages the standard Splunk mechanism for archiving data based on total data size or time expiration. Use of Shuttl eliminates the need for Splunk users to implement their own homegrown solution for bulk-moving data to storage backends.

In addition to Archiving, Shuttl is useful for both compliance needs of data retention, as well as improving performance of Splunk. Shuttl also supports archiving the data in CSV format, and therefore, when data is moved to HDFS, it opens up the data to other tools such as Apache Hive and Hadoop Map Reduce to do further data processing and analysis.

For more information see the following blog articles:

Source code is available here: https://github.com/splunk/splunk-shuttl

Quickstart Guide is available here: https://github.com/splunk/splunk-shuttl/wiki/Quickstart-Guide

Setup video is available here: http://www.youtube.com/watch?v=OP7IYNVR5ms

For feedback, please email shuttl-dev at splunk.com.