Check this out…
IBM and Facebook as well as others are starting to contribute to a massive big data based repository of threat related information.
I had an internal startup for some time that was targeting security as well as general operational data to point to trends that need attention such as disk series that are reaching failure points, apps that suddenly morph and such.
Another topic was cleansing the data from any personal or internal information by tokenizing it.
I stopped this startup since I got to meet someone who was doing the same and pointed to the fact that there’s already enough data on one hand (and now per this post we have got much more of that) and on the other hand companies would agree to share cleansed data but would not be able to do it due to regulations that take time to defuse.
In any case you have now lots of data too sip through if you are a hungry Data scientist…