Loading…
BSidesSF 2018 has ended
View analytic
Monday, April 16 • 2:50pm - 3:20pm
An Open Source Malware Classifier and Dataset

Sign up or log in to save this to your schedule and see who's attending!

Feedback form is now closed.
Research in machine learning for static malware detection has been stymied because of stale, biased, and otherwise limited public datasets. In this talk, I will introduce an open source dataset of labels for a diverse and representative set of Windows PE files. The dataset also includes feature vectors for machine learning model building, a high-performing pre-trained model for research, and source code to reproducibly generate the features and model. I’ll also detail the reasoning behind the features and labels and demonstrate how the machine learning model performs on samples in the wild.

Presenters
avatar for Phil Roth

Phil Roth

Data Scientist, Endgame
Dr. Phil Roth is a senior data scientist at Endgame, where he develops products that help security analysts find and respond to threats. This work has ranged from tuning a machine learning algorithm to best identify malware to building a data exploration platform for HTTP request... Read More →


ember pdf

Monday April 16, 2018 2:50pm - 3:20pm
City View - Presidio