Full Program »

Estimating Intrinsic Dimension To Architect Compact Neural Networks

This project is supported by Grant H98230-21-1-0317.

Information security professionals around the world are always looking for ways to improve the robustness of threat detection and response. A machine learning paradigm such as an autoencoder can be used to automate flagging and increase the accuracy of threat detection. Autoencoders are unsupervised neural networks that are able to take input data, encode and compress the data, and decompress the data into a reconstruction extremely similar to the original data (Badr, 2019). Compression schemes such as autoencoders need to encode input to a size that maximizes space savings while minimizing information loss – a size governed by the intrinsic dimension of the data. This research focuses on a suite of intrinsic dimension estimation techniques implemented in the Scikit-dimension Python package and subsequent use of the Pytorch package to develop an autoencoder model. This is an extremely important interplay because without the correct intrinsic dimension, adversaries can take advantage of the excess space in the autoencoder and insert malicious packets to the system. On the other hand, if the intrinsic dimension of an autoencoder is set too low, it will struggle to learn the representation of even normal, non-malicious data. It is for these reasons that it is imperative to discover a reliable method of intrinsic dimension estimation.

James Bonacci
Robert Morris University
United States

Peyton Lutchkus
Robert Morris University
United States

Reese Martin
Robert Morris University
United States

Robert Pava
Robert Morris University
United States

Ping Wang
Robert Morris University
United States

Bradford Kline
National Security Agency
United States