Project: AI Threat Detection
Our group was provided with a large internet traffic dataset that was already labeled, but still not ready for supervised learning. I made sure to remove columns that were definitely unnecessary. I was hesitant to remove certain columns, but after conducting certain tests I determined that they were not meaningful. After cleaning the data, I tried the random forest algorithm and got a certain level of accuracy. Other algorithms were tried as well, and the parameters were adjusted.