Artificial Intelligence Intern

Over the summer between my Junior and Senior years at ASU, I worked at Systems Imagination, a small tech startup focused on artificial intelligence and data analytics with genomics and bioinformatics. In the first week I was promoted to team lead where I would be managing 4 other members on our project.
The Project
During the first half, the company tasked me with finding new use cases for their propriety hypergraph database so that they could sell it to new customers. We successfully implemented a new hypergraph using the Open Flights Dataset from OpenFlights.org, but the underlying data was unfortunately not complex enough to benefit from the hypergraph.
During the second half, I shifted the team towards a project with more AI, and so we decided to compete in the Predicting Molecular Properties kaggle competition. The poster below summarizes our experience, but it was really great to see all the colaboration and learning taking place within the discussion and notebook sections on kaggle. People would publicly post their solutions with code, and others would comment and learn from it.
The NVIDIA DGX Workstation
Systems imagination takes big data seriously. In the office they have a $40,000 computer from NVIDIA with 256GB of RAM, 4 GPUs, and a 40 core CPU. I used this beast to train AI algorithms, and boy was it effective. Since our team took a data-driven approach, we created over 1,000 features which adds up to multiple gigabytes of data. This is usually impossible to work with on average desktop computers with 16GB of ram and lower GPU and CPU benchmarks because they can't handle the amount of data or the computation would take to long. But with a DGX, this is not a problem. I threw all the engineered features at the DGX and trained a boosted random forest. I'm proud to say that I was using 100% of the 40 core CPU. After the training, this model scored the best yet, placing us higher on the kaggle leaderboard. I think I can say I know what big data is now.
What I Learned
- Leadership - I led a team of 4 composed of a graduated biomedical engineer from ASU, a mathematics major from Carnegie Melon, a high school student from BASIS Scottsdale, and a fellow Computer Scientist from a local technological institute. The challenging part was distributing tasks that each person could handle given their background. I leared that it's important to provide reasons to people you are giving tasks to, so that they understand the goal of what they are doing. Otherwise they will be lost and may not ask clarifying questions.
- Scrum - At the beginning of the project, the biomedical engineer got a scrum certification, so he practiced being the role of scrum master for our team. He also served on the development team, and I led the team as the product owner working on the development team. It was great to get more practice with scrum, and I while I enjoy leading, I also enjoy working so the product owner role is great for me.
- Startup Culture - The office was small and very relaxed. The CEO actively promoted the "free innovation" mindset and encouraged employees to come up with business ideas and act on them. The whole summer we made references to the show Silicon Valley on HBO, which is a comedy about a start-up software company.
- GPU Acceleration - I'm really happy I learned how to use Tensorflow and MXNet on multiple NVIDIA GPU's. The performance benefits are insane. I will definitely use this knowledge throughout the rest of my AI career.
- Boosted Decision Trees - I learned the math behind the algorithm and how to tune the many hyperparameters. Things like number_of_estimators, max_depth, number_of_features, and iterations.
Conclusion
I really loved working with Systems Imagination because I got to do hands on AI projects and work in an awesome office🙂. I practiced using boosted decision trees and fully connected neural networks. Using a super computer at 100% CPU usage while training a boosted random forest was the highlight of my internship. Thanks to Kendyl Douglass for giving me the opportunity to intern with them, and to David Schneider for his mentorship.