Variational encoders (VAEs) are generative models, in contrast to typical standard neural networks used for regression or classification tasks. VAEs have diverse applications from generating fake human faces and handwritten digits to producing purely "artificial" music. This post will explore what a VAE is, the intuition behind it and also the tough looking (but quite … Continue reading Variational Autoencoder Explained
The final event of Thomso 2018, Wargasm, a party filled with explosively loud music, high college students and electricity in the air, signed off the annual cultural fest of IIT Roorkee with a bass drop! So, this is my third Thomso after coming to IIT Roorkee and for various silly reasons my first time attending … Continue reading My First EDM Night!
Image segmentation is the task in which we assign a label to pixels (all or some in the image) instead of just one label for the whole image. As a result, image segmentation is also categorized as a dense prediction task. Unlike detection using rectangular bounding boxes, segmentation provides pixel accurate locations of objects in … Continue reading A Look at Image Segmentation using CNNs
Motivation One of the biggest challenges during designing new neural network architectures is time. During real-world workflows, one often trains many different neural networks during the experimentation and design process. This is a wasteful process in which each new model is trained from scratch. In a typical workflow, one trains multiple models, with each model … Continue reading Paper Explanation: Net2Net – Accelerating Learning via Knowledge Transfer
Motivation What if you want to do some real-time face detection/recognition using a deep learning system running on a pair of glasses? What if you want your alarm clock to be able to record and analyze your sleep and conditions around you and come up with the most optimal way of waking you up each … Continue reading Paper Explanation: Binarized Neural Networks: Training Neural Networks with Weights and Activations Constrained to +1 or −1
This movie changed my views on musicals. I always though musicals never had much of a story and their only selling point was the dancing and the music. When I watch a movie, I watch it more for the plot, the story and not music or dancing, though there presence in small amounts does give … Continue reading The Greatest Showman – A Magnificent Musical
This paper is the outcome when Microsoft finally released the beast! The ResNet "slayed" everything, and won not one, not two, but five competitions; ILSVRC 2015 Image Classification, Detection and Localization, and COCO 2015 detection and segmentation. Problems the Paper Addressed The paper analysed what was causing the accuracy of deeper networks to drop as … Continue reading Paper Explanation – Deep Residual Learning for Image Recognition (ResNet)