Visual Question Answering Problems: Reasoning With Deep Learning

Ilija Ilievski is a PhD student at the National University of Singapore, working on interdisciplinary research at the intersection of vision and language. He believes question answering over multimodal data is the next frontier of deep learning, and his research focuses on Visual Question Answering.

As a side project, he created a place to share his experience developing deep learning methods for real-world problems, with the hope of clearing up the "dark magic" surrounding the development and application of deep learning models for novel problems.

At the Deep Learning Summit in Singapore (20-21 October), Ilija will introduce the Visual Question Answering (VQA) problem, its applications and significance, and present a deep learning model able to associate question words with specific image objects. I spoke with him ahead of the summit to learn more about his thoughts on deep learning.

What has driven you to work in the area of deep learning?
Deep learning methods have attracted a lot of attention recently by achieving state-of-the-art results in areas like computer vision and speech recognition. But deep learning is not just another better-performing machine learning method. What's fascinating to me is that deep learning methods are able to outperform other methods without using human-engineered features, and this makes them the best candidate for achieving artificial general intelligence.

Which industries do you think will be disrupted by deep learning in the future, and how?
I expect all industries to be disrupted to some degree. I think deep learning, and machine learning in general, will change the way we work the same way computers did in the past. Every business generates data that deep learning methods can use to help business owners make better decisions, increase their efficiency, develop new products and so on.

What do you feel are the most valuable applications of deep learning?
One of the most valuable applications of deep learning is in the sciences. I expect that as deep learning models gain more reasoning power, they will help scientists by, for example, pruning unpromising experiments or even proposing possible solutions to existing problems. This will advance the sciences much faster, which in turn will bring even more advanced machine learning models.

What advice would you give someone who would like to work in this field?
Don't be dissuaded by the steep learning curve. Deep learning may seem daunting, but in fact the theory behind it is rather simple. Further, there are many excellent resources available online, from books and courses to web portals and open-source libraries.

What developments can we expect to see in deep learning in the next 5 years?
I hope to see deep learning methods for natural language processing problems that will transform the field. There is also increasing interest in developing deep learning models for unsupervised learning and reinforcement learning, so we can expect significant advances in these fields as well.

Ilija Ilievski will be speaking at the Deep Learning Summit in Singapore on 20-21 October. Other speakers include Brian Cheung, Google Brain; Modar Alaoui, Eyeris; Pradeep Kumar, Lenovo; and Vassilios Vonikakis, Advanced Digital Sciences Center.

The summit will showcase the opportunities created by advancing trends in deep learning and their impact on business and society. Explore the latest advances in deep learning technologies like pattern recognition, NLP, neural networks and reinforcement learning, and learn how they will impact communications, manufacturing, healthcare and transportation.

Tickets are limited for this event, so book early to avoid disappointment! For more information and to register, please visit the website here.