This page has not been updated in a while. Please see RAIVN Lab's publication page for recent papers and preprints.
Publications
- 2020
- What’s Hidden in a Randomly Weighted Neural Network?
Vivek Ramanujan*, Mitchell Wortsman*, Aniruddha Kembhavi, Ali Farhadi, and Mohammad Rastegari
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- RoboTHOR: An Open Simulation-to-Real Embodied AI Platform
Matt Deitke, Winson Han, Alvaro Herrasti, Aniruddha Kembhavi, Eric Kolve, Roozbeh Mottaghi, Jordi Salvador, Dustin Schwenk,Eli VanderBilt, Mathew Walingford, Luca Weihs,Mark Yatskar, Ali Farhadi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Visual Reaction: Learning to Play Catch with Your Drone
Kuo-Hao Zeng, Roozbeh Mottaghi, Luca Weihs, and Ali Farhadi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Butterfly Transform: An Efficient FFT Based Neural Architecture Design
Keivan Alizadeh vahid, Anish Prabhu, Ali Farhadi, and Mohammad Rastegari
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- Use the Force, Luke! Learning to Predict Physical Forces by Simulating Effects
Kiana Ehsani, Shubham Tulsiani, Saurabh Gupta, Ali Farhadi, and Abhinav Gupta
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
- 2019
- Discovering Neural Wirings
Mitchel Wortsman, Ali Farhadi, Mohammad Rastegari
in Proceedings of the Conference on Neural Information Processing Systems (NeurIPS), 2019.
- Defending Against Neural Fake news
Rowan Zellers, Ari Holtzman, Hannah Rashkin, Yonatan Bisk, Ali Farhadi, Franziska Roesner, Yejin Choi
in Proceedings of the Conference on Neural Information Processing Systems (NeurIPS), 2019.
- Conditional Driving from Natural Language Instructions
Junha Roh, Chris Paxton, Andrezej Pronobis, Ali Farhadi, Dieter Fox
in Proceedings of the Conference on Robot Learning (CoRL), 2019.
- Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index
Minjoon Seo, J Lee, Tom Kwiatkowski, AP Parikh, Ali Farhadi, Hannaneh Hajishirzi
in Proceedings of the Association of Computational Linguistics (ACL), 2019.
- HellaSwag: Can a Machine Really Finish Your Sentence?
Rowan Zellers, A Holtzman, Yonatan Bisk, Ali Farhadi, Yejin Choi
in Proceedings of the Association of Computational Linguistics (ACL), 2019.
- Learning to Learn How to Learn:Self-Adaptive Visual Navigation Using Meta-Learning
Mitchell Wortsman, Kiana Ehsani, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- Two Body Problem: Collaborative Visual Task Completion
Unnat Jain, Luca Weihs, Eric Kolve, Mohammad Rastegari, Svetlana Lazebnik, Ali Farhadi, Alexander Schwing, Aniruddha Kembhavi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- From Recognition to Cognition: Visual Commonsense Reasoning
Rowan Zellers, Yonatan Bisk, Ali Farhadi, Yejin Choi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- ELASTIC: Improving CNNs with Instance Specific Scaling Policies
Huiyu Wang, Aniruddha Kembhavi, Ali Farhadi, Alan Yuille, Mohammad Rastegari
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai, Santosh Divvala, Louis-Philippe Morency, Ruslan Salakhutdinov, Ali Farhadi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- OK-VQA: A Visual Question Answering Benchmark Requiring External Knowledge
Kenneth Marino, Mohammad Rastegari, Ali Farhadi, Roozbeh Mottaghi
in Proceedings of the Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- Visual Semantic Navigation using Scene Priors
Wei Yang, Xiaolong Wang, Ali Farhadi, Abhinav Gupta, Roozbeh Mottaghi
in Proceedings of the International Conference on Learning Representations (ICLR), 2019.
- 2018
- Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension
Minjoon Seo, Tom Kwiatkowski, Ankur P. Parikh, Ali Farhadi, and Hannaneh Hajishirzi
in Proceedings of the Epirical Methods in Natural Langauge Processing (EMNLP), 2018.
- PhotoShape: Photorealistic Materials for Large-Scale Shape Collections
Keunhong Park, Konstantinos Rematas, Ali Farhadi, Steve Seitz
ACM SIGGRAPH Asia, 2018.
- Imagine This! Scripts to Compositions to Videos
Tanmay Gupta, Dustin Schwenk, Ali Farhadi, Derek Hoiem, and Aniruddha Kembhavi
in Proceedings of the European Confernce on Computer Vision (ECCV), 2018.
- Transferring Common-Sense Knowledge for Object Detection
Krishna Kumar Singh, Santosh Kumar Divvala, Ali Farhadi, and Yong Jae Lee
in Proceedings of the European Confernce on Computer Vision (ECCV), 2018.
- Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
K Ehsani, H Bagherinezhad, J Redmon, R Mottaghi, A Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2018.
- Segan: Segmenting and generating the invisible
K Ehsani, R Mottaghi, A Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2018.
- Actor and Observer: Joint Modeling of First and Third-Person Videos
Gunnar Sigurdsson, Abhinav Gupta, Cordelia Schmid, Ali Farhadi, Karteek Alahari
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2018.
- IQA: Visual Question Answering in Interactive Environments
D Gordon, A Kembhavi, M Rastegari, J Redmon, D Fox, A Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2018.
[NVIDIA Pioneer Award]
- Structured Set Matching Networks for One-Shot Part Labeling
J Choi, J Krishnamurthy, A Kembhavi, A Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2018.
- AI2-THOR: An Interactive 3D Environment for Visual AI
E Kolve, R Mottaghi, D Gordon, Y Zhu, A Gupta, A Farhadi
- Neural Speed Reading via Skim-RNN
M Seo, S Min, A Farhadi, H Hajishirzi
in Proceedings of the International Conferene on Representation Learning (ICLR), 2018.
- Re3 : Real-Time Recurrent Regression Networks for Visual Tracking of Generic Objects
Daniel Gordon, Ali Farhadi, Dieter Fox
in Robotics and Automation Letters, and ICRA 2018.
- AJILE Movement Prediction: Multimodal Deep Learning for Natural Human Neural Recordings and Video
XRN Wang, A Farhadi, R Rao, B Brunton
in Proceedings of the Conference in Artificial Intelligence (AAAI), 2018.
- 2017
- Visual Semantic Planning using Deep Successor Representations
Yuke Zhu*, Daniel Gordon*, Eric Kolve, Dieter Fox, Li Fei-Fei, Abhinav Gupta, Roozbeh Mottaghi, Ali Farhadi (* equal contribution)
in Proceedings of International Conference on Computer Vision (ICCV), 2017.
- See the Glass Half Full: Reasoning about Liquid Containers, their Volume and Content
Roozbeh Mottaghi, Connor Schenck, Dieter Fox, Ali Farhadi
in Proceedings of International Conference on Computer Vision (ICCV), 2017.
- YOLO9000: Better, Faster, Stronger
Joseph Redmon, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2017.
[Best Paper Honorable Mention Award]
- LCNN: Lookup-based Convolutional Neural Network
Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2017.
- Commonly Uncommon: Semantic Sparsity in Situation Recognition
Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2017.
- Are You Smarter Than A Sixth Grader?
Textbook Question Answering for Multimodal Machine Comprehension
Aniruddha Kembhavi, Minjoon Seo, Eric Klove, Dustin Schwenk, Hannaneh Hajishirzi, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2017.
- Asynchronous Temporal Fields for Action Recognition
Gunnar A Sigurdsson, Santosh Divvala, Ali Farhadi, Abhinav Gupta
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2017.
- Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo, Aniruddha Kembhavi, Ali Farhadi, Hannaneh Hajishirzi
in Proceedings of the International Conferene on Representation Learning (ICLR), 2017.
- Query-Reduction Networks for Question Answering
Minjoon Seo, Sewon Min, Ali Farhadi, Hannaneh Hajishirzi
in Proceedings of the International Conferene on Representation Learning (ICLR), 2017.
- Target-driven visual navigation in indoor scenes using deep reinforcement learning
Yuke Zhu, Roozbeh Mottaghi, Eric Kolve, Joseph Lim, Abhinav Gupta, Fei-Fei Li, Ali Farhadi
in Proceedings of the International Conferene on Robotics and Automation (ICRA), 2017.
- Summarizing unconstrained videos using salient montages
M Sun, A Farhadi, B Taskar, S Seitz
in IEEE transactions on pattern analysis and machine intelligence (TPAMI), 2017.
- Semantic Highlight Retrieval and Term Prediction
M Sun, KH Zeng, YC Lin, A Farhadi
in IEEE transactions on Image Processing, 2017.
- 2016
- XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari, Vicente Ordonez, Joe Redmon, Ali Farhadi
in Proceedings of the European Conference on Computer Vision (ECCV), 2016.
- "What happens if..." Learning to predict the effect of forces in images
Roozbeh Mottaghi, Mohammad Rastegari, Abhinav Gupta, Ali Farhadi
in Proceedings of the European Conference on Computer Vision (ECCV), 2016.
- A Diagram Is Worth A Dozen Images
Ani Kembhavi, Mike Salvato, Eric Kolve, Minjoon Seo, Hannaneh Hajishirzi, Ali Farhadi
in Proceedings of the European Conference on Computer Vision (ECCV), 2016.
- Deep3D: Fully Automatic 2D-to-3D Video Conversion with Deep Convolutional Neural Networks
Eric Xie, Ross Girshick, Ali Farhadi
in Proceedings of the European Conference on Computer Vision (ECCV), 2016.
- Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar Sigurdsson, Gul Varol, Xiaolong Wang, Ali Farhadi, Ivan Laptev, Abhinav Gupta
in Proceedings of the European Conference on Computer Vision (ECCV), 2016. [Project page, Charades dataset]
- FigureSeer:Parsing Result-Figures in Research Papers
Noah Siegel, Santosh Divvala, Ali Farhadi
in Proceedings of the European Conference on Computer Vision (ECCV), 2016.
- Situation Recognition: Visual Semantic Role Labeling for Image Understanding
Mark Yatskar, Luke Zettlemoyer, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2016. [demo, dataset, code]
- Newtonian Image Understanding: Unfolding the Dynamics of Objects in Static Images
Roozbeh Mottaghi, Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2016. [Project page, dataset, code]
- You Only Look Once: Unified, Real-Time Object Detection
Joseph Redmon, Santosh Divvala, Ross Girshick, Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2016. [Project page, code]
[OpenCV People's Choice Award]
- Actions~Transformation
Xiaolong Wang, Ali Farhadi, and Abhinav Gupta
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2016.
- A Task-Oriented Approach for Cost-sensitive Recognition
Roozbeh Mottaghi, Hannaneh Hajishirzi, and Ali Farhadi
in Proceedings of the Conference of Computer Vision and Pattern Recognition (CVPR), 2016.
- Unsupervised Deep Embedding for Clustering Analysis
Junyuan Xie, Ross Girshick, and Ali Farhadi
in International Conference on Machine Learning (ICML), 2016. [Project page, data, code]
- Stating the Obvious: Extracting Visual Common Sense Knowledge
Mark Yatskar, Vicente Ordonez, Ali Farhadi
in Proceedings of the North American Chapter of the Association for Computational Linguistics (NAACL), 2016.
- Toward a Taxonomy and Computational Models of Abnormalities in Images
Babak Saleh, Ahmed Elgammal, Jacob Feldman, Ali Farhadi
in Proceedings of the Conference in Artificial Intelligence (AAAI), 2016.
[Best Student Paper award]
- Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects
Hessam Bagherinezhad, Hannaneh Hajishirzi, Yejin Choi, and Ali Farhadi
in Proceedings of the Conference in Artificial Intelligence (AAAI), 2016.
- 2015
- Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
Hamid Izadinia, Fereshteh Sadeghi, Santosh K Divvala, Hannaneh Hajishirzi, Yejin Choi, and Ali Farhadi
in Proceedings of International Conference on Computer Vision
(ICCV'15), 2015.
- Generating Notifications for Missing Actions: Don’t forget to turn the lights off!
Bilge Soran, Ali Farhadi, and Linda Shapiro
in Proceedings of International Conference on Computer Vision
(ICCV'15), 2015.
- VISALOGY: Answering Visual Analogy Questions
Fereshteh Sadeghi, Larry Zittnick, and Ali Farhadi
in Proceedings of Neural Information Processing Systems
(NIPS'15), 2015.
- Solving Geometry Problems: Combining Text and Diagram Interpretation
Minjoon Seo, Hannaneh Hajishirzi, Ali Farhadi, Oren Etzioni, and Clint Malcolm
in Proceedings of Empirical Methods in Natural Language Processing
(EMNLP'15), 2015.
- VisKE: Visual Knowledge Extraction and Question Answering by Visual Verification of Relation Phrases
Fereshteh Sadeghi, Santosh K Divvala, Ali Farhadi
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'15), 2015.[Project Page]
- Discriminative and Consistent Similarities in Instance-Level Multiple Instance Learning
Mohammad Rastegari, Hannaneh Hajishirzi, Ali Farhadi
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'15), 2015.
- Deep Classifiers from Image Tags in the Wild
Hamid Izadinia, Bryan C Russell, Ali Farhadi, Matthew D Hoffman, Aaron Hertzmann
Workshop on Community-Organized Multimodal Mining: Opportunities for Novel Solutions, 2015.
- Learning to Select and Order Vacation Photographs
Fereshteh Sadeghi, J Rafael Tena, Ali Farhadi, Leonid Sigal
(WACV'15), 2015.
- 2014
- Learning Everything about Anything: Webly-Supervised Visual Concept Learning
Santosh K Divvala, Ali Farhadi, Carlos Guestrin
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'14), 2014.[Project Page]
- Incorporating Scene Context and Object Layout into Appearance Modeling
Hamid Izadinia, Fereshteh Sadeghi, Ali Farhadi
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'14), 2014.
- Failure Prediction in Vision Systems
Peng Zhang, Jiuling Wang, Ali Farhadi, Martial Hebert, Devi Parikh
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'14), 2014.
- Towards Transparent Systems: Semantic Characterization of Failure Modes
Aayush Bansal, Ali Farhadi, Devi Parikh
in Proceedings of European Conference on Computer Vision
(ECCV'14), 2014.
- Salient montages from unconstrained videos
Min Sun, Ali Farhadi, Ben Taskar, Steve Seitz
in Proceedings of European Conference on Computer Vision
(ECCV'14), 2014.
- Ranking Domain-Specific Highlights by Analyzing Edited Videos
Min Sun, Ali Farhadi, Steve Seitz
in Proceedings of European Conference on Computer Vision
(ECCV'14), 2014.
- Diagram Understanding in Geometry Questions
Min Joon Seo, Hannaneh Hajishirzi, Ali Farhadi, Oren Etzioni
in Proceedings of the Conference in Artificial Intelligence
(AAAI'14), 2014.[Project Page and Demo]
- Multi Resolution Language Grounding with Weak Supervision
Rik Koncel Kedziorski, Hannaneh Hajishirzi, and Ali Farhadi
in Proceedings of the Conference on Empirical Methods in Natural Language Processing
(EMNLP'14), 2014.
- Action Recognition in the Presence of One Egocentric and Multiple Static Cameras
Bilge Soran, Ali Farhadi, Linda Shapiro
in Proceedings of the Asian Conference on Computer Vision
(ACCV'14), 2014.
- 2013
- Multi-Attribute Queries: To Merge or Not to Merge?
Mohammad Rastegari, Ali Diba, Devi Parikh, Ali Farhadi
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'13), 2013.
-
Object-Centric Anomaly Detection by Attribute-Based Reasoning
Babak Saleh, Ali Farhadi, Ahmed Elgammal
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'13), 2013.
- Adding Unlabeled Samples to Categories by Learned Attributes
Jonghyun Choi, Mohammad Rastegari, Ali Farhadi, Larry Davis
in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'13), 2013.
- 2012
-
Attribute Discovery via Predictable Discriminative Binary Codes
Mohammad Rastegari, Ali Farhadi, David Forsyth
in Proceedings of European Conference on Computer Vision
(ECCV'12), 2012.
-
Semantic Understanding of Proefessional Soccer Commentaries
Hannaneh Hajishirzi, Mohammad Rastegari, Ali Farhadi, and Jessica Hodgins
in Proceedings of 28th conference on Uncertainty in Artificial Intelligence
(UAI'12), 2012.
-
Building a Dictionary of Image Fragments
Zicheng Liao, Ali Farhadi, Yang Wang, Ian Endres, David Forsyth,
In proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'12), 2012.
- 2011
- 2010
-
Every Picture Tells a Story: Generating Sentences for Images
Ali Farhadi, Mohsen Hejrati, Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, David Forsyth
In proceedings of European conference on Computer Vision
(ECCV'10), 2010.
-
Attribute-Centric Recognition for Cross-Category Generalization
Ali Farhadi, Ian Endres, Derek Hoiem
In proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'10), 2010.
-
The Benefits and Challenges of Collecting Richer Object Annotations
Ian Endres, Ali Farhadi, Derek Hoiem, and David Forsyth
In Advancing Computer Vision with Humans in the Loop
ACVHL 2010 (in conjunction with CVPR), 2010.
-
It's All About the Data
Tamara L. Berg, Alexander Sorokin, Gang Wang, David A. Forsyth, Derek Hoiem, Ian Endres, Ali Farhadi
In Proceedings of the IEEE, Special Issue on Internet Vision.
- 2009
-
A Latent Model of Discriminative Aspect,
Ali Farhadi, Mostafa Kamali, Ian Endres, David Forsyth
In proceedings of International Conference on Computer Vision
(ICCV'09), 2009.
-
Unlabeled Data Improves Word Prediction
Nicolas Loeff, Ali Farhadi, Ian Endres, David Forsyth
In proceedings of International Conference on Computer Vision
(ICCV'09), 2009.
-
Describing Objects by their Attributes
Ali Farhadi, Ian Endres, Derek Hoiem, David Forsyth
In proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'09), 2009.
-
Book Chapter: Words and Pictures: Categories, Modifiers, Depiction and Iconography
David Forsyth, Tamara Berg, C. Alm, Ali
Farhadi, Julia Hockenmaier, Nicols Loef, Gang
Wang,
Object Categorization: Computer and Human Vision Perspectives
Cambridge University Press, 2009.
- 2008 and before
-
Learning to Recognize Activities from a Wrong Viewpoint
Ali Farhadi and Mostafa Kamali
In proceedings of European conference on Computer Vision
(ECCV'08), 2008.
-
Scene Discovery by Matrix Factorization
Nicolas Loeff and Ali Farhadi
In proceedings of European conference on Computer Vision
(ECCV'08), 2008.
-
Transfer Learning in Sign Language
Ali Farhadi, David Forsyth, and Ryan White
In proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'07), 2007.
-
Aligning ASL for Statistical Translation Using a Discriminative Word Model
Ali Farhadi and David Forsyth
In proceedings of IEEE Conference on Computer Vision and Pattern Recognition
(CVPR'06), 2006.
-
An Application of Linear Predictive Coding and Computational Geometry to Iris Recognition
Ali Farhadi, Masoud Alipour, and Nima Razavi
International Journal of Imaging Systems and Technology, 2006
-
How to Tell the Difference Between a Cat and a Dog?
Nima Razavi, Golnoosh Samei, Masoud Alipour, and Ali Farhadi
International Journal of Imaging Systems and Technology, 2006
-
Image Segmentation via Local Higher Order Statistics
Ali Farhadi and Mehrdad Shahshahani
International Journal of Imaging Systems and Technology, 2003
-
Higher Order Statistics in Computer Vision
Ali Farhadi and Mehrdad Shahshahani
Special Issue of Annals of New York Academy of Sciences, 2002
-
Classification and Detection in Computer Vision
Ali Farhadi and Mehrdad Shahshahani
Proceedings of multi-conferences on computer sciences
(METMBS 2001), 2001