Research
My main research area is Digital Media, specifically focused on organizing large collections of
images with associated text through the use of techniques from Natural Language Processing and
Computer Vision. Today billions of images with associated text are available in web pages,
captioned photographs from news sources, video with speech or closed captioning, and other sources. To
organize, search, and exploit these enormous collections, we have developed methods that
effectively combine information from both visual and textual sources. Past and current projects include:
automatically identifying people in news photographs, classifying images from the web,
selecting aesthetically pleasing or interesting images, generating natural language descriptions
for images, and recognizing the clothing items people are wearing.
I am also generally interested in bringing
together people and expertise from various areas of Digital Media including digital art, music,
and cultural studies.
Bio
I received my B.S. in Mathematics and Computer Science from the University of Wisconsin, Madison in 2001. I then completed a PhD in Computer Science at the University of California, Berkeley in 2007, advised by Professor David Forsyth, as a member of the Berkeley Computer Vision Group. Afterward, I spent one year as a research scientist at Yahoo! Research before joining the Computer Science department at Stony Brook University as an Assistant Professor and core member of the consortium for Digital Art, Culture, and Technology (cDACT). My research straddles the boundary between Computer Vision and Natural Language Processing, with applications to large-scale recognition, retrieval, and social network analysis.
Recent News
- Are you working on vision and language? Submit a paper to our NAACL-HLT workshop on Vision and Language (WVL)!
- New NSF IIS-Core Medium grant awarded.
- New NSF CI-P grant awarded.
- 2 papers accepted at CVPR 2012.
- 1 paper accepted at ACL 2012.
- 1 paper accepted at NAACL 2012.
- 1 paper accepted at EACL 2012.
- JHU-CLSP Summer 2011 Workshop webpage with papers and data now available.
Teaching
Spring 2013 - CSE/ISE 364 Advanced Multimedia
Spring 2013 - CSE 590 Computational Photography
Fall 2012 - CSE 595 Words & Pictures
Spring 2012 - CSE/ISE 364 Advanced Multimedia
Spring 2012 - CSE 591 Recognizing People, Objects, and Actions
Fall 2011 - CSE 590 Computational Photography
Spring 2011 - CSE 595 Words & Pictures
Spring 2011 - CSE/ISE 364 Advanced Multimedia
Spring 2010 - CSE/ISE 364 Advanced Multimedia
Fall 2009 - CSE 591 Recognizing People, Objects, and Actions
Spring 2009 - CSE/ISE 364 Advanced Multimedia
Fall 2008 - CSE 690 Internet Vision
Students
Vicente Ordonez (PhD)
Kota Yamaguchi (PhD)
Hadi Kiapour (PhD)
Sirion Vittayakorn (PhD)
Hanyu Liu (MS)
Sebo Kim (Undergrad)
Former Students
Chaitanya Kommini (MS Independent Study) - 2012
Deepak Venkatachalam (MS Independent Study) - 2011
Farheen Noorie (MS Independent Study) - 2011
Girish Kulkarni (MS) - 2011 Epic Systems
Debaleena Chattopadhyay (MS) - 2011 Indiana School of Informatics PhD
Sagnik Dhar (MS) 2010 - Honda Research
Visruth Premraj (MS) 2010 Epic Systems
Erin Palmer (MS) 2009 - Factset
Jose Villa (MS) 2010
Piyush Kumat (MS Independent Study) - Fall 2009
Current Funding
NSF Faculty Early Career Development (CAREER) Program: Award #1054133 - Toward a General Framework for Words & Pictures. Project Page
IIS Core: Award #1161876 - RI: Medium: Integrating Humans and Computers for Image and Video Understanding
CI-P: Collaborative Research Award #1205354 - Visual Entailment data set and challenge for the language and vision communities
Seeing Social: Exploiting Computer Vision in Online Communities. Google Faculty Research Award
SBU/BNL Seed Grant: "The Data Sensorium: Multi-Modal Explorations of Scientific Data". Personnel - Dan Weymouth, Kevin Yager, Tamara Berg, Margaret Schedel, Klaus Mueller, Dimitris Samaras, Tony Phillips, Rita Goldstein, Nelly Alia-Klein, Zabet Patterson.
NSF MRI-R2 grant: "Development of an Immersive Giga-pixel Display". Contributor as Senior Personnel.
Past Funding
Stony Brook FAHSS grant: "Encountering Data". Daniel Weymouth, Tamara Berg, Zabet Patterson, Margaret Schedel, John Lutterbie.
Stony Brook FAHSS grant: "Hybrid Geographies". Zabet Patterson, Christa Erickson, Margaret Schedel, Tamara Berg, Raiford Guins, Andrew Uroskie.
Publications
Studying Relationships Between Human Gaze, Description, and Computer Vision
Kiwon Yun,
Yifan Peng,
Greg Zelinsky,
Dimitris Samaras,
Tamara L. Berg
Computer Vision and Pattern Recognition, (CVPR) 2013.
BabyTalk: Understanding and Generating Simple Image Descriptions
Girish Kulkarni,
Visruth Premraj,
Vicente Ordonez,
Sagnik Dhar,
Siming Li,
Yejin Choi,
Alexander C. Berg,
Tamara L. Berg
Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2013.
Parsing Clothing in Fashion Photographs
[pdf]
Kota Yamaguchi,
Hadi Kiapour,
Luis E. Ortiz,
Tamara L. Berg
Computer Vision and Pattern Recognition, (CVPR) 2012.
Data/Code
Understanding and Predicting Importance in Images
[pdf]
Alexander C. Berg,
Tamara L. Berg,
Hal Daumé III,
Jesse Dodge,
Amit Goyal,
Xufeng Han,
Alyssa Mensch,
Margaret Mitchell,
Aneesh Sood,
Karl Stratos,
Kota Yamaguchi
Computer Vision and Pattern Recognition, (CVPR) 2012.
Data/Annotations
Collective Generation of Natural Image Descriptions
[pdf]
Polina Kuznetsova,
Vicente Ordonez,
Alexander C. Berg,
Tamara L. Berg,
Yejin Choi
Association for Computational Linguistics (ACL) 2012.
Pre-Processed Data
Detecting Visual Text
[pdf]
Jesse Dodge,
Amit Goyal,
Xufeng Han,
Alyssa Mensch,
Margaret Mitchell,
Karl Stratos,
Kota Yamaguchi,
Yejin Choi,
Hal Daumé III,
Alexander C. Berg,
Tamara L. Berg
North American Chapter of the Association for Computational Linguistics (NAACL) 2012.
Data/Code
Midge: Generating Image Descriptions From Computer Vision Detections
[pdf]
Margaret Mitchell,
Jesse Dodge,
Amit Goyal,
Kota Yamaguchi,
Karl Stratos,
Xufeng Han,
Alyssa Mensch,
Alexander C. Berg,
Tamara L. Berg,
Hal Daumé III
European Chapter of the Association for Computational Linguistics (EACL) 2012.
Interactive Music: Human Motion Initiated Music Generation Using Skeletal Tracking By Kinect
[pdf]
Tamara L. Berg,
Debaleena Chattopadhyay,
Margaret Schedel,
Timothy Vallier
SEAMUS, 2012.
Two-person Interaction Detection Using Body-Pose Features and Multiple Instance Learning
[pdf]
Kiwon Yun,
Jean Honorio,
Debaleena Chattopadhyay,
Tamara L. Berg,
Dimitris Samaras
The 2nd International Workshop on Human Activity Understanding from 3D Data at Conference on Computer Vision and Pattern Recognition, (CVPR) 2012.
- Im2Text: Describing Images Using 1 Million Captioned Photographs
[pdf]
Vicente Ordonez,
Girish Kulkarni,
Tamara L. Berg
Neural Information Processing Systems (NIPS), 2011.
Dataset: SBU Captioned Photo Dataset
- Composing Simple Image Descriptions using Web-scale N-grams.
[pdf]
Siming Li,
Girish Kulkarni,
Tamara L. Berg,
Alexander C. Berg,
Yejin Choi
Computational Natural Language Learning (CoNLL), 2011.
- Iconizer: A Framework to Identify and Create Effective Representations for Visual Information Encoding
[pdf]
Supriya Garg,
Tamara L. Berg,
Klaus Mueller
The 11th International Symposium on Smart Graphics (SG), 2011
- Baby Talk: Understanding and Generating Simple Image Descriptions
[pdf]
Girish Kulkarni,
Visruth Premraj,
Sagnik Dhar,
Siming Li,
Yejin Choi,
Alexander C. Berg,
Tamara L. Berg
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2011 (oral)
- High Level Describable Attributes for Predicting Aesthetics and Interestingness
[pdf]
Sagnik Dhar,
Vicente Ordonez,
Tamara L. Berg
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2011
- Who are you with and where are you going?
[pdf]
Kota Yamaguchi,
Alexander C. Berg,
Luis Ortiz,
Tamara L. Berg
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2011
- Can Computers Master the Art of Communication? An Excursion with a Focus on Visual Analytics
Klaus Mueller,
Supriya Garg,
Julia Nam,
Tamara L. Berg,
Kevin McDonnell.
IEEE Computer Graphics and Applications, May/June 2011.
- Automatic Attribute Discovery and Characterization from Noisy Web Data
[pdf]
Tamara L. Berg,
Alexander C. Berg,
Jonathan Shih
The European Conference on Computer Vision (ECCV) 2010.
Dataset: Attribute Discovery Dataset
- iWalk, A Tool for Interacting with Geo-Located Data Through Movement and Gesture
[pdf]
Visruth Premraj,
Margaret Schedel,
Tamara L. Berg
ACM Multimedia, Human Centered Multimedia Track (ACM MM) 2010.
- It's All About the Data
Tamara L. Berg, Alexander Sorokin, Gang Wang, David A. Forsyth, Derek Hoiem, Ali Farhadi, Ian Endres.
Proceedings of the IEEE, Special Issue on Internet Vision, August 2010, 98-8, 1434-1453.
- Finding Iconic Images
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg
The 2nd Internet Vision Workshop at Conference on Computer Vision and Pattern Recognition (CVPR) 2009.
- Words and Pictures: Categories, Modifiers, Depiction and Iconography
D.A. Forsyth, T.L. Berg, C. Alm, A. Farhadi, J. Hockenmaier, N. Loeff, G. Wang.
Object Categorization: Computer and Human Vision Perspectives. Cambridge University Press, 2009, in press. Sven Dickinson, Michael Tarr, Ales Leonardis, Bernt Schiele (eds)
- Names and Faces
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
Michael Maire,
Ryan White,
Yee Whye Teh,
Erik Learned-Miller,
David A. Forsyth
In Submission
- Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments [pdf]
Gary B. Huang, Marwan Mattar, Tamara Berg, and Erik Learned-Miller.
The Workshop on Faces in Real-Life Images at European Conference on Computer Vision (ECCV) 2008.
- Labeled Faces in the Wild: A Database for Studying Face Recognition in Unconstrained Environments [pdf]
Gary B. Huang, Manu Ramesh, Tamara Berg, and Erik Learned-Miller.
University of Massachusetts, Amherst, Technical Report 07-49, October, 2007
- Exploiting Words and Pictures
[pdf]
Tamara L. Berg
U.C. Berkeley Ph.D. Thesis, May 2007
- Dataset Issues in Object Recognition
[pdf]
[ps]
J. Ponce, T. L. Berg, M. Everingham, D.A. Forsyth, M. Hebert, S. Lazebnik, M. Marszalek, C. Schmid, B.C. Russell, A. Torralba, C.K.I. Williams, J. Zhang and A. Zisserman,
Toward Category-Level Object Recognition,
Springer-Verlag Lecture Notes in Computer Science. J. Ponce, M. Hebert, C. Schmid and A. Zisserman (eds.), Feb 2007.
- Automatic Ranking of Iconic Images
[pdf]
[ps]
Tamara L. Berg,
David A. Forsyth
U.C. Berkeley Technical Report, Jan. 2007
- Names and Faces
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
Michael Maire,
Ryan White,
Yee Whye Teh,
Erik Learned-Miller,
David A. Forsyth
U.C. Berkeley Technical Report, Jan. 2007
- Animals on the Web
[pdf]
[ps]
Tamara L. Berg,
David A. Forsyth
Computer Vision and Pattern Recognition (CVPR) 2006
Demo: Animals on the Web
Dataset: Animals on the Web Dataset
- Shape Matching and Object Recognition using Low Distortion Correspondence
[pdf]
[ps]
[ppt]
Alexander C. Berg,
Tamara L. Berg,
Jitendra Malik
Computer Vision and Pattern Recognition (CVPR), 2005
- Shape Matching and Object Recognition using Low Distortion Correspondence
[pdf]
[ps]
Alexander C. Berg,
Tamara L. Berg,
Jitendra Malik
U.C. Berkeley Technical Report, Dec. 2004
- Who's in the Picture?
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
David A. Forsyth
Neural Information Processing Systems (NIPS), 2004
Demo: Face Dictionary
Dataset: Faces In the Wild
Dataset: Labeled Faces In the Wild
- Names and Faces in the News
[pdf]
[ps]
Tamara L. Berg,
Alexander C. Berg,
Jaety Edwards,
Michael Maire,
Ryan White,
Yee Whye Teh,
Erik Learned-Miller,
David A. Forsyth
Computer Vision and Pattern Recognition (CVPR), 2004
Alex Berg, my husband.
Arnold Miller, my dad.