 |
 |
 |
|
|
|
Visual Search + |
 |
|
New Approaches in Content-Based Image Retrieval |
 |
Researchers |
Thomas Vetter, Bernhard Egger |
 |
|
Project Partners |
Michael Renner & Theodore Davis, HGK Basel (lead); SodaTech AG; KEYSTONE AG |
 |
|
Funding |
SNF |
 |
|
Project Duration |
1.10.2011 - 30.9.2013 |
 |
With an exponential growth in both the generation and sharing of digital photographs, the topic
of Content-Based Image Retrieval (CBIR) has become increasingly important across all disciplines. This
is particularly true as the model of manually tagging content and context within every media file becomes
unsustainable. While automation in keyword tagging can be found in products such as facial recognition
software, by maintaining textual descriptions of visual media, both the stored labels and attempts to
match them are grossly limited by the semantic gap. Nevertheless, with the inevitable advancement of
image processing to transcode qualitative attributes into quantifiable data, design will play an everincreasing
role in providing the bridge for users to access this information through visual means. Visual
Search + explores the ways in which this bridge can be built, through the formation of an interdisciplinary
team consisting of junior researchers from the departments of computer vision and interface design.
This research strives to make progress beyond the well-explored low-level CBIR attributes (color,
contour, texture) and broaden the contextual interpretation of images by computers through reducing the
sampled imagery to one type, portraits. A focus is placed on images involving the face, due to its crucial
role in everyday communication. Furthermore, while the innate human ability to recognize and process
features within the face exists across all mediums, there remains a great limitation to articulate and
describe them in detail. Using the 3D Morphable Face Model, a technology capable of analyzing 2D
photographs in regard to face attributes such as eye gaze, head tilt, age, sex, mouth activity, facial
expression, beard, among others, this collaboration pushes the technology to meet time restrictions
imposed by the real world environments of content management and retrieval. It is with this challenge that
the 3D-MFM will also be utilized to generate an unlimited set of sample training imagery, used to teach
Fast Image Detectors how to describe and quantify visual attributes with concise speed and accuracy.
For each facial attribute extracted, a visual module will be designed for the query of numerically
stored data. Exploring the possibilities and usability of both 2D and 3D widgets, visual qualities within
the image will be retrieved through the abstracted visual representation of those very same qualities.
Realized through a web application environment (Linux, Apache, MySQL, PHP), this will allow for the
seamless connection of an online graphical user interface, with a database used to store all extracted
information, after having been populated through offline image processing.
Through a collaborative venture of the Graphics and Vision Research Group at the University of
Basel and the Visual Communication Institute of FHNW HGK Basel, Visual Search + yields the real
world application and user consideration of cutting-edge computer vision techniques, while providing
the field of design an opportunity to realize investigations of interface design through the underpinning
of developing technology.
|
|
|
 |
|
Publications |
Pose Normalization for Eye Gaze Estimation and Facial Attribute Description from Still Images
[pdf]
Bernhard Egger, Sandro Schönborn, Andreas Forster and Thomas Vetter
IN: Proceedings of the 36th German Conference of Pattern Recognition, GCPR 2014, Münster, Germany, September 2-5.
|
 |
|
|
Interfacing CBIR: Designing Interactive Widgets to Query Attribute Data in Face Image Retrieval.
Ted Davis, in HCI International 2014 Proceedings (HCII 2014 – DUXU), Crete, Greece (June 2014)
|
 |
|
|
A Monte Carlo Strategy to Integrate Detection
and Model-Based Face Analysis
[pdf]
Sandro Schönborn, Andreas Forster, Bernhard Egger and Thomas Vetter
IN: Proceedings of the 35th German Conference of Pattern Recognition, GCPR 2013, Saarbrücken, Germany, September 3-6,
LNCS Volume 8142, 2013, pp 101-110 .
DOI: http://dx.doi.org/10.1007/978-3-642-40602-7_11
|
 |
|
|