Program and Keynotes

The conference takes place at: Comwell Hvide Hus, Vesterbro 2, 9000 Aalborg.

The complete proceedings are available here.


08:45 SSDBM 2014 Welcome by the Chairs
09:00 Keynote 1: Professor Michael H. Böhlen, University of Zurich: Database Technology for Long Data (Slides)
Chair: Christian S. Jensen
10:00 Coffee break
10:30 Research Session 1: Spatial and Spatio-temporal data
Chair: Goce Trajcevski
-Offline cleaning of RFID trajectory data, Bettina Fazzinga, Sergio Flesca, Filippo Furfaro, and Francesco Parisi
-Skew-Resistant Parallel In-Memory Spatial Join, Suprio Ray, Bogdan Simion, Angela Demke Brown, and Ryan Johnson
-Mining Statistically Sound Co-location Patterns at Multiple Distances, Sajib Barua and Jörg Sander
-Integrating Non-Spatial Preferences into Spatial Location Queries, Qiang Qu, Siyuan Liu, Bin Yang, and Christian S. Jensen
13:20 Research Session 2: Arrays, matrices, and statistics
Chair: Lukasz Golab
-SAGA: Array Storage as a DB with Support for Structural Aggregations, Yi Wang, Arnab Nandi, and Gagan Agrawal
-Extending the SQL Array Concept to Support Scientific Analytics, Dimitar Misev and Peter Baumann
-SLACID - Sparse Linear Algebra in a Column-Oriented In-Memory Database System, David Kernert, Frank Köhler, and Wolfgang Lehner
-Efficient Data Management and Statistics with Zero-Copy Integration, Jonathan Lajus and Hannes Mühleisen (Slides)
15:00 Coffee break
15:30 Short papers and demos
Chair: Hua Lu
Short Papers
-A Case Study in Optimizing Continuous Queries Using the Magic Update Technique, Andreas Behrend and Gereon Schueller
-Node Classification in Uncertain Graphs, Michele Dallachiesa, Charu Aggarwal, and Themis Palpanas
-Point Cloud Databases, László Dobos, István Csabai, János M. Szalai-Gindl, Tamás Budavári, and Alexander S. Szalay
-Toward Efficient and Reliable Genome Analysis using Main-Memory Database Systems, Sebastian Dorok, Sebastian Breß, Horstfried Läpple, and Gunter Saake
-Efficient Processing of Exploratory Top-k Joins, Orestis Gkorgkas, Akrivi Vlachou, Christos Doulkeridis, and Kjetil Nørvåg
-A Study of Partitioning and Parallel UDF Execution with the SAP HANA Database, Philipp Große, Norman May, and Wolfgang Lehner
-Subspace Anytime Stream Clustering, Marwan Hassani, Philipp Kranen, Rajveer Saini, and Thomas Seidl
-Efficient Temporal Shortest Path Queries on Evolving Social Graphs, Wenyu Huo and Vassilis J. Tsotras
-Data Movement in Hybrid Analytic Systems: A Case for Automation, Patrick Leyshock, David Maier, and Kristin Tufte
-Protection of Sensitive Trajectory Datasets Through Spatial and Temporal Exchange, Elham Naghizade, Lars Kulik, and Egemen Tanin
-New Approaches to Storing and Manipulating Multi-Dimensional Sparse Arrays, E. J. Otoo, Hairong Wang, and Gideon Nimako
-Exploring Subspace Clustering for Recommendations, Katharina Rausch, Eirini Ntoutsi, Kostas Stefanidis, and Hans-Peter Kriegel
-Data Patterns to Alleviate the Design of Scientific Workflows Exemplified by a Bone Simulation, Peter Reimann, Holger Schwarz, and Bernhard Mitschang
-Communication-Efficient Preference Top-k Monitoring Queries via Subscriptions, Kamalas Udomlamlert, Takahiro Hara, and Shojiro Nishio
-On efficiently generating realistic social media timeline structures, Chengcheng Yu, Fan Xia, Weining Qian, Aoying Zhou, and Jianlong Chang
-DistillFlow: Removing redundancy in Scientific Workflows, Jiuqiang Chen, Sarah Cohen-Boulakia, Christine Froidevaux, Carole Goble, Paolo Missier, and Alan R. Williams
-MR-microT: A MapReduce-based MicroRNA Target Prediction Method, Ilias Kanellos, Thanasis Vergoulis, Dimitris Sacharidis, Theodore Dalamagas, Artemis Hatzigeorgiou, Stelios Sartzetakis, and Timos Sellis
-Integrating Fault-Tolerance and Elasticity in a Distributed Data Stream Processing System, Kasper Grud Skat Madsen, Philip Thyssen, and Yongluan Zhou
-Simulation Workflow Design Tailor-Made for Scientists, Peter Reimann and Holger Schwarz
-A System for Efficient and Simultaneous Processing of Moving K Nearest Neighbor and Spatial Keyword Queries, Chongsheng Zhang
19:00 Welcome reception at the Utzon Center


09:00 Research Session 3: Data Mining
Chair: Lars Dannecker
-Data Perturbation for Outlier Detection Ensembles, Arthur Zimek, Ricardo J. G. B. Campello, and Jörg Sander
-A Subspace Filter Supporting the Discovery of Small Clusters in Very Noisy Datasets, Frank Höppner
-DivIDE: Efficient Diversification for Interactive Data Exploration, Hina A. Khan, Mohamed A. Sharaf, and Abdullah Albarrak
-Local Context Selection for Outlier Ranking in Graphs with Multiple Numeric Node Attributes, Patricia Iglesias Sánchez, Emmanuel Müller, Oretta Irmler, and Klemens Böhm
10:40 Coffee break
11:10 Research Session 4: Advanced Issues
Chair: Emmanuel Müller
-(k, d)-Core Anonymity: Structural Anonymization of Massive Networks, Roland Assam, Marwan Hassani, Michael Brysch, and Thomas Seidl
-Matching Dominance: Capture the Semantics of Dominance for Multi-dimensional Uncertain Objects, Ying Zhang, Wenjie Zhang, Xuemin Lin, Muhammad Aamir Cheema, and Chengqi Zhang
-A Provable Algorithmic Approach to Product Selection Problems for Market Entry and Sustainability, Silei Xu, Yishi Lin, Hong Xie, and John C.S. Lui
-Distributed Data Placement to Minimize Communication Cost via Graph Partitioning, Lukasz Golab, Marios Hadjieleftheriou, Howard Karloff, and Barna Saha
13:30 Tour and banquet
23:00 Return to hotel


09:00 Keynote 2: Chief Specialist Anders Rhod Gregersen, Vestas Wind Systems: Data-driven Wind Business
Chair: Torben Bach Pedersen
10:00 Coffee break
10:30 Research Session 5: Sensor networks and streams
Chair: David Maier
-SensorBench: Benchmarking Approaches to Processing Wireless Sensor Network Data, Ixent Galpin, Alan B. Stokes, George Valkanas, Alasdair J. G. Gray, Norman W. Paton, Alvaro A. A. Fernandes, Kai-Uwe Sattler, and Dimitrios Gunopulos (Slides)
-Managing Evolving Shapes in Sensor Networks, Besim Avci, Goce Trajcevski, and Peter Scheuermann
-Proactive Adaptations In Sensor Network Query Processing, Alan B. Stokes, Norman W. Paton, and Alvaro A. A. Fernandes (Slides)
-Boosting the Basic Counting on Distributed Streams, Bojian Xu
13:20 Case Studies Session
Chair: Milena Ivanova
-Maintaining a Microbial Genome & Metagenome Data Analysis System in an Academic Setting, I-Min A. Chen, Victor M. Markowitz, Ernest Szeto, Krishna Palaniappan, and Ken Chu
-Efficient classification of billions of points into complex geographic regions using hierarchical triangular mesh, Dániel Kondor, László Dobos, István Csabai, András Bodor, and Gábor Vattay (Slides)
14:10 Research Session 6: Scientific Data Storage and Indexing
Chair: Hannes Mühleisen
-PStore: An Efficient Storage Framework for Managing Scientific Data, Souvik Bhattacherjee, Amol Deshpande, and Alan Sussman
-Inverse Predictions on Continuous Models in Scientific Databases, Anca M. Zimmer, Philip Driessen, Philipp Kranen, and Thomas Seidl
15:00 Coffee break
15:30 Research Session 7: Schema and Data Matching
Chair: Victor Markowitz
-Geometric Graph Matching and Similarity: A Probabilistic Approach, Ayser Armiti and Michael Gertz
-Schema Matching over Relations, Attributes, and Data Values, Aibo Tian, Mayank Kejriwal, and Daniel P. Miranker
-Helping Scientists Reconnect their Datasets, Abdussalam Alawini, David Maier, Kristin Tufte, and Bill Howe (Slides)
-Detecting Correlated Columns in Relational Databases with Mixed Datatypes, Hoang Vu Nguyen, Emmanuel Müller, Periklis Andritsos, and Klemens Böhm

Keynote 1: Database Technology for Long Data


Long data is data with a prominent temporal context that captures changes in the real world. Long data is being generated and collected at an unprecedented scale, and data-driven decision making is omnipresent in our society. In stark contrast, database technology in general, and the relational model in particular, are at odds with data that exhibits a prominent temporal context. Recently, however, the major database companies have significantly advanced their infrastructures to deal with temporal data. The talk works out the key requirements for managing temporal data, shows how these requirements can be mapped to simple and powerful primitives for the relational model and database systems, and identifies a range of open problems in dealing with long data.


Michael H. Böhlen is a professor of computer science at the University of Zürich, where he heads the database technology group. His research interests include various aspects of data management and have focused on time-varying information, data warehousing and data analysis, and similarity search. He received his M.Sc. and Ph.D. degrees from ETH Zürich in 1990 and 1994, respectively. Before joining the University of Zürich, he visited the University of Arizona for one year and was a faculty member at Aalborg University for eight years and at the Free University of Bozen-Bolzano for six years. He was Program Co-Chair of the 39th International Conference on Very Large Data Bases and served as an associate editor for ACM TODS and The VLDB Journal. He served as a PC member for SIGMOD, VLDB, ICDE, and EDBT. He is a member of the VLDB Endowment's Board of Trustees.

Keynote 2: Data-driven Wind Business

Using computation and data to bring wind on par with fossil fuel


Vestas creates value every day from petabytes of data, collected from over 35,000 wind turbines and from simulations. This added value is important in making carbon-neutral energy competitive with fossil fuels. Working with data at the petabyte scale is not feasible with relational databases. In partnership with IBM Research in Almaden, Vestas has succeeded in getting SQL-like capability that scales. This talk addresses how big data has become big business for Vestas and how the technical challenges were overcome.


Anders Rhod Gregersen is the chief specialist in high-performance and data-heavy computing at Vestas Wind Systems A/S. At Vestas he designed and operates the Firestorm supercomputer, the third largest commercially used supercomputer in the world at the time of installation. Before Vestas, Anders successfully enabled the university supercomputers in the Nordic countries to analyse the vast data streams from the largest machine in the world, the Large Hadron Collider (LHC) at CERN, Geneva. He is the vice-chair of the Industrial Advisory Committee of PRACE (Partnership for Advanced Computing in Europe).

Important Dates

Submission Deadlines (New)

Abstract: Feb. 19, 2014, 23:59 PST
Full Paper: Feb. 19, 2014, 23:59 PST
Notification: May 5, 2014, 23:59 PST
Final Version: May 26, 2014, 23:59 PST

The submission deadlines for conference papers have passed.

Social Program

Welcome reception at the Utzon Center. Banquet at the cozy port of Skagen, with trips to Grenen (the northernmost point of Denmark, where the two seas Kattegat and Skagerrak meet) and Råbjerg Mile (one of the biggest migrating dunes in Europe).