Last update:
Sat Nov 9 17:14:24 MST 2024
Mark D. Hill Is transactional memory an oxymoron? . . 1--1 Justin Zobel Databases and the silification of health 2--2 Stephen Blott and Roger Weber What's wrong with high-dimensional similarity search? . . . . . . . . . . . 3--3 Nicolas Bruno and Surajit Chaudhuri Constrained physical design tuning . . . 4--15 Anastasios Kementsietsidis and Frank Neven and Dieter Van de Craen and Stijn Vansummeren Scalable multi-query optimization for exploratory queries over federated scientific databases . . . . . . . . . . 16--27 David J. DeWitt and Erik Paulson and Eric Robinson and Jeffrey Naughton and Joshua Royalty and Srinath Shankar and Andrew Krioukov Clustera: an integrated computation and data management system . . . . . . . . . 28--41 Alvin Cheung and Samuel Madden Performance profiling with EndoScope, an acquisitional software monitoring framework . . . . . . . . . . . . . . . 42--53 Ziv Bar-Yossef and Maxim Gurevich Mining search engine query logs via suggestion sampling . . . . . . . . . . 54--65 Mert Akdere and U\ugur Çetintemel and Nesime Tatbul Plan-based complex event detection across distributed sources . . . . . . . 66--77 Alexander Lachmann and Mirek Riedewald Finding relevant patterns in bursty sequences . . . . . . . . . . . . . . . 78--89 Hao Cheng and Kien A. Hua and Khanh Vu Constrained locally weighted clustering 90--101 Michael Hay and Gerome Miklau and David Jensen and Don Towsley and Philipp Weis Resisting structural re-identification in anonymized social networks . . . . . 102--114 Manolis Terrovitis and Nikos Mamoulis and Panos Kalnis Privacy-preserving anonymization of set-valued data . . . . . . . . . . . . 115--125 HweeHwa Pang and Kyriakos Mouratidis Authenticating the query results of text search engines . . . . . . . . . . . . . 126--137 Ashish Kundu and Elisa Bertino Structural signatures for tree data structures . . . . . . . . . . . . . . . 138--150 Haggai Roitman and David Carmel and Elad Yom-Tov Maintaining dynamic channel profiles on the Web . . . . . . . . . . . . . . . . 151--162 Fan Yang and Nitin Gupta and Chavdar Botev and Elizabeth F. Churchill and George Levchenko and Jayavel Shanmugasundaram WYSIWYG development of data driven Web applications . . . . . . . . . . . . . . 163--175 Eda Baykan and Monika Henzinger and Ingmar Weber Web page language identification based on URLs . . . . . . . . . . . . . . . . 176--187 Wook-Shin Han and Wooseong Kwak and Jinsoo Lee and Guy M. Lohman and Volker Markl Parallelizing query optimization . . . . 188--200 Marios Hadjieleftheriou and Xiaohui Yu and Nick Koudas and Divesh Srivastava Hashed samples: selectivity estimators for set similarity selection queries . . 201--212 Edith Cohen and Haim Kaplan Tighter estimation using bottom $k$ sketches . . . . . . . . . . . . . . . . 213--229 Bogdan Alexe and Wang-Chiew Tan and Yannis Velegrakis STBenchmark: towards a benchmark for mapping systems . . . . . . . . . . . . 230--244 Yannis Katsis and Alin Deutsch and Yannis Papakonstantinou Interactive source registration in community-oriented information integration . . . . . . . . . . . . . . 245--259 Mauricio A. Hernández and Paolo Papotti and Wang-Chiew Tan Data exchange with data-metadata translations . . . . . . . . . . . . . . 260--273 Jin Li and Kristin Tufte and Vladislav Shkapenyuk and Vassilis Papadimos and Theodore Johnson and David Maier Out-of-order processing: a new architecture for high-performance stream systems . . . . . . . . . . . . . . . . 274--288 Wook-Shin Han and Haifeng Jiang and Howard Ho and Quanzhong Li StreamTX: extracting tuples from streaming XML data . . . . . . . . . . . 289--300 Cheqing Jin and Ke Yi and Lei Chen and Jeffrey Xu Yu and Xuemin Lin Sliding-window top-$k$ queries on uncertain streams . . . . . . . . . . . 301--312 Christoph Koch and Dan Olteanu Conditioning probabilistic databases . . 313--325 George Beskales and Mohamed A. Soliman and Ihab F. Ilyas Efficient search for the top-$k$ probable nearest neighbors in uncertain databases . . . . . . . . . . . . . . . 326--339 Daisy Zhe Wang and Eirinaios Michelakis and Minos Garofalakis and Joseph M. Hellerstein BayesStore: managing large, uncertain data repositories with probabilistic graphical models . . . . . . . . . . . . 340--351 Daniel Deutch and Tova Milo Type inference and type checking for queries on execution traces . . . . . . 352--363 Haichuan Shang and Ying Zhang and Xuemin Lin and Jeffrey Xu Yu Taming verification hardness: an efficient algorithm for testing subgraph isomorphism . . . . . . . . . . . . . . 364--375 Lukasz Golab and Howard Karloff and Flip Korn and Divesh Srivastava and Bei Yu On generating near-optimal tableaux for conditional functional dependencies . . 376--390 Wenfei Fan and Shuai Ma and Yanli Hu and Jie Liu and Yinghui Wu Propagating functional dependencies with conditions . . . . . . . . . . . . . . . 391--407 Ioannis Antonellis and Hector Garcia Molina and Chi Chao Chang Simrank++: query rewriting through link analysis of the click graph . . . . . . 408--421 Dmitry Lizorkin and Pavel Velikhov and Maxim Grinev and Denis Turdakov Accuracy estimate and optimization techniques for SimRank computation . . . 422--433 Badrish Chandramouli and Jun Yang End-to-end support for joins in large-scale publish/subscribe systems 434--450 Ashwin Machanavajjhala and Erik Vee and Minos Garofalakis and Jayavel Shanmugasundaram Scalable ranked publish/subscribe . . . 451--462 Jens Teubner and Torsten Grust and Sebastian Maneth and Sherif Sakr Dependable cardinality forecasts for XQuery . . . . . . . . . . . . . . . . . 463--477 Hongzhi Wang and Jianzhong Li and Jizhou Luo and Hong Gao Hash-base subgraph query processing method for graph-structured XML documents . . . . . . . . . . . . . . . 478--489 Sara Cohen Generating XML structure using examples and constraints . . . . . . . . . . . . 490--501 Allison L. Holloway and David J. DeWitt Read-optimized databases, in depth . . . 502--513 Ioannis Koltsidas and Stratis D. Viglas Flashing up the storage layer . . . . . 514--525 Russell Sears and Mark Callaghan and Eric Brewer Rose: compressed, log-structured replication . . . . . . . . . . . . . . 526--537 Michael J. Cafarella and Alon Halevy and Daisy Zhe Wang and Eugene Wu and Yang Zhang WebTables: exploring the power of tables on the Web . . . . . . . . . . . . . . . 538--549 Charles Garrod and Amit Manjhi and Anastasia Ailamaki and Bruce Maggs and Todd Mowry and Christopher Olston and Anthony Tomasic Scalable query result caching for Web applications . . . . . . . . . . . . . . 550--561 Daniele Braga and Stefano Ceri and Florian Daniel and Davide Martinenghi Optimization of multi-domain queries on the Web . . . . . . . . . . . . . . . . 562--573 YongChul Kwon and Magdalena Balazinska and Albert Greenberg Fault-tolerant stream processing using a distributed, replicated file system . . 574--585 Mi-Yen Yeh and Kun-Lung Wu and Philip S. Yu and Ming-Syan Chen LeeWave: level-wise distribution of wavelet coefficients for processing $k$ NN queries over distributed streams . . 586--597 Marcos K. Aguilera and Wojciech Golab and Mehul A. Shah A practical scalable distributed B-tree 598--609 Lin Qiao and Vijayshankar Raman and Frederick Reiss and Peter J. Haas and Guy M. Lohman Main-memory scan sharing for multi-core CPUs . . . . . . . . . . . . . . . . . . 610--621 Ryan Johnson and Vijayshankar Raman and Richard Sidle and Garret Swart Row-wise parallel predicate evaluation 622--634 Gokul Soundararajan and Jin Chen and Mohamed A. Sharaf and Cristiana Amza Dynamic partitioning of the cache hierarchy in shared data centers . . . . 635--646 Thomas Neumann and Gerhard Weikum RDF-3X: a RISC-style engine for RDF . . 647--659 Alkis Simitsis and Akanksha Baid and Yannis Sismanis and Berthold Reinwald Multidimensional content eXploration . . 660--671 Marcus Fontoura and Vanja Josifovski and Ravi Kumar and Christopher Olston and Andrew Tomkins and Sergei Vassilvitskii Relaxation in text search using taxonomies . . . . . . . . . . . . . . . 672--683 Hoa Nguyen and Thanh Nguyen and Juliana Freire Learning to extract form labels . . . . 684--694 Magesh Jayapandian and H. V. Jagadish Automated creation of a forms-based database query interface . . . . . . . . 695--709 Sihem Amer Yahia and Michael Benedikt and Laks V. S. Lakshmanan and Julia Stoyanovich Efficient network aware search in collaborative tagging sites . . . . . . 710--721 Reynold Cheng and Jinchuan Chen and Xike Xie Cleaning uncertain data with quality guarantees . . . . . . . . . . . . . . . 722--735 Jiansheng Huang and Ting Chen and AnHai Doan and Jeffrey F. Naughton On the provenance of non-answers to queries over extracted data . . . . . . 736--747 Shenghuo Zhu and Tao Li and Zhiyuan Chen and Dingding Wang and Yihong Gong Dynamic active probing of helpdesk databases . . . . . . . . . . . . . . . 748--760 Carlo A. Curino and Hyun J. Moon and Carlo Zaniolo Graceful database schema evolution: the PRISM workbench . . . . . . . . . . . . 761--772 Xiaoyong Chai and Mayssam Sayyadian and AnHai Doan and Arnon Rosenthal and Len Seligman Analyzing and revising data integration schemas to improve their matchability 773--784 Partha Pratim Talukdar and Marie Jacob and Muhammad Salman Mehmood and Koby Crammer and Zachary G. Ives and Fernando Pereira and Sudipto Guha Learning to create data-integrating queries . . . . . . . . . . . . . . . . 785--796 Christopher Ré and Dan Suciu Approximate lineage for probabilistic databases . . . . . . . . . . . . . . . 797--808 Prithviraj Sen and Amol Deshpande and Lise Getoor Exploiting shared correlations in probabilistic databases . . . . . . . . 809--820 Vibhor Rastogi and Dan Suciu and Evan Welbourne Access control over uncertain data . . . 821--832 Graham Cormode and Divesh Srivastava and Ting Yu and Qing Zhang Anonymizing bipartite graph data using safe groupings . . . . . . . . . . . . . 833--844 Yingyi Bu and Ada Wai Chee Fu and Raymond Chi Wing Wong and Lei Chen and Jiuyong Li Privacy preserving serial data publishing by role composition . . . . . 845--856 Xiaokui Xiao and Yufei Tao Output perturbation with query relaxation . . . . . . . . . . . . . . . 857--869 David Lomet and Mingsheng Hong and Rimma Nehme and Rui Zhang Transaction time indexing with version compression . . . . . . . . . . . . . . 870--881 Hyun J. Moon and Carlo A. Curino and Alin Deutsch and Chien-Yi Hou and Carlo Zaniolo Managing and querying transaction-time databases under schema evolution . . . . 882--895 Reza Sherkat and Davood Rafiei On efficiently searching trajectories and archival data for historical similarities . . . . . . . . . . . . . . 896--908 Ken Q. Pu and Xiaohui Yu Keyword query cleaning . . . . . . . . . 909--920 Ziyang Liu and Yi Cher Reasoning and identifying relevant matches for XML keyword search . . . . . 921--932 Chuan Xiao and Wei Wang and Xuemin Lin Ed-Join: an efficient algorithm for similarity joins with edit distance constraints . . . . . . . . . . . . . . 933--944 Sanjay Agrawal and Kaushik Chakrabarti and Surajit Chaudhuri and Venkatesh Ganti Scalable ad-hoc entity extraction from text collections . . . . . . . . . . . . 945--957 Parag Agrawal and Daniel Kifer and Christopher Olston Scheduling shared scans of large data files . . . . . . . . . . . . . . . . . 958--969 Suman Nath and Phillip B. Gibbons Online maintenance of very large random samples on flash storage . . . . . . . . 970--983 Tingjian Ge and Stan Zdonik A skip-list approach for efficiently processing forecasting queries . . . . . 984--995 Thomas Phan and Wen-Syan Li A request-routing framework for SOA-based enterprise computing . . . . . 996--1007 Cathrin Weiss and Panagiotis Karras and Abraham Bernstein Hexastore: sextuple indexing for Semantic Web data management . . . . . . 1008--1019 Cyrus Shahabi and Lu-An Tang and Songhua Xing Indexing land surface for efficient kNN query . . . . . . . . . . . . . . . . . 1020--1031 Raymond Chi-Wing Wong and Ada Wai-Chee Fu and Jian Pei and Yip Sing Ho and Tai Wong and Yubao Liu Efficient skyline querying with variable user preferences on nominal attributes 1032--1043 Lin Guo and Sihem Amer Yahia and Raghu Ramakrishnan and Jayavel Shanmugasundaram and Utkarsh Srivastava and Erik Vee Efficient top-$k$ processing over query-dependent functions . . . . . . . 1044--1055 Wei Wu and Fei Yang and Chee-Yong Chan and Kian-Lee Tan FINCH: evaluating reverse $k$-Nearest-Neighbor queries on location data . . . . . . . . . . . . . . . . . . 1056--1067 Hoyoung Jeung and Man Lung Yiu and Xiaofang Zhou and Christian S. Jensen and Heng Tao Shen Discovery of convoys in trajectory databases . . . . . . . . . . . . . . . 1068--1080 Jae-Gil Lee and Jiawei Han and Xiaolei Li and Hector Gonzalez TraClass: trajectory classification using hierarchical region-based and trajectory-based clustering . . . . . . 1081--1094 Sarana Nutanong and Rui Zhang and Egemen Tanin and Lars Kulik The V*-Diagram: a query-dependent approach to moving KNN queries . . . . . 1095--1106 Ravindra Guravannavar and S. Sudarshan Rewriting procedures for batched bindings . . . . . . . . . . . . . . . . 1107--1123 Harish D. and Pooja N. Darera and Jayant R. Haritsa Identifying robust plans through plan diagram reduction . . . . . . . . . . . 1124--1140 Surajit Chaudhuri and Vivek Narasayya and Ravi Ramamurthy A pay-as-you-go framework for query execution feedback . . . . . . . . . . . 1141--1152 Tyson Condie and David Chu and Joseph M. Hellerstein and Petros Maniatis Evita raced: metacompilation for declarative networks . . . . . . . . . . 1153--1165 Fei Chiang and Renée J. Miller Discovering data quality rules . . . . . 1166--1177 Xiang Zhang and Feng Pan and Wei Wang and Andrew Nobel Mining non-redundant high order correlations in binary data . . . . . . 1178--1188 Bhavana Bharat Dalvi and Meghana Kshirsagar and S. Sudarshan Keyword search on external memory data graphs . . . . . . . . . . . . . . . . . 1189--1204 Ioannis Koltsidas and Heiko Müller and Stratis D. Viglas Sorting hierarchical data in external memory for archiving . . . . . . . . . . 1205--1216
Ahmed Metwally and Fatih Emekçi and Divyakant Agrawal and Amr El Abbadi SLEUTH: Single-pubLisher attack dEtection Using correlaTion Hunting . . 1217--1228 Meikel Poess and Raghunath Othayoth Nambiar Energy cost, the key challenge of today's data centers: a power consumption analysis of TPC-C results 1229--1240 Jayant Madhavan and David Ko and Lucja Kot and Vignesh Ganapathy and Alex Rasmussen and Alon Halevy Google's Deep Web crawl . . . . . . . . 1241--1252 Melanie Weis and Felix Naumann and Ulrich Jehle and Jens Lufter and Holger Schuster Industry-scale duplicate detection . . . 1253--1264 Ronnie Chaiken and Bob Jenkins and Per-Åke Larson and Bill Ramsey and Darren Shakib and Simon Weaver and Jingren Zhou SCOPE: easy and efficient parallel processing of massive data sets . . . . 1265--1276 Brian F. Cooper and Raghu Ramakrishnan and Utkarsh Srivastava and Adam Silberstein and Philip Bohannon and Hans-Arno Jacobsen and Nick Puz and Daniel Weaver and Ramana Yerneni PNUTS: Yahoo!'s hosted data serving platform . . . . . . . . . . . . . . . . 1277--1288 Srini Acharya and Peter Carlin and Cesar Galindo-Legaria and Krzysztof Kozielczyk and Pawel Terlecki and Peter Zabback Relational support for flexible schema scenarios . . . . . . . . . . . . . . . 1289--1300 Niloy Mukherjee and Bharath Aleti and Amit Ganesh and Krishna Kunchithapadam and Scott Lynn and Sujatha Muthulingam and Kam Shergill and Shaoyu Wang and Wei Zhang Oracle SecureFiles System . . . . . . . 1301--1312 Jatin Chhugani and Anthony D. Nguyen and Victor W. Lee and William Macy and Mostafa Hagog and Yen-Kuang Chen and Akram Baransi and Sanjeev Kumar and Pradeep Dubey Efficient implementation of sorting on multi-core SIMD CPU architecture . . . . 1313--1324 Atreyee Dey and Sourjya Bhaumik and Harish D. and Jayant R. Haritsa Efficiently approximating query optimizer plan diagrams . . . . . . . . 1325--1336 Dominik \'Sl\kezak and Jakub Wróblewski and Victoria Eastwood and Piotr Synak Brighthouse: an analytic data warehouse for ad-hoc queries . . . . . . . . . . . 1337--1345 Mohamed Ziauddin and Dinesh Das and Hong Su and Yali Zhu and Khaled Yagoub Optimizer plan change management: improved stability and performance in Oracle 11g . . . . . . . . . . . . . . . 1346--1355 Zhen Hua Liu and Sivasankaran Chandrasekar and Thomas Baby and Hui J. Chang Towards a physical XML independent XQuery/SQL/XML engine . . . . . . . . . 1356--1367 Allison W. Lee and Mohamed Zait Closing the query processing loop in Oracle 11g . . . . . . . . . . . . . . . 1368--1378 Namit Jain and Shailendra Mishra and Anand Srinivasan and Johannes Gehrke and Jennifer Widom and Hari Balakrishnan and U\ugur Çetintemel and Mitch Cherniack and Richard Tibbetts and Stan Zdonik Towards a streaming SQL standard . . . . 1379--1390 Yu Huang and Ziyang Liu and Yi Chen eXtract: a snippet generation system for XML search . . . . . . . . . . . . . . . 1392--1395 James F. Terwilliger and Sergey Melnik and Philip A. Bernstein Language-integrated querying of XML data in SQL server . . . . . . . . . . . . . 1396--1399 Christian Mathis and Andreas M. Weiner and Theo Härder and Caesar Ralf Franz Hoppen XTCcmp: XQuery compilation on XTC . . . 1400--1403 Yuanyuan Tian and Jignesh M. Patel and Viji Nair and Sebastian Martini and Matthias Kretzler Periscope/GQ: a graph querying toolkit 1404--1407 Andrey Balmin and Latha Colby and Emiran Curtmola and Quanzhong Li and Fatma Özcan and Sharath Srinivas and Zografoula Vagena SEDA: a system for search, exploration, discovery, and analysis of XML Data . . 1408--1411 Hamid Motahari and Boualem Benatallah and Regis Saint-Paul and Fabio Casati and Periklis Andritsos Process spaceship: discovering and exploring process views from event logs in data spaces . . . . . . . . . . . . . 1412--1415 Mihai Lupu and Y. C. Tay P 3 N: profiling the potential of a peer-based data management system . . . 1416--1419 Mounir Tlili and W. Kokou Dedzoe and Esther Pacitti and Patrick Valduriez and Reza Akbarinia and Pascal Molli and Gérôme Canals and Stéphane Lauri\`ere P2P logging and timestamping for reconciliation . . . . . . . . . . . . . 1420--1423 Toan Luu and Gleb Skobeltsyn and Fabius Klemm and Maroje Puh and Ivana Podnar Zarko and Martin Rajman and Karl Aberer AlvisP2P: scalable peer-to-peer text retrieval in a structured P2P network 1424--1427 S. Abiteboul and T. Allard and P. Chatalic and G. Gardarin and A. Ghitescu and F. Goasdoué and I. Manolescu and B. Nguyen and M. Ouazara and A. Somani and N. Travers and G. Vasile and S. Zoupanos WebContent: efficient P2P Warehousing of Web data . . . . . . . . . . . . . . . . 1428--1431 Pawel Jurczyk and Li Xiong DObjects: enabling distributed data services for metacomputing platforms . . 1432--1435 Qihong Shao and Yi Chen and Shu Tao and Xifeng Yan and Nikos Anerousis EasyTicket: a ticket routing recommendation engine for enterprise problem resolution . . . . . . . . . . . 1436--1439 Cristian Duda and Gianni Frey and Donald Kossmann and Chong Zhou AJAXSearch: crawling, indexing and searching Web 2.0 applications . . . . . 1440--1443 Kun Liu and Evimaria Terzi and Tyrone Grandison ManyAspects: a system for highlighting diverse concepts in documents . . . . . 1444--1447 Emiran Curtmola and Alin Deutsch and Dionysios Logothetis and K. K. Ramakrishnan and Divesh Srivastava and Kenneth Yocum XTreeNet: democratic community search 1448--1451 Guoliang Li and Jianhua Feng and Jianyong Wang and Lizhu Zhou An effective and versatile keyword search engine on heterogeneous data sources . . . . . . . . . . . . . . . . 1452--1455 Akanksha Baid and Andrey Balmin and Heasoo Hwang and Erik Nijkamp and Jun Rao and Berthold Reinwald and Alkis Simitsis and Yannis Sismanis and Frank van Ham DBPubs: multidimensional exploration of database publications . . . . . . . . . 1456--1459 Wenfei Fan and Floris Geerts and Xibei Jia Semandaq: a data quality system based on conditional functional dependencies . . 1460--1463 Yannis Katsis and Alin Deutsch and Yannis Papakonstantinou and Keliang Zhao RIDE: a tool for interactive source registration in community-oriented information integration . . . . . . . . 1464--1467 Bogdan Alexe and Wang-Chiew Tan and Yannis Velegrakis Comparing and evaluating mapping systems with STBenchmark . . . . . . . . . . . . 1468--1471 Dionysios Logothetis and Kenneth Yocum Ad-hoc data processing in the cloud . . 1472--1475 Felix Weigel and Biswanath Panda and Mirek Riedewald and Johannes Gehrke and Manuel Calimlim Large-scale collaborative analysis and extraction of Web data . . . . . . . . . 1476--1479 Tom Crecelius and Mouna Kacimi and Sebastian Michel and Thomas Neumann and Josiane Xavier Parreira and Ralf Schenkel and Gerhard Weikum Making SENSE: socially enhanced search and exploration . . . . . . . . . . . . 1480--1483 Wentian Lu and Gerome Miklau AuditGuard: a system for database auditing under retention restrictions 1484--1487 Ling Hu and Kenneth A. Ross and Yuan-Chi Chang and Christian A. Lang and Donghui Zhang QueryScope: visualizing queries for repeatable database tuning . . . . . . . 1488--1491 Katja Hose and Daniel Klan and Matthias Marx and Kai-Uwe Sattler When is it time to rethink the aggregate configuration of your OLAP server? . . . 1492--1495 Robert Kallman and Hideaki Kimura and Jonathan Natkins and Andrew Pavlo and Alexander Rasin and Stanley Zdonik and Evan P. C. Jones and Samuel Madden and Michael Stonebraker and Yang Zhang and John Hugg and Daniel J. Abadi H-store: a high-performance, distributed main memory transaction processing system . . . . . . . . . . . . . . . . . 1496--1499 Eric Perlman and Randal Burns and Michael Kazhdan Organizing and indexing non-convex regions . . . . . . . . . . . . . . . . 1500--1503 Eric Paquet and Herna L. Viktor Capri/MR: exploring protein databases from a structural and physicochemical point of view . . . . . . . . . . . . . 1504--1507 Fan Guo and Lei Li and Christos Faloutsos and Eric P. Xing C-DEM: a multi-modal query system for Drosophila Embryo databases . . . . . . 1508--1511 Tova Milo and Daniel Deutch Querying and monitoring distributed business processes . . . . . . . . . . . 1512--1515 Michael Franklin and Alon Halevy and David Maier A first tutorial on dataspaces . . . . . 1516--1517 Enrico Franconi Ontologies and databases: myths and challenges . . . . . . . . . . . . . . . 1518--1519 Magdalena Balazinska and Christopher Ré and Dan Suciu Systems aspects of probabilistic data management . . . . . . . . . . . . . . . 1520--1521 Wenfei Fan and Floris Geerts and Xibei Jia A revival of integrity constraints for data cleaning . . . . . . . . . . . . . 1522--1523 Mirella M. Moro and Zografoula Vagena and Vassilis J. Tsotras XML Structural Summaries . . . . . . . . 1524--1525 Mohamed A. Sharaf and Alexandros Labrinidis and Panos K. Chrysanthis Scheduling continuous queries in data stream management systems . . . . . . . 1526--1527 Hans-Peter Kriegel and Peer Kröger and Arthur Zimek Detecting clusters in moderate-to-high dimensional data: subspace clustering, pattern-based clustering, and correlation clustering . . . . . . . . . 1528--1529 Graham Cormode and Marios Hadjieleftheriou Finding frequent items in data streams 1530--1541 Hui Ding and Goce Trajcevski and Peter Scheuermann and Xiaoyue Wang and Eamonn Keogh Querying and mining of time series data: experimental comparison of representations and distance measures 1542--1552 Lefteris Sidirourgos and Romulo Goncalves and Martin Kersten and Niels Nes and Stefan Manegold Column-store support for RDF data management: not all swans are white . . 1553--1563 Virginie Sans and Dominique Laurent Prefix based numbering schemes for XML: techniques, applications and performances . . . . . . . . . . . . . . 1564--1573 Su Chen and Christian S. Jensen and Dan Lin A benchmark for evaluating moving object indexes . . . . . . . . . . . . . . . . 1574--1585 Jens Dittrich and Lukas Blunschi and Marcos Antonio Vaz Salles Dwarfs in the rearview mirror: how big are they really? . . . . . . . . . . . . 1586--1597 Jie Shao and Heng Tao Shen and Xiaofang Zhou Challenges and techniques for effective and efficient similarity search in large video databases . . . . . . . . . . . . 1598--1603 Frank Hopfgartner Studying interaction methodologies in video retrieval . . . . . . . . . . . . 1604--1608 David Lo and Siau-Cheng Khoo Mining patterns and rules for software specification discovery . . . . . . . . 1609--1616 Laurynas Biveinis and Simonas Saltenis Towards efficient main-memory use for optimum tree index update . . . . . . . 1617--1622 Alexander Holupirek and Marc H. Scholl Implementing filesystems by tree-aware DBMSs . . . . . . . . . . . . . . . . . 1623--1630 Artin Avanes and Johann-Christoph Freytag Adaptive workflow scheduling under resource allocation constraints and network dynamics . . . . . . . . . . . . 1631--1637 Sergej Zerr and Wolfgang Nejdl Privacy preserving document indexing infrastructure for a distributed environment . . . . . . . . . . . . . . 1638--1643 Jiajia Miao GS-TMS: a global stream-based threat monitor system . . . . . . . . . . . . . 1644--1651 Evgeny Kharlamov and Werner Nutt Incompleteness in information integration . . . . . . . . . . . . . . 1652--1658 Daniel Deutch and Tova Milo Querying Web-based applications under models of uncertainty . . . . . . . . . 1659--1665 Panu Silvasti and Seppo Sippu and Eljas Soisalon-Soininen XML-document-filtering automaton . . . . 1666--1671 Tobias Scholl and Alfons Kemper Community-driven data grids . . . . . . 1672--1677
Wolfgang Gatterbauer and Magdalena Balazinska and Nodira Khoussainova and Dan Suciu Believe it or not: adding belief annotations to databases . . . . . . . . 1--12 Zhenjie Zhang and Beng Chin Ooi and Srinivasan Parthasarathy and Anthony K. H. Tung Similarity search on Bregman divergence: towards non-metric indexing . . . . . . 13--24 Zhiping Zeng and Anthony K. H. Tung and Jianyong Wang and Jianhua Feng and Lizhu Zhou Comparing stars: on approximating graph edit distance . . . . . . . . . . . . . 25--36 Steven Euijong Whang and Hector Garcia-Molina and Chad Brower and Jayavel Shanmugasundaram and Sergei Vassilvitskii and Erik Vee and Ramana Yerneni Indexing Boolean expressions . . . . . . 37--48 Yongluan Zhou and Ali Salehi and Karl Aberer Scalable delivery of stream query result 49--60 Michael Benedikt and James Cheney Schema-based independence analysis for XML updates . . . . . . . . . . . . . . 61--72 Rimma V. Nehme and Elke A. Rundensteiner and Elisa Bertino Tagging stream data for rich real-time services . . . . . . . . . . . . . . . . 73--84 Atish Das Sarma and Ashwin Lall and Danupon Nanongkai and Jun Xu Randomized multi-pass streaming skyline algorithms . . . . . . . . . . . . . . . 85--96 Galen Reeves and Jie Liu and Suman Nath and Feng Zhao Managing massive time series streams with multi-scale compressed trickles . . 97--108 Tianyi Wu and Dong Xin and Qiaozhu Mei and Jiawei Han Promotion analysis in multi-dimensional space . . . . . . . . . . . . . . . . . 109--120 Nikos Sarkas and Nilesh Bansal and Gautam Das and Nick Koudas Measure-driven keyword-query expansion 121--132 Bin Liu and H. V. Jagadish Using trees to depict a forest . . . . . 133--144 Hazem Elmeleegy and Ahmed K. Elmagarmid and Emmanuel Cecchet and Walid G. Aref and Willy Zwaenepoel Online piece-wise linear approximation of numerical streams with precision guarantees . . . . . . . . . . . . . . . 145--156 Mirco Stern and Erik Buchmann and Klemens Böhm A wavelet transform for efficient consolidation of sensor relations with quality guarantees . . . . . . . . . . . 157--168 Liu Yu and Jianzhong Li and Hong Gao and Xiaolin Fang Enabling $ \epsilon $-approximate querying in sensor networks . . . . . . 169--180 Arnab Nandi and Philip A. Bernstein HAMSTER: using search clicklogs for schema and taxonomy matching . . . . . . 181--192 Lucja Kot and Christoph Koch Cooperative update exchange in the Youtopia system . . . . . . . . . . . . 193--204 Panagiotis Papapetrou and Vassilis Athitsos and George Kollios and Dimitrios Gunopulos Reference-based alignment in large sequence databases . . . . . . . . . . . 205--216 Sudipto Das and Shyam Antony and Divyakant Agrawal and Amr El Abbadi Thread cooperation in multicore architectures for frequency counting over multiple data streams . . . . . . . 217--228 Rene Mueller and Jens Teubner and Gustavo Alonso Streams on wires: a query compiler for FPGAs . . . . . . . . . . . . . . . . . 229--240 Badrish Chandramouli and Jonathan Goldstein and David Maier On-the-fly progress detection in iterative stream queries . . . . . . . . 241--252 Tim Kraska and Martin Hentschel and Gustavo Alonso and Donald Kossmann Consistency rationing in the cloud: pay only when it matters . . . . . . . . . . 253--264 David Lomet and Mohamed F. Mokbel Locking key ranges with unbundled transaction services . . . . . . . . . . 265--276 George Candea and Neoklis Polyzotis and Radek Vingralek A scalable, predictable join operator for highly concurrent data warehouses 277--288 Rahul Gupta and Sunita Sarawagi Answering table augmentation queries from unstructured lists on the Web . . . 289--300 Bogdan Cautis and Alin Deutsch and Nicola Onose and Vasilis Vassalos Efficient rewriting of XPath queries using Query Set Specifications . . . . . 301--312 Ziyang Liu and Peng Sun and Yi Chen Structured search result differentiation 313--324 Eduard C. Dragut and Thomas Kabisch and Clement Yu and Ulf Leser A hierarchical approach to model Web query interfaces for Web source integration . . . . . . . . . . . . . . 325--336 Gao Cong and Christian S. Jensen and Dingming Wu Efficient retrieval of the top-$k$ most relevant spatial Web objects . . . . . . 337--348 Eduard Dragut and Fang Fang and Prasad Sistla and Clement Yu and Weiyi Meng Stop word and related problems in Web interface integration . . . . . . . . . 349--360 Devesh Agrawal and Deepak Ganesan and Ramesh Sitaraman and Yanlei Diao and Shashi Singh Lazy-Adaptive Tree: an optimized index structure for flash devices . . . . . . 361--372 Rubao Lee and Xiaoning Ding and Feng Chen and Qingda Lu and Xiaodong Zhang MCC-DB: minimizing cache conflicts in multi-core processors for databases . . 373--384 Thomas Willhalm and Nicolae Popovici and Yazan Boshmaf and Hasso Plattner and Alexander Zeier and Jan Schaffner SIMD-scan: ultra fast in-memory table scan using on-chip vector processing units . . . . . . . . . . . . . . . . . 385--394 Surajit Chaudhuri and Venkatesh Ganti and Dong Xin Mining document collections to facilitate accurate approximate entity matching . . . . . . . . . . . . . . . . 395--406 Wenfei Fan and Xibei Jia and Jianzhong Li and Shuai Ma Reasoning about record matching rules 407--418 Alin Dobra and Chris Jermaine and Florin Rusu and Fei Xu Turbo-charging estimate convergence in DBO . . . . . . . . . . . . . . . . . . 419--430 Edith Cohen and Nick Duffield and Haim Kaplan and Carsten Lund and Mikkel Thorup Composable, scalable, and accurate weight summarization of unaggregated data sets . . . . . . . . . . . . . . . 431--442 Sai Wu and Shouxu Jiang and Beng Chin Ooi and Kian-Lee Tan Distributed online aggregations . . . . 443--454 Georgia Koloniari and Evaggelia Pitoura A recall-based cluster formation game in peer-to-peer systems . . . . . . . . . . 455--466 Alan Fekete and Shirley N. Goldrei and Jorge Pérez Asenjo Quantifying isolation anomalies . . . . 467--478 Ryan Johnson and Ippokratis Pandis and Anastasia Ailamaki Improving OLTP scalability using speculative lock inheritance . . . . . . 479--489 Russell Sears and Eric Brewer Segment-based recovery: write-ahead logging revisited . . . . . . . . . . . 490--501 Jian Li and Barna Saha and Amol Deshpande A unified approach to ranking in probabilistic databases . . . . . . . . 502--513 Arvind Arasu and Surajit Chaudhuri and Raghav Kaushik Learning string transformations from examples . . . . . . . . . . . . . . . . 514--525 Graham Cormode and Antonios Deligiannakis and Minos Garofalakis and Andrew McGregor Probabilistic histograms for probabilistic data . . . . . . . . . . . 526--537 Ohad Greenshpan and Tova Milo and Neoklis Polyzotis Autocompletion for mashups . . . . . . . 538--549 Xin Luna Dong and Laure Berti-Equille and Divesh Srivastava Integrating conflicting data: the role of source dependence . . . . . . . . . . 550--561 Xin Luna Dong and Laure Berti-Equille and Divesh Srivastava Truth discovery and copying detection in a dynamic world . . . . . . . . . . . . 562--573 Lukasz Golab and Howard Karloff and Flip Korn and Avishek Saha and Divesh Srivastava Sequential dependencies . . . . . . . . 574--585 Dimitar Denev and Arturas Mazeika and Marc Spaniol and Gerhard Weikum SHARC: framework for quality-conscious Web archiving . . . . . . . . . . . . . 586--597 George Beskales and Mohamed A. Soliman and Ihab F. Ilyas and Shai Ben-David Modeling and querying possible repairs in duplicate detection . . . . . . . . . 598--609 Denis Mindolin and Jan Chomicki Discovering relative importance of skyline attributes . . . . . . . . . . . 610--621 Min-Soo Kim and Jiawei Han A particle-and-density based evolutionary clustering method for dynamic networks . . . . . . . . . . . . 622--633 Xiaoyan Yang and Cecilia M. Procopiuc and Divesh Srivastava Summarizing relational databases . . . . 634--645 Edith Cohen and Haim Kaplan and Subhabrata Sen Coordinated weighted sampling for estimating aggregates over multiple weight assignments . . . . . . . . . . . 646--657 Hongrae Lee and Raymond T. Ng and Kyuseok Shim Power-law based estimation of set similarity join size . . . . . . . . . . 658--669 Panagiotis Karras Optimality and scalability in lattice histogram construction . . . . . . . . . 670--681 Ymir Vigfusson and Adam Silberstein and Brian F. Cooper and Rodrigo Fonseca Adaptively parallelizing distributed range queries . . . . . . . . . . . . . 682--693 Shirish Tatikonda and Srinivasan Parthasarathy Mining tree-structured data on multicore systems . . . . . . . . . . . . . . . . 694--705 P. Unterbrunner and G. Giannikis and G. Alonso and D. Fauser and D. Kossmann Predictable performance for unpredictable workloads . . . . . . . . 706--717 Yang Zhou and Hong Cheng and Jeffrey Xu Yu Graph clustering based on structural/attribute similarities . . . 718--729 Mohammad Al Hasan and Mohammed J. Zaki Output space sampling for graph patterns 730--741 Chen Chen and Cindy X. Lin and Matt Fredrikson and Mihai Christodorescu and Xifeng Yan and Jiawei Han Mining graph patterns efficiently via randomized summaries . . . . . . . . . . 742--753 Sihem Amer-Yahia and Senjuti Basu Roy and Ashish Chawlat and Gautam Das and Cong Yu Group recommendation: semantics and efficiency . . . . . . . . . . . . . . . 754--765 Smriti Bhagat and Graham Cormode and Balachander Krishnamurthy and Divesh Srivastava Class-based graph anonymization for social network data . . . . . . . . . . 766--777 Nikos Sarkas and Gautam Das and Nick Koudas Improved search for socially annotated data . . . . . . . . . . . . . . . . . . 778--789 Ashwin Machanavajjhala and Johannes Gehrke and Michaela Götz Data publishing against realistic adversaries . . . . . . . . . . . . . . 790--801 HweeHwa Pang and Jilian Zhang and Kyriakos Mouratidis Scalable verification for outsourced dynamic databases . . . . . . . . . . . 802--813 Xiaokui Xiao and Yufei Tao and Minghua Chen Optimal random perturbation at multiple privacy levels . . . . . . . . . . . . . 814--825 Ira Assent and Marc Wichterich and Ralph Krieger and Hardy Kremer and Thomas Seidl Anticipatory DTW for efficient similarity search in time series databases . . . . . . . . . . . . . . . 826--837 Dimitris Tsirogiannis and Sudipto Guha and Nick Koudas Improving the performance of list intersection . . . . . . . . . . . . . . 838--849 Raghav Kaushik and Dan Suciu Consistent histograms in the presence of distinct value counts . . . . . . . . . 850--861 Charu Aggarwal and Yan Xie and Philip S. Yu GConnect: a connectivity index for massive disk-resident graphs . . . . . . 862--873 Di Yang and Elke A. Rundensteiner and Matthew O. Ward A shared execution strategy for multiple pattern mining requests over streaming data . . . . . . . . . . . . . . . . . . 874--885 Lei Zou and Lei Chen and M. Tamer Özsu Distance-join: pattern match query in a large graph database . . . . . . . . . . 886--897 Qian Wan and Raymond Chi-Wing Wong and Ihab F. Ilyas and M. Tamer Özsu and Yu Peng Creating competitive products . . . . . 898--909 Rene Mueller and Jens Teubner and Gustavo Alonso Data processing on FPGAs . . . . . . . . 910--921 Azza Abouzeid and Kamil Bajda-Pawlikowski and Daniel Abadi and Avi Silberschatz and Alexander Rasin HadoopDB: an architectural hybrid of MapReduce and DBMS technologies for analytical workloads . . . . . . . . . . 922--933 Yeye He and Jeffrey F. Naughton Anonymization of set-valued data via top-down, local generalization . . . . . 934--945 Lei Zou and Lei Chen and M. Tamer Özsu $k$-automorphism: a general framework for privacy preserving network publication . . . . . . . . . . . . . . 946--957 Nick Koudas and Divesh Srivastava and Ting Yu and Qing Zhang Distribution based microdata anonymization . . . . . . . . . . . . . 958--969 Michael Meier and Michael Schmidt and Georg Lausen On chase termination beyond stratification . . . . . . . . . . . . . 970--981 Guido Moerkotte and Thomas Neumann and Gabriele Steidl Preventing bad plans by bounding the impact of cardinality estimation errors 982--993 Surajit Chaudhuri and Vivek Narasayya and Ravi Ramamurthy Exact cardinality query optimization for optimizer testing . . . . . . . . . . . 994--1005 Balder ten Cate and Laura Chiticariu and Phokion Kolaitis and Wang-Chiew Tan Laconic schema mappings: computing the core with SQL queries . . . . . . . . . 1006--1017 Marcelo Arenas and Jorge Pérez and Juan Reutter and Cristian Riveros Inverting schema mappings: bridging the gap between theory and practice . . . . 1018--1029 James F. Terwilliger and Philip A. Bernstein and Sergey Melnik Full-fidelity flexible object-oriented XML access . . . . . . . . . . . . . . . 1030--1041 Ting Wang and Ling Liu Privacy-aware mobile services over road networks . . . . . . . . . . . . . . . . 1042--1053 Leong Hou U. and Nikos Mamoulis and Kyriakos Mouratidis A fair assignment algorithm for multiple preference queries . . . . . . . . . . . 1054--1065 Takeshi Mishima and Hiroshi Nakamura Pangea: an eager database replication middleware guaranteeing snapshot isolation without modification of database servers . . . . . . . . . . . . 1066--1077 Hazem Elmeleegy and Jayant Madhavan and Alon Halevy Harvesting relational tables from lists on the web . . . . . . . . . . . . . . . 1078--1089 Michael J. Cafarella and Alon Halevy and Nodira Khoussainova Data integration for the relational web 1090--1101 Georg Gottlob and Reinhard Pichler and Vadim Savenkov Normalization and optimization of schema mappings . . . . . . . . . . . . . . . . 1102--1113 Songhua Xing and Cyrus Shahabi and Bei Pan Continuous monitoring of nearest neighbors on land surface . . . . . . . 1114--1125 Raymond Chi-Wing Wong and M. Tamer Özsu and Philip S. Yu and Ada Wai-Chee Fu and Lian Liu Efficient method for maximizing bichromatic reverse nearest neighbor . . 1126--1137 Muhammad Aamir Cheema and Xuemin Lin and Ying Zhang and Wei Wang and Wenjie Zhang Lazy updates: an efficient technique to continuously monitoring reverse kNN . . 1138--1149 Ling Chen and Sourav S. Bhowmick and Wolfgang Nejdl NEAR-Miner: mining evolution associations of Web site directories for efficient maintenance of Web archives 1150--1161 W. K. Wong and David W. Cheung and Edward Hung and Ben Kao and Nikos Mamoulis An audit environment for outsourcing of frequent itemset mining . . . . . . . . 1162--1173 Barzan Mozafari and Carlo Zaniolo Publishing naive Bayesian classifiers: privacy without accuracy loss . . . . . 1174--1185 Kostas Tzoumas and Man Lung Yiu and Christian S. Jensen Workload-aware indexing of continuously moving objects . . . . . . . . . . . . . 1186--1197 Meihui Zhang and Su Chen and Christian S. Jensen and Beng Chin Ooi and Zhenjie Zhang Effectively indexing uncertain moving objects for predictive queries . . . . . 1198--1209 Jagan Sankaranarayanan and Hanan Samet and Houman Alborzi Path oracles for spatial networks . . . 1210--1221 Hideaki Kimura and George Huo and Alexander Rasin and Samuel Madden and Stanley B. Zdonik Correlation maps: a compressed access method for exploiting soft functional dependencies . . . . . . . . . . . . . . 1222--1233 Karl Schnaitter and Neoklis Polyzotis and Lise Getoor Index interactions in physical design tuning: modeling, analysis, and applications . . . . . . . . . . . . . . 1234--1245 Songyun Duan and Vamsidhar Thummala and Shivnath Babu Tuning database configuration parameters with iTuned . . . . . . . . . . . . . . 1246--1257 Marcos Vaz Salles and Tuan Cao and Benjamin Sowell and Alan Demers and Johannes Gehrke and Christoph Koch and Walker White An evaluation of checkpoint recovery for massively multiplayer online games . . . 1258--1269 Emmanuel Müller and Stephan Günnemann and Ira Assent and Thomas Seidl Evaluating clustering in subspace projections of high dimensional data . . 1270--1281 Oktie Hassanzadeh and Fei Chiang and Hyun Chul Lee and Renée J. Miller Framework for evaluating clustering algorithms in duplicate detection . . . 1282--1293
Hongfei Guo and Dan Jones and Jennifer Beckmann and Praveen Seshadri Declarative management in Microsoft SQL server . . . . . . . . . . . . . . . . . 1294--1305 Amr El-Helw and Ihab F. Ilyas and Calisto Zuzarte StatAdvisor: recommending statistical views . . . . . . . . . . . . . . . . . 1306--1317 Mustafa Canim and George A. Mihaila and Bishwaranjan Bhattacharjee and Kenneth A. Ross and Christian A. Lang An object placement advisor for DB2 using solid state storage . . . . . . . 1318--1329 Manish Bhide and Manoj K. Agarwal and Amir Bar-Or and Sriram Padmanabhan and Srinivas K. Mittapalli and Girish Venkatachaliah XPEDIA: XML processing for data integration . . . . . . . . . . . . . . 1330--1341 Roger Bamford and Vinayak Borkar and Matthias Brantner and Peter M. Fischer and Daniela Florescu and David Graf and Donald Kossmann and Tim Kraska and Dan Muresan and Sorin Nasoi and Markos Zacharioudakis XQuery reloaded . . . . . . . . . . . . 1342--1353 Ning Zhang and Nipun Agarwal and Sivasankaran Chandrasekar and Sam Idicula and Vijay Medi and Sabina Petride and Balasubramanyam Sthanikam Binary XML storage and query processing in Oracle 11g . . . . . . . . . . . . . 1354--1365 Srikanth Bellamkonda and Rafi Ahmed and Andrew Witkowski and Angela Amor and Mohamed Zait and Chun-Chieh Lin Enhanced subquery optimizations in Oracle . . . . . . . . . . . . . . . . . 1366--1377 Changkyu Kim and Tim Kaldewey and Victor W. Lee and Eric Sedlar and Anthony D. Nguyen and Nadathur Satish and Jatin Chhugani and Andrea Di Blas and Pradeep Dubey Sort vs. Hash revisited: fast join implementation on modern multi-core CPUs 1378--1389 Yu Xu and Pekka Kostamaa Efficient outer join data skew handling in parallel DBMS . . . . . . . . . . . . 1390--1396 Eric Friedman and Peter Pawlowski and John Cieslewicz SQL/MapReduce: a practical approach to self-describing, polymorphic, and parallelizable user-defined functions 1402--1413 Alan F. Gates and Olga Natkovich and Shubham Chopra and Pradeep Kamath and Shravan M. Narayanamurthy and Christopher Olston and Benjamin Reed and Santhosh Srinivasan and Utkarsh Srivastava Building a high-level dataflow system on top of Map-Reduce: the Pig experience 1414--1425 Biswanath Panda and Joshua S. Herbach and Sugato Basu and Roberto J. Bayardo PLANET: massively parallel learning of tree ensembles with MapReduce . . . . . 1426--1437 Thomas Legler and Wolfgang Lehner and Jan Schaffner and Jens Krüger Robust and distributed top-n frequent-pattern mining with SAP BW accelerator . . . . . . . . . . . . . . 1438--1449 Nicolas Dieu and Adrian Dragusanu and Françoise Fabret and François Llirbat and Eric Simon 1,000 tables under the form . . . . . . 1450--1461 Bishwaranjan Bhattacharjee and Lipyeow Lim and Timothy Malkemus and George Mihaila and Kenneth Ross and Sherman Lau and Cathy McArthur and Zoltan Toth and Reza Sherkat Efficient index compression in DB2 LUW 1462--1473 Zoé Lacroix and Christophe Legendre and Spyro Mousses Storing scientific workflows in a database . . . . . . . . . . . . . . . . 1474--1480 Jeffrey Cohen and Brian Dolan and Mark Dunlap and Joseph M. Hellerstein and Caleb Welton MAD skills: new analysis practices for big data . . . . . . . . . . . . . . . . 1481--1492 Michael Ley DBLP: some lessons learned . . . . . . . 1493--1500 Niloy Mukherjee and Amit Ganesh and Vinayagam Djegaradjane and Sujatha Muthulingam and Wei Zhang and Krishna Kunchithapadam and Scott Lynn and Bharath Aleti and Kam Shergill and Shaoyu Wang Oracle SecureFiles: prepared for the digital deluge . . . . . . . . . . . . . 1501--1511 Robert Baumgartner and Georg Gottlob and Marcus Herzog Scalable Web data extraction for online market intelligence . . . . . . . . . . 1512--1523 Anand Rajaraman Kosmix: high-performance topic exploration using the deep Web . . . . . 1524--1529 Rimma V. Nehme and Karen E. Works and Elke A. Rundensteiner and Elisa Bertino Query mesh: multi-route query processing technology . . . . . . . . . . . . . . . 1530--1533 P. Cudre-Mauroux and H. Kimura and K.-T. Lim and J. Rogers and R. Simakov and E. Soroush and P. Velikhov and D. L. Wang and M. Balazinska and J. Becla and D. DeWitt and B. Heath and D. Maier and S. Madden and J. Patel and M. Stonebraker and S. Zdonik A demonstration of SciDB: a science-oriented DBMS . . . . . . . . . 1534--1537 Kuien Liu and Ke Deng and Zhiming Ding and Mingshu Li and Xiaofang Zhou MOIR/MT: monitoring large-scale road network traffic in real-time . . . . . . 1538--1541 Romain Colle and Leonidas Galanis and Supiti Buranawatanachoke and Stratos Papadomanolakis and Yujun Wang Oracle Database Replay . . . . . . . . . 1542--1545 Nedyalko Borisov and Shivnath Babu and Sandeep Uttamchandani and Ramani Routray and Aameek Singh DIADS: a problem diagnosis tool for databases and storage area networks . . 1546--1549 Melanie Herschel and Mauricio A. Hernández and Wang-Chiew Tan Artemis: a system for analyzing missing answers . . . . . . . . . . . . . . . . 1550--1553 Eugene Wu and Philippe Cudre-Mauroux and Samuel Madden Demonstration of the TrajStore system 1554--1557 M. H. Ali and C. Gerea and B. S. Raman and B. Sezgin and T. Tarnavski and T. Verona and P. Wang and P. Zabback and A. Ananthanarayan and A. Kirilov and M. Lu and A. Raizman and R. Krishnan and R. Schindlauer and T. Grabs and S. Bjeletich and B. Chandramouli and J. Goldstein and S. Bhat and Ying Li and V. Di Nicola and X. Wang and David Maier and S. Grell and O. Nano and I. Santos Microsoft CEP Server and online behavioral targeting . . . . . . . . . . 1558--1561 Stefan Krompass and Harumi Kuno and Janet L. Wiener and Kevin Wilkinson and Umeshwar Dayal and Alfons Kemper A testbed for managing dynamic mixed workloads . . . . . . . . . . . . . . . 1562--1565 Yanif Ahmad and Christoph Koch DBToaster: a SQL compiler for high-performance delta processing in main-memory databases . . . . . . . . . 1566--1569 Nicoleta Preda and Fabian M. Suchanek and Gjergji Kasneci and Thomas Neumann and Maya Ramanath and Gerhard Weikum ANGIE: active knowledge for interactive exploration . . . . . . . . . . . . . . 1570--1573 Hanna Köpcke and Andreas Thor and Erhard Rahm Comparative evaluation of entity resolution approaches with FEVER . . . . 1574--1577 Falk Brauer and Wojciech Barczynski and Gregor Hackenbroich and Marcus Schramm and Adrian Mocan and Felix Förster RankIE: document retrieval on ranked entity graphs . . . . . . . . . . . . . 1578--1581 Giansalvatore Mecca and Paolo Papotti and Salvatore Raunich and Marcello Buoncristiano Concise and expressive mappings with +Spicy . . . . . . . . . . . . . . . . . 1582--1585 Isabel F. Cruz and Flavio Palandri Antonelli and Cosmin Stroe AgreementMaker: efficient matching for large real-world schemas and ontologies 1586--1589 Oktie Hassanzadeh and Reynold Xin and Renée J. Miller and Anastasios Kementsietsidis and Lipyeow Lim and Min Wang Linkage Query Writer . . . . . . . . . . 1590--1593 Xiaoyuan Wang and Xingzhi Sun and Feng Cao and Li Ma and Nick Kanellos and Kang Zhang and Yue Pan and Yong Yu SMDM: enhancing enterprise-wide master data management using semantic Web technologies . . . . . . . . . . . . . . 1594--1597 Michael N. Gubanov and Lucian Popa and Howard Ho and Hamid Pirahesh and Jeng-Yih Chang and Shr-Chang Chen IBM UFO repository: object-oriented data integration . . . . . . . . . . . . . . 1598--1601 Huajun Chen and Bin Lu and Yuan Ni and Guotong Xie and Chunying Zhou and Jinhua Mi and Zhaohui Wu Mashup by surfing a Web of data APIs . . 1602--1605 Reinhard Pichler and Vadim Savenkov DEMo: data exchange modeling tool . . . 1606--1609 Julie Letchner and Christopher Ré and Magdalena Balazinska and Matthai Philipose Lahar demonstration: warehousing Markovian streams . . . . . . . . . . . 1610--1613 Peng Sun and Ziyang Liu and Sivaramakrishnan Natarajan and Susan B. Davidson and Yi Chen WOLVES: achieving correct provenance analysis by detecting and resolving unsound workflow views . . . . . . . . . 1614--1617 Chenyun Dai and Gabriel Ghinita and Elisa Bertino and Ji-Won Byun and Ninghui Li TIAMAT: a tool for interactive analysis of microdata anonymization techniques 1618--1621 Yintao Yu and Cindy X. Lin and Yizhou Sun and Chen Chen and Jiawei Han and Binbin Liao and Tianyi Wu and ChengXiang Zhai and Duo Zhang and Bo Zhao iNextCube: information network-enhanced text cube . . . . . . . . . . . . . . . 1622--1625 Ashish Thusoo and Joydeep Sen Sarma and Namit Jain and Zheng Shao and Prasad Chakka and Suresh Anthony and Hao Liu and Pete Wyckoff and Raghotham Murthy Hive: a warehousing solution over a map-reduce framework . . . . . . . . . . 1626--1629 Arjun Satish and Ramesh Jain and Amarnath Gupta Tolkien: an event based storytelling system . . . . . . . . . . . . . . . . . 1630--1633 Emre Sarigöl and Oriana Riva and Patrick Stuedi and Gustavo Alonso Enabling social networking in ad hoc networks of mobile phones . . . . . . . 1634--1637 Zhuowei Bao and Sarah Cohen-Boulakia and Susan B. Davidson and Pierrick Girard PDiffView: viewing the difference in provenance of workflow results . . . . . 1638--1641 Daniel Deutch and Tova Milo and Tom Yam Goal-oriented Web-site navigation for on-line shoppers . . . . . . . . . . . . 1642--1645 Fernando Pereira and Anand Rajaraman and Sunita Sarawagi and William Tunstall-Pedoe and Gerhard Weikum and Alon Halevy Answering Web questions using structured data: dream or reality? . . . . . . . . 1646--1646 Philip A. Bernstein and Daniel J. Abadi and Michael J. Cafarella and Joseph M. Hellerstein and Donald Kossmann and Samuel Madden How best to build Web-scale data managers? . . . . . . . . . . . . . . . 1647--1647 Stefan Manegold and Martin L. Kersten and Peter Boncz Database architecture evolution: mammals flourished long before dinosaurs became extinct . . . . . . . . . . . . . . . . 1648--1653 Xin Luna Dong and Felix Naumann Data fusion: resolving data conflicts for integration . . . . . . . . . . . . 1654--1655 Jeffrey Heer and Joseph M. Hellerstein Data visualization and social data analysis . . . . . . . . . . . . . . . . 1656--1657 Surajit Chaudhuri and Gautam Das Keyword querying and ranking in databases . . . . . . . . . . . . . . . 1658--1659 Marios Hadjieleftheriou and Chen Li Efficient approximate search on string collections . . . . . . . . . . . . . . 1660--1661 Divesh Srivastava and Suresh Venkatasubramanian Information theory for data management 1662--1663 Daniel J. Abadi and Peter A. Boncz and Stavros Harizopoulos Column-oriented database systems . . . . 1664--1665
Divesh Srivastava and Lukasz Golab and Rick Greer and Theodore Johnson and Joseph Seidel and Vladislav Shkapenyuk and Oliver Spatscheck and Jennifer Yates Enabling real time data analysis . . . . 1--2 Paul Matsudaira High-end biological imaging generates very large $3$D+ and dynamic datasets 3--3 Junghoo Cho and Hector Garcia-Molina Dealing with Web data: history and look ahead . . . . . . . . . . . . . . . . . 4--4 Bettina Kemme and Gustavo Alonso Database replication: a tale of research across communities . . . . . . . . . . . 5--12 Mustafa Canim and Murat Kantarcio\uglu and Bijit Hore and Sharad Mehrotra Building disclosure risk aware query optimizers for relational databases . . 13--24 Tristan Allard and Nicolas Anciaux and Luc Bouganim and Yanli Guo and Lionel Le Folgoc and Benjamin Nguyen and Philippe Pucheral and Indrajit Ray and Indrakshi Ray and Shaoyi Yin Secure personal data servers: a vision paper . . . . . . . . . . . . . . . . . 25--35 Daniel Fabbri and Kristen LeFevre and Qiang Zhu PolicyReplay: misconfiguration-response queries for data breach reporting . . . 36--47 Carlo Curino and Evan Jones and Yang Zhang and Sam Madden Schism: a workload-driven approach to database replication and partitioning 48--57 Lu Qin and Jeffrey Xu Yu and Lijun Chang Ten thousand SQLs: parallel keyword queries computing . . . . . . . . . . . 58--69 Alexander Thomson and Daniel J. Abadi The case for determinism in database systems . . . . . . . . . . . . . . . . 70--80 Bogdan Alexe and Mauricio Hernández and Lucian Popa and Wang-Chiew Tan MapMerge: correlating independent schema mappings . . . . . . . . . . . . . . . . 81--92 Sergio Greco and Francesca Spezzano Chase termination: a constraints rewriting approach . . . . . . . . . . . 93--104 Bruno Marnette and Giansalvatore Mecca and Paolo Papotti Scalable data exchange with functional dependencies . . . . . . . . . . . . . . 105--116 Yaron Kanza and Roy Levin and Eliyahu Safra and Yehoshua Sagiv Interactive route search in the presence of order constraints . . . . . . . . . . 117--128 Willis Lang and Jignesh M. Patel Energy management for MapReduce clusters 129--139 Akanksha Baid and Ian Rae and Jiexing Li and AnHai Doan and Jeffrey Naughton Toward scalable keyword search over relational data . . . . . . . . . . . . 140--149 Barzan Mozafari and Kai Zeng and Carlo Zaniolo From regular expressions to nested words: unifying languages and query execution for relational and XML sequences . . . . . . . . . . . . . . . 150--161 Torsten Grust and Jan Rittinger and Tom Schreiber Avalanche-safe LINQ compilation . . . . 162--172 Wenfei Fan and Jianzhong Li and Shuai Ma and Nan Tang and Wenyuan Yu Towards certain fixes with editing rules and master data . . . . . . . . . . . . 173--184 Melanie Herschel and Mauricio A. Hernández Explaining missing answers to SPJUA queries . . . . . . . . . . . . . . . . 185--196 George Beskales and Ihab F. Ilyas and Lukasz Golab Sampling the repairs of functional dependency violations under hard constraints . . . . . . . . . . . . . . 197--207 David Menestrina and Steven Euijong Whang and Hector Garcia-Molina Evaluating entity resolution results . . 208--219 Badrish Chandramouli and Jonathan Goldstein and David Maier High-performance dynamic pattern matching over disordered streams . . . . 220--231 Irina Botan and Roozbeh Derakhshan and Nihal Dindar and Laura Haas and Renée J. Miller and Nesime Tatbul SECRET: a model for analysis of the execution semantics of stream processing systems . . . . . . . . . . . . . . . . 232--243 Haopeng Zhang and Yanlei Diao and Neil Immerman Recognizing patterns in streams with imprecise timestamps . . . . . . . . . . 244--255 Thomas Neumann and Gerhard Weikum x-RDF-3X: fast querying, high update rates, and consistency for RDF databases 256--263 Wenfei Fan and Jianzhong Li and Shuai Ma and Nan Tang and Yinghui Wu and Yunpeng Wu Graph pattern matching: from intractable to polynomial time . . . . . . . . . . . 264--275 Hilmi Yildirim and Vineet Chaoji and Mohammed J. Zaki GRAIL: scalable reachability index for large graphs . . . . . . . . . . . . . . 276--284 Yingyi Bu and Bill Howe and Magdalena Balazinska and Michael D. Ernst HaLoop: efficient iterative data processing on large clusters . . . . . . 285--296 Michael Benedikt and Georg Gottlob The impact of virtual views on containment . . . . . . . . . . . . . . 297--308 James F. Terwilliger and Lois M. L. Delcambre and David Maier and Jeremy Steinhauer and Scott Britell Updatable and evolvable transforms for virtual databases . . . . . . . . . . . 309--319 Daniel Deutch and Ohad Greenshpan and Tova Milo Navigating in complex mashed-up applications . . . . . . . . . . . . . . 320--329 Sergey Melnik and Andrey Gubarev and Jing Jing Long and Geoffrey Romer and Shiva Shivakumar and Matt Tolton and Theo Vassilakis Dremel: interactive analysis of Web-scale datasets . . . . . . . . . . . 330--339 Peixiang Zhao and Jiawei Han On graph query optimization in large networks . . . . . . . . . . . . . . . . 340--351 Davide Martinenghi and Marco Tagliasacchi Proximity rank join . . . . . . . . . . 352--363 Akrivi Vlachou and Christos Doulkeridis and Kjetil Nòrvåg and Yannis Kotidis Identifying the most influential data objects with reverse top-$k$ queries . . 364--372 Xin Cao and Gao Cong and Christian S. Jensen Retrieving top-$k$ prestige-based relevant spatial Web objects . . . . . . 373--384 Lei Li and B. Aditya Prakash and Christos Faloutsos Parsimonious linear fingerprinting for time series . . . . . . . . . . . . . . 385--396 Rui Zhang and Martin Stradling The HV-tree: a memory hierarchy aware version index . . . . . . . . . . . . . 397--408 Sakti Pramanik and Alok Watve and Chad R. Meiners and Alex Liu Transforming range queries to equivalent box queries to optimize page access . . 409--416 Songtao Guo and Xin Luna Dong and Divesh Srivastava and Remi Zajac Record linkage with uniqueness constraints and erroneous values . . . . 417--428 Ekaterini Ioannou and Wolfgang Nejdl and Claudia Niederée and Yannis Velegrakis On-the-fly entity-aware query processing in the presence of linkage . . . . . . . 429--438 Mohamed Yakout and Ahmed K. Elmagarmid and Hazem Elmeleegy and Mourad Ouzzani and Alan Qi Behavior based record linkage . . . . . 439--448 Wook-Shin Han and Jinsoo Lee and Minh-Duc Pham and Jeffrey Xu Yu iGraph: a framework for comparisons of disk-based graph indexing techniques . . 449--459 Jörg Schad and Jens Dittrich and Jorge-Arnulfo Quiané-Ruiz Runtime measurements in the cloud: observing, analyzing, and reducing variance . . . . . . . . . . . . . . . . 460--471 Dawei Jiang and Beng Chin Ooi and Lei Shi and Sai Wu The performance of MapReduce: an in-depth study . . . . . . . . . . . . . 472--483 Hanna Köpcke and Andreas Thor and Erhard Rahm Evaluation of entity resolution approaches on real-world match problems 484--493 Tomasz Nykiel and Michalis Potamias and Chaitanya Mishra and George Kollios and Nick Koudas MRShare: sharing across multiple queries in MapReduce . . . . . . . . . . . . . . 494--505 Hoang Tam Vo and Chun Chen and Beng Chin Ooi Towards elastic transactional cloud storage with range query support . . . . 506--514 Jens Dittrich and Jorge-Arnulfo Quiané-Ruiz and Alekh Jindal and Yagiz Kargin and Vinay Setty and Jörg Schad Hadoop++: making a yellow elephant run like a cheetah (without it even noticing) . . . . . . . . . . . . . . . 515--529 Nicolas Bruno and Vivek Narasayya and Ravi Ramamurthy Slicing long-running queries . . . . . . 530--541 Kostas Tzoumas and Amol Deshpande and Christian S. Jensen Sharing-aware horizontal partitioning for exploiting correlations during query processing . . . . . . . . . . . . . . . 542--553 Andrea Cal\`\i and Georg Gottlob and Andreas Pieris Advanced processing for ontological queries . . . . . . . . . . . . . . . . 554--565 Aditya Parameswaran and Hector Garcia-Molina and Anand Rajaraman Towards the Web of concepts: extracting concepts from large datasets . . . . . . 566--577 Pankaj Gulhane and Rajeev Rastogi and Srinivasan H. Sengamedu and Ashwin Tengli Exploiting content redundancy for Web information extraction . . . . . . . . . 578--587 Bin Liu and Laura Chiticariu and Vivian Chu and H. V. Jagadish and Frederick R. Reiss Automatic rule refinement for information extraction . . . . . . . . . 588--597 HweeHwa Pang and Xuhua Ding and Xiaokui Xiao Embellishing text search queries to protect user privacy . . . . . . . . . . 598--607 Rhonda Chaytor and Ke Wang Small domain randomization: same privacy, more utility . . . . . . . . . 608--618 Stavros Papadopoulos and Spiridon Bakiras and Dimitris Papadias Nearest neighbor search with strong location privacy . . . . . . . . . . . . 619--629 Hideaki Kimura and Samuel Madden and Stanley B. Zdonik UPI: a primary index for uncertain databases . . . . . . . . . . . . . . . 630--637 Jian Li and Amol Deshpande Ranking continuous probabilistic datasets . . . . . . . . . . . . . . . . 638--649 Xiang Lian and Lei Chen Set similarity join on probabilistic data . . . . . . . . . . . . . . . . . . 650--659 Louis Woods and Jens Teubner and Gustavo Alonso Complex event detection at wire speed with FPGAs . . . . . . . . . . . . . . . 660--669 Wenbin Fang and Bingsheng He and Qiong Luo Database compression on graphics processors . . . . . . . . . . . . . . . 670--680 Ryan Johnson and Ippokratis Pandis and Radu Stoica and Manos Athanassoulis and Anastasia Ailamaki Aether: a scalable approach to logging 681--692 Kathy Macropol and Ambuj Singh Scalable discovery of best clusters on large graphs . . . . . . . . . . . . . . 693--702 Alexander Smola and Shravan Narayanamurthy An architecture for parallel topic models . . . . . . . . . . . . . . . . . 703--710 Venkatesh Ganti and Yeye He and Dong Xin Keyword++: a framework to improve keyword search over entity databases . . 711--722 Zhenhui Li and Bolin Ding and Jiawei Han and Roland Kays Swarm: mining relaxed temporal moving object clusters . . . . . . . . . . . . 723--734 Su Chen and Beng Chin Ooi and Zhenjie Zhang An adaptive updating protocol for reducing moving object database workload 735--746 Georgios Kellaris and Kyriakos Mouratidis Shortest path computation on air indexes 747--757 Jia Xu and Zhenjie Zhang and Anthony K. H. Tung and Ge Yu Efficient and effective similarity search over probabilistic data based on Earth Mover's Distance . . . . . . . . . 758--769 Michael Benedikt and Evgeny Kharlamov and Dan Olteanu and Pierre Senellart Probabilistic XML via Markov Chains . . 770--781 Subi Arumugam and Fei Xu and Ravi Jampani and Christopher Jermaine and Luis L. Perez and Peter J. Haas MCDB-R: risk analysis in the database 782--793 Michael Wick and Andrew McCallum and Gerome Miklau Scalable probabilistic databases with factor graphs and MCMC . . . . . . . . . 794--804 Meihui Zhang and Marios Hadjieleftheriou and Beng Chin Ooi and Cecilia M. Procopiuc and Divesh Srivastava On multi-column foreign key discovery 805--814 Reynold Cheng and Eric Lo and Xuan S. Yang and Ming-Hay Luk and Xiang Li and Xike Xie Explore or exploit?: effective strategies for disambiguating large databases . . . . . . . . . . . . . . . 815--825 Mohamed A. Soliman and Ihab F. Ilyas and Mina Saleeb Building ranked mashups of unstructured sources with uncertain information . . . 826--837 Chedy Ra\"\issi and Jian Pei and Thomas Kister Computing closed skycubes . . . . . . . 838--847 Eric Lo and Nick Cheng and Wing-Kai Hon Generating databases for query workloads 848--859 Minji Wu and Laure Berti-Équille and Amélie Marian and Cecilia M. Procopiuc and Divesh Srivastava Processing top-$k$ join queries . . . . 860--870 Xavier Martinez-Palau and David Dominguez-Sal and Josep Lluis Larriba-Pey Two-way replacement selection . . . . . 871--881 Sebastian Maneth and Kim Nguyen XPath whole query optimization . . . . . 882--893 Nils Grimsmo and Truls A. Bjòrklund and Magnus Lie Hetland Fast optimal twig joins . . . . . . . . 894--905 Michael Benedikt and James Cheney Destabilizers and independence of XML updates . . . . . . . . . . . . . . . . 906--917 Ziyang Liu and Qihong Shao and Yi Chen Searching workflows with hierarchical views . . . . . . . . . . . . . . . . . 918--927 Ippokratis Pandis and Ryan Johnson and Nikos Hardavellas and Anastasia Ailamaki Data-oriented transaction execution . . 928--939 Daniel Deutch and Tova Milo and Neoklis Polyzotis and Tom Yam Optimal top-$k$ query evaluation for weighted business processes . . . . . . 940--951 Guozhang Wang and Marcos Vaz Salles and Benjamin Sowell and Xun Wang and Tuan Cao and Alan Demers and Johannes Gehrke and Walker White Behavioral simulations in MapReduce . . 952--963 Tingjian Ge and Stan Zdonik A*-tree: a structure for storage and modeling of uncertain multidimensional arrays . . . . . . . . . . . . . . . . . 964--974 Charu C. Aggarwal and Yao Li and Philip S. Yu and Ruoming Jin On dense pattern mining in graph streams 975--984 Man Lung Yiu and Leong Hou U. and Simonas Saltenis and Kostas Tzoumas Efficient proximity detection among mobile users via self-tuning policies 985--996 Michalis Potamias and Francesco Bonchi and Aristides Gionis and George Kollios k-nearest neighbors in uncertain graphs 997--1008 Xin Cao and Gao Cong and Christian S. Jensen Mining significant semantic locations from GPS data . . . . . . . . . . . . . 1009--1020 Michael Hay and Vibhor Rastogi and Gerome Miklau and Dan Suciu Boosting the accuracy of differentially private histograms through consistency 1021--1032 Jianneng Cao and Panagiotis Karras and Chedy Ra\"\issi and Kian-Lee Tan $ \rho $-uncertainty: inference-proof transaction anonymization . . . . . . . 1033--1044 Graham Cormode and Divesh Srivastava and Ninghui Li and Tiancheng Li Minimizing minimality and maximizing utility: analyzing method-based attacks on anonymized data . . . . . . . . . . . 1045--1056 Daisy Zhe Wang and Michael J. Franklin and Minos Garofalakis and Joseph M. Hellerstein Querying probabilistic information extraction . . . . . . . . . . . . . . . 1057--1067 Prithviraj Sen and Amol Deshpande and Lise Getoor Read-once functions and query evaluation in probabilistic databases . . . . . . . 1068--1079 Parag Agrawal and Anish Das Sarma and Jeffrey Ullman and Jennifer Widom Foundations of uncertain-data integration . . . . . . . . . . . . . . 1080--1090 Michael Mathioudakis and Nilesh Bansal and Nick Koudas Identifying, attributing and describing spatial bursts . . . . . . . . . . . . . 1091--1102 Hideaki Kimura and George Huo and Alexander Rasin and Samuel Madden and Stanley B. Zdonik CORADD: correlation aware database designer for materialized views and indexes . . . . . . . . . . . . . . . . 1103--1113 Danupon Nanongkai and Atish Das Sarma and Ashwin Lall and Richard J. Lipton and Jun Xu Regret-minimizing representative databases . . . . . . . . . . . . . . . 1114--1124 Benjamin Arai and Gautam Das and Dimitrios Gunopulos and Vagelis Hristidis and Nick Koudas An access cost-aware approach for object retrieval over multiple sources . . . . 1125--1136 M. Abhirama and Sourjya Bhaumik and Atreyee Dey and Harsh Shrimal and Jayant R. Haritsa On the stability of plan costs and the costs of plan stability . . . . . . . . 1137--1148 Herodotos Herodotou and Shivnath Babu Xplus: a SQL-tuning-aware query optimizer . . . . . . . . . . . . . . . 1149--1160 Wenfei Fan and Jianzhong Li and Shuai Ma and Hongzhi Wang and Yinghui Wu Graph homomorphism revisited for graph matching . . . . . . . . . . . . . . . . 1161--1172 Ramakrishnan Kandhan and Nikhil Teletia and Jignesh M. Patel SigMatch: fast and scalable multi-pattern matching . . . . . . . . . 1173--1184 Shijie Zhang and Jiong Yang and Wei Jin SAPPER: subgraph indexing and approximate matching in large graphs . . 1185--1194 Yinan Li and Bingsheng He and Robin Jun Yang and Qiong Luo and Ke Yi Tree indexing on solid state drives . . 1195--1206 Sai Wu and Dawei Jiang and Beng Chin Ooi and Kun-Lung Wu Efficient B-tree based indexing for cloud data processing . . . . . . . . . 1207--1218 Jiannan Wang and Jianhua Feng and Guoliang Li Trie-join: efficient trie-based string similarity joins with edit-distance constraints . . . . . . . . . . . . . . 1219--1230 Mehdi Sharifzadeh and Cyrus Shahabi VoR-tree: R-trees with Voronoi diagrams for efficient processing of spatial nearest neighbor queries . . . . . . . . 1231--1242 P. Deepak and Prasad M. Deshpande Efficient RkNN retrieval with arbitrary non-metric similarity measures . . . . . 1243--1254 Shiming Zhang and Nikos Mamoulis and David W. Cheung and Ben Kao Efficient skyline evaluation over partially ordered domains . . . . . . . 1255--1266 Mingzhu Wei and Elke A. Rundensteiner and Murali Mani Achieving high output quality under limited resources through structure-based spilling in XML streams 1267--1278 Svilen R. Mihaylov and Marie Jacob and Zachary G. Ives and Sudipto Guha Dynamic join optimization in multi-hop wireless sensor networks . . . . . . . . 1279--1290 Mert Akdere and U\ugur Çetintemel and Eli Upfal Database-support for continuous prediction queries over streaming data 1291--1301 Thanh T. L. Tran and Andrew McGregor and Yanlei Diao and Liping Peng and Anna Liu Conditioning and aggregating uncertain data streams: going beyond expectations 1302--1313 Boris Glavic and Gustavo Alonso and Renée J. Miller and Laura M. Haas TRAMP: understanding the behavior of schema mappings through provenance . . . 1314--1325 Steven Euijong Whang and Hector Garcia-Molina Entity resolution with evolving rules 1326--1337 Girija Limaye and Sunita Sarawagi and Soumen Chakrabarti Annotating and searching Web tables using entities, types and relationships 1338--1347 Srikanta Bedathur and Klaus Berberich and Jens Dittrich and Nikos Mamoulis and Gerhard Weikum Interesting-phrase mining for ad-hoc text analytics . . . . . . . . . . . . . 1348--1357 Xin Luna Dong and Laure Berti-Equille and Yifan Hu and Divesh Srivastava Global detection of complex copying relationships between sources . . . . . 1358--1369 Sabrina De Capitani di Vimercati and Sara Foresti and Sushil Jajodia and Stefano Paraboschi and Pierangela Samarati Fragments and loose associations: respecting privacy in data publishing 1370--1381 Francesco Fusco and Marc Ph. Stoecklin and Michail Vlachos NET-FLi: on-the-fly compression, archiving and indexing of streaming network traffic . . . . . . . . . . . . 1382--1393 Qiong Zou and Huayong Wang and Robert Soulé and Martin Hirzel and Henrique Andrade and Bu\ugra Gedik and Kun-Lung Wu From a stream of relational queries to distributed stream processing . . . . . 1394--1405 James T. L. Mah and Danny C. C. Poo and Shaojiang Cai UASMAs (universal automated SNP mapping algorithms): a set of algorithms to instantaneously map SNPs in real time to aid functional SNP discovery . . . . . . 1406--1413 Biplob Debnath and Sudipta Sengupta and Jin Li FlashStore: high throughput persistent key--value store . . . . . . . . . . . . 1414--1425 Reynold S. Xin and William McLaren and Patrick Dantressangle and Steve Schormann and Sam Lightstone and Maria Schwenger MEET DB2: automated database migration evaluation . . . . . . . . . . . . . . . 1426--1434 Mustafa Canim and George A. Mihaila and Bishwaranjan Bhattacharjee and Kenneth A. Ross and Christian A. Lang SSD bufferpool extensions for database systems . . . . . . . . . . . . . . . . 1435--1446 Charles Loboz and Slawek Smyl and Suman Nath DataGarage: warehousing massive performance data on commodity servers 1447--1458 Songting Chen Cheetah: a high performance, custom data warehouse on top of MapReduce . . . . . 1459--1468 Gustavo H. Orair and Carlos H. C. Teixeira Wagner Meira, Jr. and Ye Wang and Srinivasan Parthasarathy Distance-based outlier detection: consolidation and renewed bearing . . . 1469--1480 Young-Seok Kim and Heegyu Jin and Kyoung-Gu Woo Adaptive logging for mobile device . . . 1481--1492 Peter Pesti and Ling Liu and Bhuvan Bamba and Arun Iyengar and Matt Weber RoadTrack: scaling location updates for mobile clients on road networks with query awareness . . . . . . . . . . . . 1493--1504 Xiance Si and Edward Y. Chang and Zoltán Gyöngyi and Maosong Sun Confucius and its intelligent disciples: integrating social with search . . . . . 1505--1516 Jayant R. Haritsa The Picasso database query optimizer visualizer . . . . . . . . . . . . . . . 1517--1520 Ziyang Liu and Sivaramakrishnan Natarajan and Bin He and Hui-I Hsiao and Yi Chen CODS: evolving data efficiently and scalably in column oriented databases 1521--1524 Mohammad Sadoghi and Martin Labrecque and Harsh Singh and Warren Shum and Hans-Arno Jacobsen Efficient event processing through reconfigurable hardware for algorithmic trading . . . . . . . . . . . . . . . . 1525--1528 Justin J. Levandoski and Mohamed F. Mokbel and Mohamed E. Khalefa CareDB: a context and preference-aware location-based database system . . . . . 1529--1532 Donald Kossmann and Tim Kraska and Simon Loesing and Stephan Merkli and Raman Mittal and Flavio Pfaffhauser Cloudy: a modular cloud storage system 1533--1536 Seyed Jalal Kazemitabar and Ugur Demiryurek and Mohamed Ali and Afsin Akdogan and Cyrus Shahabi Geospatial stream query processing using Microsoft SQL Server StreamInsight . . . 1537--1540 Curtis E. Dyreson and Sourav S. Bhowmick and Kirankanth Mallampalli Using XMorph to transform XML data . . . 1541--1544 Di Wang and Elke A. Rundensteiner and Han Wang and Richard T. Ellison III Active complex event processing: applications in real-time health care 1545--1548 Tom Schreiber and Simone Bonetti and Torsten Grust and Manuel Mayr and Jan Rittinger Thirteen new players in the team: a FERRY-based LINQ to SQL provider . . . . 1549--1552 Serge Abiteboul and Pierre Bourhis and Bogdan Marinoiu and Alban Galland AXART: enabling collaborative work with AXML artifacts . . . . . . . . . . . . . 1553--1556 Christopher McConnell and Fan Ping and Jeong-Hyon Hwang iFlow: an approach for fast and reliable Internet-scale stream processing utilizing detouring and replication . . 1557--1560 Verena Kantere and Maher Manoubi and Iluju Kiringa and Timos Sellis and John Mylopoulos Peer coordination through distributed triggers . . . . . . . . . . . . . . . . 1561--1564 Hao Wu and Guoliang Li and Chen Li and Lizhu Zhou Seaform: search-as-you-type in forms . . 1565--1568 Jannik Strötgen and Michael Gertz TimeTrails: a system for exploring spatio-temporal information in documents 1569--1572 Jeffrey Pound and Ihab F. Ilyas and Grant Weddell QUICK: expressive and flexible search over knowledge bases and text collections . . . . . . . . . . . . . . 1573--1576 Marcin Kwietniewski and Jarek Gryz and Stephanie Hazlewood and Paul Van Run Transforming XML documents as schemas evolve . . . . . . . . . . . . . . . . . 1577--1580 Ziyang Liu and Sivaramakrishnan Natarajan and Peng Sun and Stephen Booher and Tim Meehan and Robert Winkler and Yi Chen XSACT: a comparison tool for structured search results . . . . . . . . . . . . . 1581--1584 Talel Abdessalem and Bogdan Cautis and Nora Derouiche ObjectRunner: lightweight, targeted extraction and querying of structured Web data . . . . . . . . . . . . . . . . 1585--1588 Shady Elbassuoni and Katja Hose and Steffen Metzger and Ralf Schenkel ROXXI: Reviving witness dOcuments to eXplore eXtracted Information . . . . . 1589--1592 Arash Termehchy and Marianne Winslett EXTRUCT: using deep structural information in XML keyword search . . . 1593--1596 Javad Akbarnejad and Gloria Chatzopoulou and Magdalini Eirinaki and Suju Koshy and Sarika Mittal and Duc On and Neoklis Polyzotis and Jothi S. Vindhiya Varman SQL QueRIE recommendations . . . . . . . 1597--1600 Hock Hee Ang and Vivekanand Gopalkrishnan and Wee Keong Ng and Steven C. H. Hoi P2PDocTagger: content management through automated P2P collaborative tagging . . 1601--1604 Vinay Setty and Srikanta Bedathur and Klaus Berberich and Gerhard Weikum InZeit: efficiently identifying insightful time points . . . . . . . . . 1605--1608 Aixin Sun and Sourav S. Bhowmick and Yao Liu iAVATAR: an interactive tool for finding and visualizing visual-representative tags in image search . . . . . . . . . . 1609--1612 Thomas Kabisch and Eduard C. Dragut and Clement Yu and Ulf Leser Deep Web integration with VisQI . . . . 1613--1616 Xin Luna Dong and Laure Berti-Equille and Yifan Hu and Divesh Srivastava SOLOMON: seeking the truth via copying detection . . . . . . . . . . . . . . . 1617--1620 Martin Hentschel and Laura Haas and Renée J. Miller Just-in-time data integration in action 1621--1624 Alexander Alexandrov and Max Heimel and Volker Markl and Dominic Battré and Fabian Hueske and Erik Nijkamp and Stephan Ewen and Odej Kao and Daniel Warneke Massively parallel data analysis with PACTs on Nephele . . . . . . . . . . . . 1625--1628 Morten Middelfart and Torben Bach Pedersen Using sentinel technology in the TARGIT BI suite . . . . . . . . . . . . . . . . 1629--1632 Stephan Günnemann and Ines Färber and Hardy Kremer and Thomas Seidl CoDA: interactive cluster based concept discovery . . . . . . . . . . . . . . . 1633--1636 Sonia Bergamaschi and Elton Domnori and Francesco Guerra and Mirko Orsini and Raquel Trillo Lado and Yannis Velegrakis Keymantic: semantic keyword-based searching in data integration systems 1637--1640 Lukasz Golab and Howard Karloff and Flip Korn and Divesh Srivastava Data Auditor: exploring data quality and semantics using pattern tableaux . . . . 1641--1644 Anil K. Nori Distributed caching platforms . . . . . 1645--1646 Divyakant Agrawal and Sudipto Das and Amr El Abbadi Big data and cloud computing: new wine or just new bottles? . . . . . . . . . . 1647--1648 Hanan Samet Techniques for similarity searching in multimedia databases . . . . . . . . . . 1649--1650 Opher Etzion Event processing: past, present and future . . . . . . . . . . . . . . . . . 1651--1652 Matthias Renz and Reynold Cheng and Hans-Peter Kriegel Similarity search and mining in uncertain databases . . . . . . . . . . 1653--1654 S. Muthukrishnan Data management and mining in Internet AD systems . . . . . . . . . . . . . . . 1655--1656
Patrick Kling and M. Tamer Özsu and Khuzaima Daudjee Generating efficient execution plans for vertically partitioned XML databases . . 1--11 Xiang Lian and Lei Chen A generic framework for handling uncertain data with local correlations 12--21 Nodira Khoussainova and YongChul Kwon and Magdalena Balazinska and Dan Suciu SnipSuggest: context-aware autocompletion for SQL . . . . . . . . . 22--33 Alexandra Meliou and Wolfgang Gatterbauer and Katherine F. Moore and Dan Suciu The complexity of causality and responsibility for query answers and non-answers . . . . . . . . . . . . . . 34--45
Guy Sagy and Daniel Keren and Izchak Sharfman and Assaf Schuster Distributed threshold querying of general functions by a difference of monotonic representation . . . . . . . . 46--57 Nan Wang and Jingbo Zhang and Kian-Lee Tan and Anthony K. H. Tung On triangulation-based dense neighborhood graph discovery . . . . . . 58--68 Michael Rice and Vassilis J. Tsotras Graph indexing of road networks for shortest path queries with label restrictions . . . . . . . . . . . . . . 69--80 Li Qian and Kristen LeFevre and H. V. Jagadish CRIUS: user-friendly database design . . 81--92 João B. Rocha-Junior and Akrivi Vlachou and Christos Doulkeridis and Kjetil Nòrvåg Efficient processing of top-$k$ spatial preference queries . . . . . . . . . . . 93--104 Martin Grund and Jens Krüger and Hasso Plattner and Alexander Zeier and Philippe Cudre-Mauroux and Samuel Madden HYRISE: a main memory hybrid storage engine . . . . . . . . . . . . . . . . . 105--116 Carlo A. Curino and Hyun Jin Moon and Alin Deutsch and Carlo Zaniolo Update rewriting and integrity constraint maintenance in a schema evolution support system: PRISM++ . . . 117--128 Ermelinda Oro and Massimo Ruffolo and Steffen Staab SXPath: extending XPath towards spatial querying on Web documents . . . . . . . 129--140 Mingxuan Yuan and Lei Chen and Philip S. Yu Personalized privacy protection in social networks . . . . . . . . . . . . 141--150
Guilherme A. Toda and Eli Cortez and Altigran S. da Silva and Edleno de Moura A probabilistic approach for automatically filling form-based Web interfaces . . . . . . . . . . . . . . . 151--160 Panagiotis Papadimitriou and Hector Garcia-Molina and Ali Dasdan and Santanu Kolay Output URL bidding . . . . . . . . . . . 161--172 Bahman Bahmani and Abdur Chowdhury and Ashish Goel Fast incremental and personalized PageRank . . . . . . . . . . . . . . . . 173--184 Jongwuk Lee and Seung-won Hwang QSkycube: efficient skycube computation using point-based space partitioning . . 185--196 Bin Liu and Chee-Yong Chan ZINC: efficient indexing for skyline computation . . . . . . . . . . . . . . 197--207
Vibhor Rastogi and Nilesh Dalvi and Minos Garofalakis Large-scale collective entity matching 208--218 Nilesh Dalvi and Ravi Kumar and Mohamed Soliman Automatic wrappers for large scale Web extraction . . . . . . . . . . . . . . . 219--230 Xintian Yang and Srinivasan Parthasarathy and P. Sadayappan Fast sparse matrix-vector multiplication on GPUs: implications for graph mining 231--242 Jun Rao and Eugene J. Shekita and Sandeep Tata Using Paxos to build a scalable, consistent, and highly available datastore . . . . . . . . . . . . . . . 243--254 Bolin Ding and Arnd Christian König Fast set intersection in memory . . . . 255--266
Aditya Parameswaran and Anish Das Sarma and Hector Garcia-Molina and Neoklis Polyzotis and Jennifer Widom Human-assisted graph search: it's okay to ask questions . . . . . . . . . . . . 267--278 Mohamed Yakout and Ahmed K. Elmagarmid and Jennifer Neville and Mourad Ouzzani and Ihab F. Ilyas Guided data repair . . . . . . . . . . . 279--289 Petros Venetis and Hector Gonzalez and Christian S. Jensen and Alon Halevy Hyper-local, directions-based ranking of places . . . . . . . . . . . . . . . . . 290--301 M. Levent Koc and Christopher Ré Incrementally maintaining classification using an RDBMS . . . . . . . . . . . . . 302--313 Bingsheng He and Jeffrey Xu Yu High-throughput transaction executions on graphics processors . . . . . . . . . 314--325 Zhao Cao and Charles Sutton and Yanlei Diao and Prashant Shenoy Distributed inference and query processing for RFID tracking and monitoring . . . . . . . . . . . . . . . 326--337
Hongrae Lee and Raymond T. Ng and Kyuseok Shim Similarity join size estimation using locality sensitive hashing . . . . . . . 338--349 Ziyang Liu and Sivaramakrishnan Natarajan and Yi Chen Query expansion based on clustered results . . . . . . . . . . . . . . . . 350--361 Debabrata Dash and Neoklis Polyzotis and Anastasia Ailamaki CoPhy: a scalable, portable, and interactive index advisor for large workloads . . . . . . . . . . . . . . . 362--372 Feng Niu and Christopher Ré and AnHai Doan and Jude Shavlik Tuffy: scaling up statistical inference in Markov logic networks using an RDBMS 373--384 Eaman Jahani and Michael J. Cafarella and Christopher Ré Automatic optimization for MapReduce programs . . . . . . . . . . . . . . . . 385--396 De-Nian Yang and Yi-Ling Chen and Wang-Chien Lee and Ming-Syan Chen On social-temporal group query with acquaintance constraint . . . . . . . . 397--408
Hoa Nguyen and Ariel Fuxman and Stelios Paparizos and Juliana Freire and Rakesh Agrawal Synthesizing products for online catalogs . . . . . . . . . . . . . . . . 409--418 Avrilia Floratou and Jignesh M. Patel and Eugene J. Shekita and Sandeep Tata Column-oriented storage techniques for MapReduce . . . . . . . . . . . . . . . 419--429 David Lomet and Kostas Tzoumas and Michael Zwilling Implementing performance competitive logical recovery . . . . . . . . . . . . 430--439 Ashwin Machanavajjhala and Aleksandra Korolova and Atish Das Sarma Personalized social recommendations: accurate or private . . . . . . . . . . 440--450 Gabriele Capannini and Franco Maria Nardini and Raffaele Perego and Fabrizio Silvestri Efficient diversification of Web search results . . . . . . . . . . . . . . . . 451--459 Gianmarco De Francisci Morales and Aristides Gionis and Mauro Sozio Social content matching in MapReduce . . 460--469
Naiyong Ao and Fan Zhang and Di Wu and Douglas S. Stones and Gang Wang and Xiaoguang Liu and Jing Liu and Sheng Lin Efficient parallel lists intersection and index compression algorithms using graphics processing units . . . . . . . 470--481 Lei Zou and Jinghui Mo and Lei Chen and M. Tamer Özsu and Dongyan Zhao gStore: answering SPARQL queries via subgraph matching . . . . . . . . . . . 482--493 Sudipto Das and Shoji Nishimura and Divyakant Agrawal and Amr El Abbadi Albatross: lightweight elasticity in shared storage databases for the cloud using live data migration . . . . . . . 494--505 Sarana Nutanong and Edwin H. Jacox and Hanan Samet An incremental Hausdorff distance calculation algorithm . . . . . . . . . 506--517 Barbara Blaustein and Adriane Chapman and Len Seligman and M. David Allen and Arnon Rosenthal Surrogate parenthood: protected and informative graphs . . . . . . . . . . . 518--525
Petros Venetis and Alon Halevy and Jayant Madhavan and Marius Pasca and Warren Shen and Fei Wu and Gengxin Miao and Chung Wu Recovering semantics of tables on the web . . . . . . . . . . . . . . . . . . 528--538 Thomas Neumann Efficiently compiling efficient query plans for modern hardware . . . . . . . 539--550 Ruoming Jin and Lin Liu and Bolin Ding and Haixun Wang Distance-constraint reachability computation in uncertain graphs . . . . 551--562 Yun Chi and Hyun Jin Moon and Hakan Hacigümüs iCBS: incremental cost-based scheduling under piecewise linear SLAs . . . . . . 563--574 Mohamed Y. Eltabakh and Yuanyuan Tian and Fatma Özcan and Rainer Gemulla and Aljoscha Krettek and John McPherson CoHadoop: flexible data placement and its exploitation in Hadoop . . . . . . . 575--585 Stratos Idreos and Stefan Manegold and Harumi Kuno and Goetz Graefe Merging what's cracked, cracking what's merged: adaptive indexing in main-memory column-stores . . . . . . . . . . . . . 586--597
Chonghai Wang and Li Yan Yuan and Jia-Huai You and Osmar R. Zaiane and Jian Pei On pruning for top-$k$ ranking in uncertain databases . . . . . . . . . . 598--609 Ippokratis Pandis and Pinar Tözün and Ryan Johnson and Anastasia Ailamaki PLP: page latch-free shared-everything OLTP . . . . . . . . . . . . . . . . . . 610--621 Jiannan Wang and Guoliang Li and Jeffrey Xu Yu and Jianhua Feng Entity matching: how similar is similar 622--633 Di Wang and Elke A. Rundensteiner and Richard T. Ellison III Active complex event processing over event streams . . . . . . . . . . . . . 634--645 Ceren Budak and Divyakant Agrawal and Amr El Abbadi Structural trend analysis for online social networks . . . . . . . . . . . . 646--656 Hideaki Kimura and Vivek Narasayya and Manoj Syamala Compression aware physical database design . . . . . . . . . . . . . . . . . 657--668 Thomas Bernecker and Tobias Emrich and Hans-Peter Kriegel and Matthias Renz and Stefan Zankl and Andreas Züfle Efficient probabilistic reverse nearest neighbor query processing on uncertain data . . . . . . . . . . . . . . . . . . 669--680 Mehdi Kargar and Aijun An Keyword search in graphs: finding $r$-cliques . . . . . . . . . . . . . . 681--692
Daniel Fabbri and Kristen LeFevre Explanation-based auditing . . . . . . . 1--12 Adam Marcus and Eugene Wu and David Karger and Samuel Madden and Robert Miller Human-powered sorts and joins . . . . . 13--24 Graham Cormode and Justin Thaler and Ke Yi Verifying computations with streaming interactive proofs . . . . . . . . . . . 25--36 Dan Lin and Christian S. Jensen and Rui Zhang and Lu Xiao and Jiaheng Lu A moving-object index for efficient query processing with peer-wise location privacy . . . . . . . . . . . . . . . . 37--48 Essam Mansour and Amin Allam and Spiros Skiadopoulos and Panos Kalnis ERA: efficient serial and parallel suffix tree construction for very long strings . . . . . . . . . . . . . . . . 49--60 Jens Krueger and Changkyu Kim and Martin Grund and Nadathur Satish and David Schwalb and Jatin Chhugani and Hasso Plattner and Pradeep Dubey and Alexander Zeier Fast updates on read-optimized databases using multi-core CPUs . . . . . . . . . 61--72 Amit Goyal and Francesco Bonchi and Laks V. S. Lakshmanan A data-based approach to social influence maximization . . . . . . . . . 73--84
Andrew Pavlo and Evan P. C. Jones and Stanley Zdonik On predictive modeling for optimizing transaction execution in parallel OLTP systems . . . . . . . . . . . . . . . . 85--96 François Goasdoué and Konstantinos Karanasos and Julien Leblay and Ioana Manolescu View selection in Semantic Web databases 97--108 Jeffrey Jestes and Ke Yi and Feifei Li Building wavelet histograms on large data in MapReduce . . . . . . . . . . . 109--120 Di Yang and Elke A. Rundensteiner and Matthew O. Ward Summarization and matching of density-based clusters in streaming environments . . . . . . . . . . . . . . 121--132 Thanh Nguyen and Viviane Moreira and Huong Nguyen and Hoa Nguyen and Juliana Freire Multilingual schema matching for Wikipedia infoboxes . . . . . . . . . . 133--144 Guimei Liu and Haojun Zhang and Limsoon Wong Controlling false positives in association rule mining . . . . . . . . 145--156
Fabian M. Suchanek and Serge Abiteboul and Pierre Senellart PARIS: probabilistic alignment of relations, instances, and schema . . . . 157--168 Sayan Ranu and Ambuj K. Singh Answering top-$k$ queries over a mixture of attractive and repulsive dimensions 169--180 Michael Armbrust and Kristal Curtis and Tim Kraska and Armando Fox and Michael J. Franklin and David A. Patterson PIQL: success-tolerant query processing in the cloud . . . . . . . . . . . . . . 181--192 Peixiang Zhao and Charu C. Aggarwal and Min Wang gSketch: on query estimation in graph streams . . . . . . . . . . . . . . . . 193--204 Brian E. Ruttenberg and Ambuj K. Singh Indexing the earth mover's distance using normal distributions . . . . . . . 205--216 Rani Qumsiyeh and Maria S. Pera and Yiu-Kai Ng Generating exact- and ranked partially-matched answers to questions in advertisements . . . . . . . . . . . 217--228 Georgios J. Fakas and Zhi Cai and Nikos Mamoulis Size-$l$ object summaries for relational keyword search . . . . . . . . . . . . . 229--240 Lujun Fang and Anish Das Sarma and Cong Yu and Philip Bohannon REX: explaining relationships between entity pairs . . . . . . . . . . . . . . 241--252 Guoliang Li and Dong Deng and Jiannan Wang and Jianhua Feng Pass-join: a partition-based method for similarity joins . . . . . . . . . . . . 253--264 Christopher Hoobin and Simon J. Puglisi and Justin Zobel Relative Lempel--Ziv factorization for efficient storage and retrieval of Web collections . . . . . . . . . . . . . . 265--273
Ning Zhang and Junichi Tatemura and Jignesh M. Patel and Hakan Hacigümüs Towards cost-effective storage provisioning for DBMSs . . . . . . . . . 274--285 Hongchan Roh and Sanghyun Park and Sungho Kim and Mincheol Shin and Sang-Won Lee B+-tree index optimization by exploiting internal parallelism of flash-based solid state drives . . . . . . . . . . . 286--297 Per-Åke Larson and Spyros Blanas and Cristian Diaconu and Craig Freedman and Jignesh M. Patel and Mike Zwilling High-performance concurrency control mechanisms for main-memory databases . . 298--309 Shuai Ma and Yang Cao and Wenfei Fan and Jinpeng Huai and Tianyu Wo Capturing topology in graph pattern matching . . . . . . . . . . . . . . . . 310--321 Arun Kumar and Christopher Ré Probabilistic management of OCR data using an RDBMS . . . . . . . . . . . . . 322--333 Mateusz Pawlik and Nikolaus Augsten RTED: a robust algorithm for the tree edit distance . . . . . . . . . . . . . 334--345 Yael Amsterdamer and Susan B. Davidson and Daniel Deutch and Tova Milo and Julia Stoyanovich and Val Tannen Putting lipstick on pig: enabling database-style workflow provenance . . . 346--357 Jun Gao and Ruoming Jin and Jiashuai Zhou and Jeffrey Xu Yu and Xiao Jiang and Tengjiao Wang Relational approach for shortest path discovery over large graphs . . . . . . 358--369 Marina Barsky and Sangkyum Kim and Tim Weninger and Jiawei Han Mining flipping correlations from large datasets with taxonomies . . . . . . . . 370--381 Arnd Christian König and Bolin Ding and Surajit Chaudhuri and Vivek Narasayya A statistical approach towards robust progress estimation . . . . . . . . . . 382--393
Yizhou Sun and Charu C. Aggarwal and Jiawei Han Relation strength-aware clustering of heterogeneous information networks with incomplete attributes . . . . . . . . . 394--405 Lingkun Wu and Xiaokui Xiao and Dingxiong Deng and Gao Cong and Andy Diwen Zhu and Shuigeng Zhou Shortest path and distance queries on road networks: an experimental evaluation . . . . . . . . . . . . . . . 406--417 Dóra Erd\Hos and Vatche Ishakian and Andrei Lapets and Evimaria Terzi and Azer Bestavros The filter-placement problem and its application to minimizing information multiplicity . . . . . . . . . . . . . . 418--429 Venu Satuluri and Srinivasan Parthasarathy Bayesian locality sensitive hashing for fast similarity search . . . . . . . . . 430--441 Yasuhiro Fujiwara and Makoto Nakatsuji and Makoto Onizuka and Masaru Kitsuregawa Fast and exact top-$k$ search for random walk with restart . . . . . . . . . . . 442--453 Bahman Bahmani and Ravi Kumar and Sergei Vassilvitskii Densest subgraph in streaming and MapReduce . . . . . . . . . . . . . . . 454--465 Arlei Silva and Wagner Meira, Jr. and Mohammed J. Zaki Mining attribute-structure correlated patterns in large attributed graphs . . 466--477 Karl Schnaitter and Neoklis Polyzotis Semi-automatic index tuning: keeping DBAs in the loop . . . . . . . . . . . . 478--489 Robert Fink and Larisa Han and Dan Olteanu Aggregation in probabilistic databases via knowledge compilation . . . . . . . 490--501
Felix Halim and Stratos Idreos and Panagiotis Karras and Roland H. C. Yap Stochastic database cracking: towards robust adaptive indexing in main-memory column-stores . . . . . . . . . . . . . 502--513 Chao Li and Gerome Miklau An adaptive mechanism for accurate query answering under differential privacy . . 514--525 Georgios Giannikis and Gustavo Alonso and Donald Kossmann SharedDB: killing one thousand queries with one stone . . . . . . . . . . . . . 526--537 Joachim Selke and Christoph Lofi and Wolf-Tilo Balke Pushing the boundaries of crowd-enabled databases with query-driven schema expansion . . . . . . . . . . . . . . . 538--549 Bo Zhao and Benjamin I. P. Rubinstein and Jim Gemmell and Jiawei Han A Bayesian approach to discovering truth from conflicting sources for data integration . . . . . . . . . . . . . . 550--561 Prasang Upadhyaya and Magdalena Balazinska and Dan Suciu How to price shared optimizations in the cloud . . . . . . . . . . . . . . . . . 562--573 Albert Angel and Nikos Sarkas and Nick Koudas and Divesh Srivastava Dense subgraph maintenance under streaming edge weight updates for real-time story identification . . . . . 574--585 Iman Elghandour and Ashraf Aboulnaga ReStore: reusing results of MapReduce jobs . . . . . . . . . . . . . . . . . . 586--597
Nodira Khoussainova and Magdalena Balazinska and Dan Suciu PerfXplain: debugging MapReduce job performance . . . . . . . . . . . . . . 598--609 Francesco Gullo and Andrea Tagarelli Uncertain centroid based partitional clustering of uncertain data . . . . . . 610--621 Bahman Bahmani and Benjamin Moseley and Andrea Vattani and Ravi Kumar and Sergei Vassilvitskii Scalable $k$-means$ + + $ . . . . . . . 622--633 Michael Benedikt and Pierre Bourhis and Clemens Ley Querying schemas with access restrictions . . . . . . . . . . . . . . 634--645 Goetz Graefe and Harumi Kuno Definition, detection, and recovery of single-page failures, a fourth class of database failures . . . . . . . . . . . 646--655 Goetz Graefe and Felix Halim and Stratos Idreos and Harumi Kuno and Stefan Manegold Concurrency control for adaptive indexing . . . . . . . . . . . . . . . . 656--667 Qiang Zeng and Hai Zhuge Comments on ``Stack-based Algorithms for Pattern Matching on DAGs'' . . . . . . . 668--679 Nilesh Dalvi and Ashwin Machanavajjhala and Bo Pang An analysis of structured data on the web . . . . . . . . . . . . . . . . . . 680--691
Kyriakos Mouratidis and Man Lung Yiu Shortest path computation with no information leakage . . . . . . . . . . 692--703 Ahmed Metwally and Christos Faloutsos V-SMART-join: a scalable MapReduce framework for all-pair similarity joins of multisets and vectors . . . . . . . . 704--715 Yucheng Low and Danny Bickson and Joseph Gonzalez and Carlos Guestrin and Aapo Kyrola and Joseph M. Hellerstein Distributed GraphLab: a framework for machine learning and data mining in the cloud . . . . . . . . . . . . . . . . . 716--727 Qiang Zeng and Xiaorui Jiang and Hai Zhuge Adding logical operators to tree pattern queries on graph-structured data . . . . 728--739 Rishabh Singh and Sumit Gulwani Learning semantic string transformations from examples . . . . . . . . . . . . . 740--751 Changbin Liu and Lu Ren and Boon Thau Loo and Yun Mao and Prithwish Basu Cologne: a declarative distributed constraint optimization platform . . . . 752--763 Yi Zhang and Jun Yang Optimizing I/O for big array analytics 764--775 Peter Bailis and Shivaram Venkataraman and Michael J. Franklin and Joseph M. Hellerstein and Ion Stoica Probabilistically bounded staleness for practical partial quorums . . . . . . . 776--787
Zhao Sun and Hongzhi Wang and Haixun Wang and Bin Shao and Jianzhong Li Efficient subgraph matching on billion node graphs . . . . . . . . . . . . . . 788--799 Ye Yuan and Guoren Wang and Lei Chen and Haixun Wang Efficient subgraph similarity search on large probabilistic graph databases . . 800--811 Jia Wang and James Cheng Truss decomposition in massive networks 812--823 Ju Fan and Guoliang Li and Lizhu Zhou and Shanshan Chen and Jun Hu Seal: spatio-textual similarity search 824--835 Theodoros Lappas and Marcos R. Vieira and Dimitrios Gunopulos and Vassilis J. Tsotras On the spatiotemporal burstiness of terms . . . . . . . . . . . . . . . . . 836--847 Houtan Shirani-Mehr and Farnoush Banaei-Kashani and Cyrus Shahabi Efficient reachability query evaluation in large spatiotemporal contact datasets 848--859 Thi Nguyen and Zhen He and Rui Zhang and Phillip Ward Boosting moving object indexing through velocity partitioning . . . . . . . . . 860--871 Nicole Bidoit-Tollu and Dario Colazzo and Federico Ulliana Type-based detection of XML query-update independence . . . . . . . . . . . . . . 872--883 Benjamin Sowell and Wojciech Golab and Mehul A. Shah Minuet: a scalable distributed multiversion B-tree . . . . . . . . . . 884--895 Hongzhi Yin and Bin Cui and Jing Li and Junjie Yao and Chen Chen Challenging the long tail recommendation 896--907
Rakesh Pimplikar and Sunita Sarawagi Answering table queries on the Web using column keywords . . . . . . . . . . . . 908--919 Michael T. Goodrich and Charalampos Papamanthou and Duy Nguyen and Roberto Tamassia and Cristina Videira Lopes and Olga Ohrimenko and Nikos Triandopoulos Efficient verification of web-content searching through authenticated web crawlers . . . . . . . . . . . . . . . . 920--931 Lukas Blunschi and Claudio Jossen and Donald Kossmann and Magdalini Mori and Kurt Stockinger SODA: generating SQL for business users 932--943 Manolis Terrovitis and Nikos Mamoulis and John Liagouris and Spiros Skiadopoulos Privacy preservation by disassociation 944--955 Bhargav Kanagal and Amr Ahmed and Sandeep Pandey and Vanja Josifovski and Jeff Yuan and Lluis Garcia-Pueyo Supercharging recommender systems using taxonomies for learning user purchase behavior . . . . . . . . . . . . . . . . 956--967 Yanif Ahmad and Oliver Kennedy and Christoph Koch and Milos Nikolic DBToaster: higher-order delta processing for dynamic, frequently fresh views . . 968--979 Manoj K. Agarwal and Krithi Ramamritham and Manish Bhide Real time discovery of dense clusters in highly dynamic graphs: identifying real world events in highly dynamic environments . . . . . . . . . . . . . . 980--991 Odysseas Papapetrou and Minos Garofalakis and Antonios Deligiannakis Sketch-based querying of distributed sliding-window data streams . . . . . . 992--1003 Hoang Tam Vo and Sheng Wang and Divyakant Agrawal and Gang Chen and Beng Chin Ooi LogBase: a scalable log-structured database system in the cloud . . . . . . 1004--1015 Wei Lu and Yanyan Shen and Su Chen and Beng Chin Ooi Efficient processing of $k$ nearest neighbor joins using MapReduce . . . . . 1016--1027 Nikolay Laptev and Kai Zeng and Carlo Zaniolo Early accurate results for advanced analytics on MapReduce . . . . . . . . . 1028--1039 Xuan Liu and Meiyu Lu and Beng Chin Ooi and Yanyan Shen and Sai Wu and Meihui Zhang CDAS: a crowdsourcing data analytics system . . . . . . . . . . . . . . . . . 1040--1051 Mayank Sachan and Arnab Bhattacharya Mining statistically significant substrings using the chi-square statistic . . . . . . . . . . . . . . . 1052--1063 Martina-Cezara Albutiu and Alfons Kemper and Thomas Neumann Massively parallel sort-merge joins in main memory multi-core database systems 1064--1075 Tian Luo and Rubao Lee and Michael Mesnier and Feng Chen and Xiaodong Zhang hStorage-DB: heterogeneity-aware data management to exploit the full capability of hybrid storage systems . . 1076--1087
Dong-Wan Choi and Chin-Wan Chung and Yufei Tao A scalable algorithm for maximizing range sum in spatial databases . . . . . 1088--1099 Ahmed M. Aly and Walid G. Aref and Mourad Ouzzani Spatial queries with two kNN predicates 1100--1111 Cheng Sheng and Nan Zhang and Yufei Tao and Xin Jin Optimal algorithms for crawling a hidden database in the web . . . . . . . . . . 1112--1123 Lu Qin and Jeffrey Xu Yu and Lijun Chang Diversifying top-$k$ results . . . . . . 1124--1135 Xin Cao and Lisi Chen and Gao Cong and Xiaokui Xiao Keyword-aware optimal route search . . . 1136--1147 Bogdan Cautis and Evgeny Kharlamov Answering queries using views over probabilistic XML: complexity and tractability . . . . . . . . . . . . . . 1148--1159 Abhay Jha and Dan Suciu Probabilistic databases with MarkoViews 1160--1171 Konstantinos Mamouras and Sigal Oren and Lior Seeman and Lucja Kot and Johannes Gehrke The complexity of social coordination 1172--1183 Xiaofei Zhang and Lei Chen and Min Wang Efficient multi-way theta-join processing using MapReduce . . . . . . . 1184--1195 Harold Lim and Herodotos Herodotou and Shivnath Babu Stubby: a transformation-based optimizer for MapReduce workflows . . . . . . . . 1196--1207 Zhuowei Bao and Susan B. Davidson and Tova Milo Labeling workflow views with fine-grained dependencies . . . . . . . 1208--1219 Jaros\law Szlichta and Parke Godfrey and Jarek Gryz Fundamentals of order dependencies . . . 1220--1231 Nurzhan Bakibayev and Dan Olteanu and Jakub Závodný FDB: a query engine for factorised relational databases . . . . . . . . . . 1232--1243 Yu Cao and Chee-Yong Chan and Jie Li and Kian-Lee Tan Optimization of analytic window functions . . . . . . . . . . . . . . . 1244--1255 Fabian Hueske and Mathias Peters and Matthias J. Sax and Astrid Rheinländer and Rico Bergmann and Aljoscha Krettek and Kostas Tzoumas Opening the black boxes in data flow optimization . . . . . . . . . . . . . . 1256--1267 Stephan Ewen and Kostas Tzoumas and Moritz Kaufmann and Volker Markl Spinning fast iterative data flows . . . 1268--1279 Svilen R. Mihaylov and Zachary G. Ives and Sudipto Guha REX: recursive, delta-based data-centric computation . . . . . . . . . . . . . . 1280--1291 James Cheng and Zechao Shang and Hong Cheng and Haixun Wang and Jeffrey Xu Yu K-reach: who is in your small world . . 1292--1303 Wenfei Fan and Xin Wang and Yinghui Wu Performance guarantees for distributed reachability queries . . . . . . . . . . 1304--1316 Pirooz Chubak and Davood Rafiei Efficient indexing and querying over syntactically annotated trees . . . . . 1316--1327 Vince Bárány and Balder ten Cate and Martin Otto Queries with guarded negation . . . . . 1328--1339 Ninghui Li and Wahbeh Qardaji and Dong Su and Jianneng Cao PrivBasis: frequent itemset mining with differential privacy . . . . . . . . . . 1340--1351 Ganzhao Yuan and Zhenjie Zhang and Marianne Winslett and Xiaokui Xiao and Yin Yang and Zhifeng Hao Low-rank mechanism: optimizing batch queries under differential privacy . . . 1352--1363 Jun Zhang and Zhenjie Zhang and Xiaokui Xiao and Yin Yang and Marianne Winslett Functional mechanism: regression analysis under differential privacy . . 1364--1375 Paolo Boldi and Francesco Bonchi and Aristides Gionis and Tamir Tassa Injecting uncertainty in graphs for identity obfuscation . . . . . . . . . . 1376--1387 Jianneng Cao and Panagiotis Karras Publishing microdata with a robust privacy guarantee . . . . . . . . . . . 1388--1399 Ziyu Guan and Xifeng Yan and Lance M. Kaplan Measuring two-event structural correlations on graphs . . . . . . . . . 1400--1411 Jeffrey Jestes and Jeff M. Phillips and Feifei Li and Mingwang Tang Ranking large temporal data . . . . . . 1412--1423 Florian Funke and Alfons Kemper and Thomas Neumann Compacting transactional data in hybrid OLTP&OLAP databases . . . . . . . . . . . 1424--1435 Alexander Hall and Olaf Bachmann and Robert Büssow and Silviu Ganceanu and Marc Nunkesser Processing a trillion cells per mouse click . . . . . . . . . . . . . . . . . 1436--1446 Danica Porobic and Ippokratis Pandis and Miguel Branco and Pinar Tözün and Anastasia Ailamaki OLTP on hardware islands . . . . . . . . 1447--1458 Stacy Patterson and Aaron J. Elmore and Faisal Nawab and Divyakant Agrawal and Amr El Abbadi Serializability, not serial: concurrency control and availability in multi-datacenter datastores . . . . . . 1459--1470 Alvin Cheung and Samuel Madden and Owen Arden and Andrew C. Myers Automatic partitioning of database applications . . . . . . . . . . . . . . 1471--1482 Jiannan Wang and Tim Kraska and Michael J. Franklin and Jianhua Feng CrowdER: crowdsourcing entity resolution 1483--1494 Caleb Chen Cao and Jieying She and Yongxin Tong and Lei Chen Whom to ask?: jury selection for decision making tasks on micro-blog services . . . . . . . . . . . . . . . . 1495--1506 Xiaochun Yang and Honglei Liu and Bin Wang ALAE: accelerating local alignment with affine gap exactly in biosequence databases . . . . . . . . . . . . . . . 1507--1518 K. Selçuk Candan and Rosaria Rossini and Xiaolan Wang and Maria Luisa Sapino sDTW: computing DTW distances using locally relevant constraints based on salient feature alignments . . . . . . . 1519--1530 Farhan Tauheed and Thomas Heinis and Felix Schürmann and Henry Markram and Anastasia Ailamaki SCOUT: prefetching for latent structure following queries . . . . . . . . . . . 1531--1542 Kaibo Wang and Yin Huai and Rubao Lee and Fusheng Wang and Xiaodong Zhang and Joel H. Saltz Accelerating pathology image data cross-comparison on CPU--GPU hybrid systems . . . . . . . . . . . . . . . . 1543--1554 Jiexing Li and Arnd Christian König and Vivek Narasayya and Surajit Chaudhuri Robust estimation of resource consumption for SQL queries using statistical techniques . . . . . . . . . 1555--1566 Mahashweta Das and Saravanan Thirumuruganathan and Sihem Amer-Yahia and Gautam Das and Cong Yu Who tags what?: an analysis framework 1567--1578 Haohan Zhu and George Kollios and Vassilis Athitsos A generic framework for efficient and effective subsequence retrieval . . . . 1579--1590 Jens Dittrich and Jorge-Arnulfo Quiané-Ruiz and Stefan Richter and Stefan Schuh and Alekh Jindal and Jörg Schad Only aggressive elephants are fast elephants . . . . . . . . . . . . . . . 1591--1602 Rui Li and Shengjie Wang and Kevin Chen-Chuan Chang Multiple location profiling for users and relationships from social network and content . . . . . . . . . . . . . . 1603--1614 Woon-Hak Kang and Sang-Won Lee and Bongki Moon Flash-based extended cache for higher throughput and faster recovery . . . . . 1615--1626 Michael A. Bender and Martin Farach-Colton and Rob Johnson and Russell Kraner and Bradley C. Kuszmaul and Dzejla Medjedovic and Pablo Montes and Pradeep Shetty and Richard P. Spillane and Erez Zadok Don't thrash: how to cache your hash on flash . . . . . . . . . . . . . . . . . 1627--1637 Robert Isele and Christian Bizer Learning expressive linkage rules using genetic programming . . . . . . . . . . 1638--1649 Yongxin Tong and Lei Chen and Yurong Cheng and Philip S. Yu Mining frequent itemsets over uncertain databases . . . . . . . . . . . . . . . 1650--1661 Michele Dallachiesa and Besmira Nushi and Katsiaryna Mirylenka and Themis Palpanas Uncertain time-series similarity: return to the basics . . . . . . . . . . . . . 1662--1673 Tamraparni Dasu and Ji Meng Loh Statistical distortion: consequences of data cleaning . . . . . . . . . . . . . 1674--1683 Willis Lang and Stavros Harizopoulos and Jignesh M. Patel and Mehul A. Shah and Dimitris Tsirogiannis Towards energy-efficient database cluster design . . . . . . . . . . . . . 1684--1695
Christian S. Jensen Data management on the spatial web . . . 1696--1696 Brenda Dietrich Data analytics opportunities in a smarter planet . . . . . . . . . . . . . 1697--1697 Kenan Sahin Challenges in economic massive content storage and management (MCSAM) in the era of self-organizing, self-expanding and self-linking data clusters . . . . . 1698--1698 Gurmeet Singh Manku and Rajeev Motwani Approximate frequency counts over data streams . . . . . . . . . . . . . . . . 1699--1699 Joseph M. Hellerstein and Christoper Ré and Florian Schoppmann and Daisy Zhe Wang and Eugene Fratkin and Aleksander Gorajek and Kee Siong Ng and Caleb Welton and Xixuan Feng and Kun Li and Arun Kumar The MADlib analytics library: or MAD skills, the SQL . . . . . . . . . . . . 1700--1711 Avrilia Floratou and Nikhil Teletia and David J. DeWitt and Jignesh M. Patel and Donghui Zhang Can the elephants handle the NoSQL onslaught? . . . . . . . . . . . . . . . 1712--1723 Tilmann Rabl and Sergio Gómez-Villamor and Mohammad Sadoghi and Victor Muntés-Mulero and Hans-Arno Jacobsen and Serge Mankovskii Solving big data challenges for enterprise application performance management . . . . . . . . . . . . . . . 1724--1735 Avraham Shinnar and David Cunningham and Vijay Saraswat and Benjamin Herta M3R: increased performance for in-memory Hadoop jobs . . . . . . . . . . . . . . 1736--1747 Philipp Rösch and Lars Dannecker and Franz Färber and Gregor Hackenbroich A storage advisor for hybrid-store databases . . . . . . . . . . . . . . . 1748--1758 Micha\l \'Switakowski and Peter Boncz and Marcin Zukowski From cooperative scans to predictive buffer management . . . . . . . . . . . 1759--1770 George Lee and Jimmy Lin and Chuang Liu and Andrew Lorek and Dmitriy Ryaboy The unified logging infrastructure for data analytics at Twitter . . . . . . . 1771--1780 Tomas Talius and Robin Dhamankar and Andrei Dumitrache and Hanuma Kodavalla Transaction log based application error recovery and point in-time query . . . . 1781--1789 Andrew Lamb and Matt Fuller and Ramakrishna Varadarajan and Nga Tran and Ben Vandiver and Lyric Doshi and Chuck Bear The Vertica Analytic Database: C-Store 7 years later . . . . . . . . . . . . . . 1790--1801 Yanpei Chen and Sara Alspaugh and Randy Katz Interactive analytical processing in big data systems: a cross-industry study of MapReduce workloads . . . . . . . . . . 1802--1813 Wang Lam and Lu Liu and Sts Prasad and Anand Rajaraman and Zoheb Vacheri and AnHai Doan Muppet: MapReduce-style processing of fast data . . . . . . . . . . . . . . . 1814--1825 Gabriela Jacques-Silva and Bugra Gedik and Rohit Wagle and Kun-Lung Wu and Vibhore Kumar Building user-defined runtime adaptation routines for stream processing applications . . . . . . . . . . . . . . 1826--1837 Junchen Jiang and Hongji Bao and Edward Y. Chang and Yuqian Li MOIST: a scalable and parallel moving object indexer with school tracking . . 1838--1849 Dan R. K. Ports and Kevin Grittner Serializable snapshot isolation in PostgreSQL . . . . . . . . . . . . . . . 1850--1861 Karin Murthy and Prasad M. Deshpande and Atreyee Dey and Ramanujam Halasipuram and Mukesh Mohania and P. Deepak and Jennifer Reed and Scott Schumacher Exploiting evidence from unstructured data to enhance master data management 1862--1873 Lili Wu and Roshan Sumbaly and Chris Riccomini and Gordon Koo and Hyung Jin Kim and Jay Kreps and Sam Shah Avatara: OLAP for web-scale analytics products . . . . . . . . . . . . . . . . 1874--1877 Lars Kolb and Andreas Thor and Erhard Rahm Dedoop: efficient deduplication with Hadoop . . . . . . . . . . . . . . . . . 1878--1881 Xiufeng Liu and Christian Thomsen and Torben Bach Pedersen MapReduce-based dimensional ETL made easy . . . . . . . . . . . . . . . . . . 1882--1885 Huiqi Xu and Zhen Li and Shumin Guo and Keke Chen CloudVista: interactive and economical visual cluster analysis for big data in the cloud . . . . . . . . . . . . . . . 1886--1889 Alexander Alexandrov and Kostas Tzoumas and Volker Markl Myriad: scalable and expressive data generation . . . . . . . . . . . . . . . 1890--1893 Eugene Wu and Samuel Madden and Michael Stonebraker A demonstration of DBWipes: clean as you query . . . . . . . . . . . . . . . . . 1894--1897 Sattam Alsubaiee and Yasser Altowim and Hotham Altwaijry and Alexander Behm and Vinayak Borkar and Yingyi Bu and Michael Carey and Raman Grover and Zachary Heilbron and Young-Seok Kim and Chen Li and Nicola Onose and Pouria Pirzadeh and Rares Vernica and Jian Wen ASTERIX: an open source system for ``Big Data'' management and analysis (demo) 1898--1901 Sameer Agarwal and Anand P. Iyer and Aurojit Panda and Samuel Madden and Barzan Mozafari and Ion Stoica Blink and it's done: interactive queries on very large data . . . . . . . . . . . 1902--1905 Abhishek Roy and Yanlei Diao and Evan Mauceli and Yiping Shen and Bai-Lin Wu Massive genomic data processing and deep analysis . . . . . . . . . . . . . . . . 1906--1909 Erietta Liarou and Stratos Idreos and Stefan Manegold and Martin Kersten MonetDB/DataCell: online analytics in a streaming column-store . . . . . . . . . 1910--1913 Xin Cao and Gao Cong and Christian S. Jensen and Jun Jie Ng and Beng Chin Ooi and Nhan-Tue Phan and Dingming Wu SWORS: a system for the efficient retrieval of relevant spatial web objects . . . . . . . . . . . . . . . . 1914--1917 Atsuyuki Morishima and Norihide Shinagawa and Tomomi Mitsuishi and Hideto Aoki and Shun Fukusumi CyLog/Crowd4U: a declarative platform for complex data-centric crowdsourcing 1918--1921 Yasin N. Silva and Spencer Pearson Exploiting database similarity joins for metric spaces . . . . . . . . . . . . . 1922--1925 Mrunal Gawade and Martin Kersten Stethoscope: a platform for interactive visual analysis of query execution plans 1926--1929 Alexios Kotsifakos and Panagiotis Papapetrou and Jaakko Hollmén and Dimitrios Gunopulos and Vassilis Athitsos and George Kollios Hum-a-song: a subsequence matching with gaps-range-tolerances query-by-humming system . . . . . . . . . . . . . . . . . 1930--1933 YongChul Kwon and Magdalena Balazinska and Bill Howe and Jerome Rolia SkewTune in action: mitigating skew in MapReduce applications . . . . . . . . . 1934--1937 Azza Abouzied and Joseph M. Hellerstein and Avi Silberschatz Playful query specification with DataPlay . . . . . . . . . . . . . . . . 1938--1941 Ioannis Alagiannis and Renata Borovica and Miguel Branco and Stratos Idreos and Anastasia Ailamaki NoDB in action: adaptive query processing on raw data . . . . . . . . . 1942--1945 Florian Wenzel and Markus Endres and Stefan Mandl and Werner Kießling Complex preference queries supporting spatial applications for user groups . . 1946--1949 Nurzhan Bakibayev and Dan Olteanu and Jakub Závodný Demonstration of the FDB query engine for factorised databases . . . . . . . . 1950--1953 Zichen Xu and Yi-Cheng Tu and Xiaorui Wang PET: reducing database energy cost via query optimization . . . . . . . . . . . 1954--1957 Andrés Letelier and Jorge Pérez and Reinhard Pichler and Sebastian Skritek SPAM: a SPARQL analysis and manipulation tool . . . . . . . . . . . . . . . . . . 1958--1961 Paraschos Koutris and Prasang Upadhyaya and Magdalena Balazinska and Bill Howe and Dan Suciu QueryMarket demonstration: pricing for online data markets . . . . . . . . . . 1962--1965 Siqiang Luo and Yifeng Luo and Shuigeng Zhou and Gao Cong and Jihong Guan DISKs: a system for distributed spatial group keyword search on road networks 1966--1969 Stefan Endrullis and Andreas Thor and Erhard Rahm WETSUIT: an efficient mashup tool for searching and fusing web entities . . . 1970--1973 Mohamed E. Khalefa and Ulrike Fischer and Torben Bach Pedersen and Wolfgang Lehner Model-based integration of past & future in TimeTravel . . . . . . . . . . . . . 1974--1977 Julian Eberius and Maik Thiele and Katrin Braunschweig and Wolfgang Lehner DrillBeyond: enabling business analysts to explore the Web of Open Data . . . . 1978--1981 Ndapandula Nakashole and Gerhard Weikum and Fabian Suchanek Discovering and exploring relations on the web . . . . . . . . . . . . . . . . 1982--1985 Saravanan Thirumuruganathan and Mahashweta Das and Shrikant Desai and Sihem Amer-Yahia and Gautam Das and Cong Yu MapRat: meaningful explanation, interactive exploration and geo-visualization of collaborative ratings . . . . . . . . . . . . . . . . 1986--1989 Hyunjung Park and Hector Garcia-Molina and Richard Pang and Neoklis Polyzotis and Aditya Parameswaran and Jennifer Widom Deco: a system for declarative crowdsourcing . . . . . . . . . . . . . 1990--1993 Wim Martens and Matthias Niewerth and Frank Neven and Thomas Schwentick Developing and analyzing XSDs through BonXai . . . . . . . . . . . . . . . . . 1994--1997 Aaron J. Elmore and Sudipto Das and Divyakant Agrawal and Amr El Abbadi InfoPuzzle: exploring group decision making in mobile peer-to-peer databases 1998--2001 Jianqiu Xu and Ralf Hartmut Güting Manage and query generic moving objects in SECONDO . . . . . . . . . . . . . . . 2002--2005 Pei Li and Christina Tziviskou and Haidong Wang and Xin Luna Dong and Xiaoguang Liu and Andrea Maurino and Divesh Srivastava Chronos: facilitating history discovery by linking temporal records . . . . . . 2006--2009 Manolis Koubarakis and Mihai Datcu and Charalambos Kontoes and Ugo Di Giammatteo and Stefan Manegold and Eva Klien TELEIOS: a database-powered virtual earth observatory . . . . . . . . . . . 2010--2013 Jens Dittrich and Jorge-Arnulfo Quiané-Ruiz Efficient big data processing in Hadoop MapReduce . . . . . . . . . . . . . . . 2014--2015 Kyuseok Shim MapReduce algorithms for big data analysis . . . . . . . . . . . . . . . . 2016--2017 Lise Getoor and Ashwin Machanavajjhala Entity resolution: theory, practice & open challenges . . . . . . . . . . . . 2018--2019 Jiri Schindler I/O characteristics of NoSQL databases 2020--2021 Yizhou Sun and Jiawei Han and Xifeng Yan and Philip S. Yu Mining knowledge from interconnected data: a heterogeneous information network analysis approach . . . . . . . 2022--2023 B. Aditya Prakash and Christos Faloutsos Understanding and managing cascades on large graphs . . . . . . . . . . . . . . 2024--2025 Asuman Dogac Interoperability in eHealth systems . . 2026--2027 Divyakant Agrawal and Amr El Abbadi and Shiyuan Wang Secure and privacy-preserving data services in the cloud: a data centric view . . . . . . . . . . . . . . . . . . 2028--2029 Sudipto Guha and Andrew McGregor Graph synopses, sketches, and streams: a survey . . . . . . . . . . . . . . . . . 2030--2031 Alexandros Labrinidis and H. V. Jagadish Challenges and opportunities with big data . . . . . . . . . . . . . . . . . . 2032--2033 Amr El Abbadi and Mohamed F. Mokbel Panel discussion on social networks and mobility in the cloud . . . . . . . . . 2034--2035
Panagiotis Bouros and Shen Ge and Nikos Mamoulis Spatio-textual similarity joins . . . . 1--12 Marina Drosou and Evaggelia Pitoura DisC diversity: result diversification based on dissimilarity and coverage . . 13--24 Chen Zeng and Jeffrey F. Naughton and Jin-Yi Cai On differentially private frequent itemset mining . . . . . . . . . . . . . 25--36
Xin Luna Dong and Barna Saha and Divesh Srivastava Less is more: selecting sources wisely for integration . . . . . . . . . . . . 37--48 Wenchao Zhou and Suyog Mapara and Yiqing Ren and Yang Li and Andreas Haeberlen and Zachary Ives and Boon Thau Loo and Micah Sherr Distributed time-aware provenance . . . 49--60 Diego Calvanese and Giuseppe De Giacomo and Maurizio Lenzerini and Moshe Y. Vardi Query processing under GLAV mappings for relational and graph databases . . . . . 61--72 Kyriakos Mouratidis and HweeHwa Pang Computing immutable regions for subspace top-$k$ queries . . . . . . . . . . . . 73--84 Feng Zhao and Anthony K. H. Tung Large scale cohesive subgraphs discovery for social network visual analysis . . . 85--96 Xian Li and Xin Luna Dong and Kenneth Lyons and Weiyi Meng and Divesh Srivastava Truth finding on the Deep Web: is the problem solved? . . . . . . . . . . . . 97--108 Adam Marcus and David Karger and Samuel Madden and Robert Miller and Sewoong Oh Counting with the crowd . . . . . . . . 109--120 Tao Zou and Ronan Le Bras and Marcos Vaz Salles and Alan Demers and Johannes Gehrke ClouDiA: a deployment advisor for public clouds . . . . . . . . . . . . . . . . . 121--132 Jinsoo Lee and Wook-Shin Han and Romans Kasperovics and Jeong-Hoon Lee An in-depth comparison of subgraph isomorphism algorithms in graph databases . . . . . . . . . . . . . . . 133--144 Kun Ren and Alexander Thomson and Daniel J. Abadi Lightweight locking for main memory database systems . . . . . . . . . . . . 145--156
Ye Zhang and Wai-Kit Wong and S. M. Yiu and Nikos Mamoulis and David W. Cheung Lightweight privacy-preserving peer-to-peer data integration . . . . . 157--168 Yang Li and Pegah Kamousi and Fangqiu Han and Shengqi Yang and Xifeng Yan and Subhash Suri Memory efficient minimum substring partitioning . . . . . . . . . . . . . . 169--180 Arijit Khan and Yinghui Wu and Charu C. Aggarwal and Xifeng Yan NeMa: fast graph search with label similarity . . . . . . . . . . . . . . . 181--192 Xika Lin and Abhishek Mukherji and Elke A. Rundensteiner and Carolina Ruiz and Matthew O. Ward PARAS: a parameter space framework for online association mining . . . . . . . 193--204 Zhepeng Yan and Nan Zheng and Zachary G. Ives and Partha Pratim Talukdar and Cong Yu Actively soliciting feedback for query answers in keyword search-based data integration . . . . . . . . . . . . . . 205--216 Lisi Chen and Gao Cong and Christian S. Jensen and Dingming Wu Spatial keyword query processing: an experimental evaluation . . . . . . . . 217--228
Milad Eftekhar and Nick Koudas Partitioning and ranking tagged data sources . . . . . . . . . . . . . . . . 229--240 Antonio Badia and Bin Cao Efficient implementation of generalized quantification in relational query languages . . . . . . . . . . . . . . . 241--252 Rui Liu and Ashraf Aboulnaga and Kenneth Salem DAX: a widely distributed multitenant storage service for DBMS hosting . . . . 253--264 Kai Zeng and Jiacheng Yang and Haixun Wang and Bin Shao and Zhongyuan Wang A distributed graph engine for web scale RDF data . . . . . . . . . . . . . . . . 265--276 Anish Das Sarma and Foto N. Afrati and Semih Salihoglu and Jeffrey D. Ullman Upper and lower bounds on the cost of a map-reduce computation . . . . . . . . . 277--288
Stephen Tu and M. Frans Kaashoek and Samuel Madden and Nickolai Zeldovich Processing analytical queries over encrypted data . . . . . . . . . . . . . 289--300 Georgios Kellaris and Stavros Papadopoulos Practical differential privacy via grouping and smoothing . . . . . . . . . 301--312 Raghav Kaushik and Yupeng Fu and Ravishankar Ramamurthy On scaling up sensitive data auditing 313--324 Maheswaran Sathiamoorthy and Megasthenis Asteris and Dimitris Papailiopoulos and Alexandros G. Dimakis and Ramkumar Vadali and Scott Chen and Dhruba Borthakur XORing elephants: novel erasure codes for big data . . . . . . . . . . . . . . 325--336 Steffen Rendle Scaling factorization machines to relational data . . . . . . . . . . . . 337--348
Steven Euijong Whang and Peter Lofgren and Hector Garcia-Molina Question selection for crowd entity resolution . . . . . . . . . . . . . . . 349--360 Alekh Jindal and Endre Palatinus and Vladimir Pavlov and Jens Dittrich A comparison of knives for bread slicing 361--372 Chuan Xiao and Jianbin Qin and Wei Wang and Yoshiharu Ishikawa and Koji Tsuda and Kunihiko Sadakane Efficient error-tolerant query autocompletion . . . . . . . . . . . . . 373--384 Alexander Shraer and Maxim Gurevich and Marcus Fontoura and Vanja Josifovski Top-$k$ publish-subscribe for social annotation of news . . . . . . . . . . . 385--396 Phokion G. Kolaitis and Enela Pema and Wang-Chiew Tan Efficient querying of inconsistent databases with binary integer programming . . . . . . . . . . . . . . 397--408 Aristides Gionis and Flavio Junqueira and Vincent Leroy and Marco Serafini and Ingmar Weber Piggybacking on social networks . . . . 409--420 Marco D. Adelfio and Hanan Samet Schema extraction for tabular data on the Web . . . . . . . . . . . . . . . . 421--432 Ahmet Erdem Saríyüce and Bugra Gedik and Gabriela Jacques-Silva and Kun-Lung Wu and Ümit V. Çatalyürek Streaming algorithms for $k$-core decomposition . . . . . . . . . . . . . 433--444 Oktie Hassanzadeh and Ken Q. Pu and Soheil Hassas Yeganeh and Renée J. Miller and Lucian Popa and Mauricio A. Hernández and Howard Ho Discovering linkage points over Web data 445--456 Ada Wai-Chee Fu and Huanhuan Wu and James Cheng and Raymond Chi-Wing Wong IS-Label: an independent-set based labeling scheme for point-to-point distance querying . . . . . . . . . . . 457--468 Thanh T. L. Tran and Yanlei Diao and Charles Sutton and Anna Liu Supporting user-defined functions on uncertain data . . . . . . . . . . . . . 469--480 Fanwei Zhu and Yuan Fang and Kevin Chen-Chuan Chang and Jing Ying Incremental and accuracy-aware Personalized PageRank through scheduled approximation . . . . . . . . . . . . . 481--492
Weiguo Zheng and Lei Zou and Yansong Feng and Lei Chen and Dongyan Zhao Efficient simrank-based similarity join over large graphs . . . . . . . . . . . 493--504 Guimei Liu and Andre Suchitra and Limsoon Wong A performance study of three disk-based structures for indexing and querying frequent itemsets . . . . . . . . . . . 505--516 Pingpeng Yuan and Pu Liu and Buwen Wu and Hai Jin and Wenya Zhang and Ling Liu TripleBit: a fast and compact system for large scale RDF data . . . . . . . . . . 517--528 Sumeet Bajaj and Radu Sion CorrectDB: SQL engine with practical query authentication . . . . . . . . . . 529--540
Xin Liu and Kenneth Salem Hybrid storage management for database systems . . . . . . . . . . . . . . . . 541--552 Eugene Wu and Samuel Madden Scorpion: explaining away outliers in aggregate queries . . . . . . . . . . . 553--564 Rajeev Gupta and Krithi Ramamritham and Mukesh Mohania Ratio threshold queries over distributed data sources . . . . . . . . . . . . . . 565--576 Ting Deng and Wenfei Fan On the complexity of query result diversification . . . . . . . . . . . . 577--588 Sourav Dutta and Ankur Narang and Suman K. Bera Streaming quotient filter: a near optimal approximate duplicate detection approach for data streams . . . . . . . 589--600
Flip Korn and Barna Saha and Divesh Srivastava and Shanshan Ying On repairing structural problems in semi-structured data . . . . . . . . . . 601--612 Faraz Makari Manshadi and Baruch Awerbuch and Rainer Gemulla and Rohit Khandekar and Julián Mestre and Mauro Sozio A distributed algorithm for large-scale generalized matching . . . . . . . . . . 613--624 Floris Geerts and Giansalvatore Mecca and Paolo Papotti and Donatello Santoro The LLUNATIC data-cleaning framework . . 625--636 Iraklis Psaroudakis and Manos Athanassoulis and Anastasia Ailamaki Sharing data and work across concurrent analytical queries . . . . . . . . . . . 637--648 Haichuan Shang and Masaru Kitsuregawa Skyline operator on anti-correlated distributions . . . . . . . . . . . . . 649--660 Hatem Mahmoud and Faisal Nawab and Alexander Pucher and Divyakant Agrawal and Amr El Abbadi Low-latency multi-datacenter databases using replicated commit . . . . . . . . 661--672 Yun Chi and Hakan Hacígümüs and Wang-Pin Hsiung and Jeffrey F. Naughton Distribution-based query scheduling . . 673--684 Wenfei Fan and Floris Geerts and Frank Neven Making queries tractable on big data with preprocessing: through the eyes of complexity theory . . . . . . . . . . . 685--696 Haim Kaplan and Ilia Lotosh and Tova Milo and Slava Novgorodov Answering planning queries with the crowd . . . . . . . . . . . . . . . . . 697--708 Max Heimel and Michael Saecker and Holger Pirk and Stefan Manegold and Volker Markl Hardware-oblivious parallelism for in-memory column-stores . . . . . . . . 709--720 Risi Thonangi and Jun Yang Permuting data on random-access block storage . . . . . . . . . . . . . . . . 721--732 Radu Stoica and Anastasia Ailamaki Improving flash write performance by using update frequency . . . . . . . . . 733--744 Lu Li and Chee-Yong Chan Efficient indexing for diverse query results . . . . . . . . . . . . . . . . 745--756 Chen Jason Zhang and Lei Chen and H. V. Jagadish and Chen Caleb Cao Reducing uncertainty of schema matching via crowdsourcing . . . . . . . . . . . 757--768 Bin Yang and Chenjuan Guo and Christian S. Jensen Travel cost inference from sparse, spatio temporally correlated time series using Markov models . . . . . . . . . . 769--780
Hyunjung Park and Jennifer Widom Query optimization over crowdsourced data . . . . . . . . . . . . . . . . . . 781--792 Yang Wang and Peng Wang and Jian Pei and Wei Wang and Sheng Huang A data-adaptive and dynamic segmentation index for whole matching on time series 793--804 Mirko Bronzi and Valter Crescenzi and Paolo Merialdo and Paolo Papotti Extraction and integration of partially overlapping web sources . . . . . . . . 805--816 Yuan Yuan and Rubao Lee and Xiaodong Zhang The Yin and Yang of processing data warehousing queries on GPU devices . . . 817--828 Dayu Yuan and Prasenjit Mitra and C. Lee Giles Mining and indexing graphs for supergraph search . . . . . . . . . . . 829--840 Jianmin Wang and Shaoxu Song and Xiaochen Zhu and Xuemin Lin Efficient recovery of missing events . . 841--852 Kai Ren and YongChul Kwon and Magdalena Balazinska and Bill Howe Hadoop's adolescence: an analysis of Hadoop usage in scientific workloads . . 853--864 Essam Mansour and Ahmed El-Roby and Panos Kalnis and Aron Ahmadia and Ashraf Aboulnaga RACE: a scalable and elastic parallel system for discovering repeats in very long sequences . . . . . . . . . . . . . 865--876 Justin Levandoski and David Lomet and Sudipta Sengupta LLAMA: a cache/storage subsystem for modern hardware . . . . . . . . . . . . 877--888 Jiong He and Mian Lu and Bingsheng He Revisiting co-processing for hash joins on the coupled CPU--GPU architecture . . 889--900 Miao Qiao and Lu Qin and Hong Cheng and Jeffrey Xu Yu and Wentao Tian Top-$k$ nearest keyword search on large graphs . . . . . . . . . . . . . . . . . 901--912 Nikos Armenatzoglou and Stavros Papadopoulos and Dimitris Papadias A general framework for geo-social query processing . . . . . . . . . . . . . . . 913--924 Wentao Wu and Yun Chi and Hakan Hacígümüs and Jeffrey F. Naughton Towards predicting query execution time for concurrent and dynamic database workloads . . . . . . . . . . . . . . . 925--936 Minos Garofalakis and Daniel Keren and Vasilis Samoladas Sketch-based geometric monitoring of distributed stream queries . . . . . . . 937--948 Cheng Long and Raymond Chi-Wing Wong and H. V. Jagadish Direction-preserving trajectory simplification . . . . . . . . . . . . . 949--960
Nicolas Bruno and Sapna Jain and Jingren Zhou Continuous cloud-scale query optimization and processing . . . . . . 961--972 Andrii Cherniak and Huma Zaidi and Vladimir Zadorozhny Optimization strategies for A/B testing on HADOOP . . . . . . . . . . . . . . . 973--984 Khaled Elmeleegy Piranha: optimizing short jobs in Hadoop 985--996 Mohammad Sadoghi and Kenneth A. Ross and Mustafa Canim and Bishwaranjan Bhattacharjee Making updates disk-I/O friendly using SSDs . . . . . . . . . . . . . . . . . . 997--1008 Ablimit Aji and Fusheng Wang and Hoang Vo and Rubao Lee and Qiaoling Liu and Xiaodong Zhang and Joel Saltz Hadoop GIS: a high performance spatial data warehousing system over MapReduce 1009--1020 Bhuvan Bamba and Siva Ravada and Ying Hu and Richard Anderson Statistics collection in Oracle Spatial and Graph: fast histogram construction for complex geometry objects . . . . . . 1021--1032 Tyler Akidau and Alex Balikov and Kaya Bekiroglu and Slava Chernyak and Josh Haberman and Reuven Lax and Sam McVeety and Daniel Mills and Paul Nordstrom and Sam Whittle MillWheel: fault-tolerant stream processing at Internet scale . . . . . . 1033--1044 Ian Rae and Eric Rollins and Jeff Shute and Sukhdeep Sodhi and Radek Vingralek Online, asynchronous schema change in F1 1045--1056 Lior Abraham and John Allen and Oleksandr Barykin and Vinayak Borkar and Bhuwan Chopra and Ciprian Gerea and Daniel Merl and Josh Metzler and David Reiss and Subbu Subramanian and Janet L. Wiener and Okay Zed Scuba: diving into data at Facebook . . 1057--1067 Jeff Shute and Radek Vingralek and Bart Samwel and Ben Handy and Chad Whipkey and Eric Rollins and Mircea Oancea and Kyle Littlefield and David Menestrina and Stephan Ellner and John Cieslewicz and Ian Rae and Traian Stancescu and Himani Apte F1: a distributed SQL database that scales . . . . . . . . . . . . . . . . . 1068--1079 Vijayshankar Raman and Gopi Attaluri and Ronald Barber and Naresh Chainani and David Kalmuk and Vincent KulandaiSamy and Jens Leenstra and Sam Lightstone and Shaorong Liu and Guy M. Lohman and Tim Malkemus and Rene Mueller and Ippokratis Pandis and Berni Schiefer and David Sharpe and Richard Sidle and Adam Storm and Liping Zhang DB2 with BLU acceleration: so much more than just a column store . . . . . . . . 1080--1091 Michael Ovsiannikov and Silvius Rus and Damian Reeves and Paul Sutter and Sriram Rao and Jim Kelly The Quantcast File System . . . . . . . 1092--1101 Srikanth Bellamkonda and Hua-Gang Li and Unmesh Jagtap and Yali Zhu and Vince Liang and Thierry Cruanes Adaptive and big data scale parallel execution in Oracle . . . . . . . . . . 1102--1113 Kedar Bellare and Carlo Curino and Ashwin Machanavajihala and Peter Mika and Mandar Rahurkar and Aamod Sane WOO: a scalable and multi-tenant platform for continuous knowledge base synthesis . . . . . . . . . . . . . . . 1114--1125 Abhishek Gattani and Digvijay S. Lamba and Nikesh Garera and Mitul Tiwari and Xiaoyong Chai and Sanjib Das and Sri Subramaniam and Anand Rajaraman and Venky Harinarayan and AnHai Doan Entity extraction, linking, classification, and tagging for social media: a Wikipedia-based approach . . . 1126--1137 Hazem Elmeleegy and Yinan Li and Yan Qi and Peter Wilmot and Mingxi Wu and Santanu Kolay and Ali Dasdan and Songting Chen Overview of turn data management platform for digital advertising . . . . 1138--1149 Michael Curtiss and Iain Becker and Tudor Bosman and Sergey Doroshenko and Lucian Grijincu and Tom Jackson and Sandhya Kunnatur and Soren Lassen and Philip Pronin and Sriram Sankar and Guanghao Shen and Gintaras Woss and Chao Yang and Ning Zhang Unicorn: a system for searching the social graph . . . . . . . . . . . . . . 1150--1161 Sergio Ramazzina and Chiara L. Ballari and Daniela Somenzi A new service for customer care based on the Trentorise bigdata platform . . . . 1162--1163 Fabrizio Antonelli and Antonino Casella and Cristiana Chitic and Roberto Larcher and Giovanni Torrisi Exploiting the diversity, mass and speed of territorial data by TELCO Operator for better user services . . . . . . . . 1164--1165 Ivan Bedini and Benedikt Elser and Yannis Velegrakis The Trento big data platform for public administration and large companies: use cases and opportunities . . . . . . . . 1166--1167 Nga Tran and Sreenath Bodagala and Jaimin Dave Designing query optimizers for big data problems of the future . . . . . . . . . 1168--1169 Monica Franceschini How to maximize the value of big data with the open source SpagoBI suite through a comprehensive approach . . . . 1170--1171 Edward Y. Chang Context-aware computing: opportunities and open issues . . . . . . . . . . . . 1172--1173 Oktie Hassanzadeh and Anastasios Kementsietsidis and Benny Kimelfeld and Rajasekar Krishnamurthy and Fatma Özcan and Ippokratis Pandis Next generation data analytics at IBM research . . . . . . . . . . . . . . . . 1174--1175 Mauro Brunato and Roberto Battiti Learning and intelligent optimization (LION): one ring to rule them all . . . 1176--1177 David Lomet Microsoft SQL server's integrated database approach for modern applications and hardware . . . . . . . 1178--1179 Hakan Hacígümüs and Jagan Sankaranarayanan and Junichi Tatemura and Jeff LeFevre and Neoklis Polyzotis Odyssey: a multistore system for evolutionary analytics . . . . . . . . . 1180--1181 Paolo Bouquet and Andrea Molinari A global Entity Name System (ENS) for data ecosystems . . . . . . . . . . . . 1182--1183 Vishal Sikka and Franz Färber and Anil Goel and Wolfgang Lehner SAP HANA: the evolution from a modern main-memory data platform to an enterprise application platform . . . . 1184--1185 Raghunath Nambiar and Meikel Poess Keeping the TPC relevant! . . . . . . . 1186--1187 Xin Luna Dong and Divesh Srivastava Big data integration . . . . . . . . . . 1188--1189 Stratis D. Viglas Just-in-time compilation for SQL query processing . . . . . . . . . . . . . . . 1190--1191 Anastasia Ailamaki and Ryan Johnson and Ippokratis Pandis and Pínar Tözün Toward scalable transaction processing: evolution of Shore-MT . . . . . . . . . 1192--1193 Aaron J. Elmore and Carlo Curino and Divyakant Agrawal and Amr El Abbadi Towards database virtualization for database as a service . . . . . . . . . 1194--1195 Mohamed F. Mokbel and Mohamed Sarwat Mobility and social networking: a data management perspective . . . . . . . . . 1196--1197
Andy Yuan Xue and Rui Zhang and Yu Zheng and Xing Xie and Jianhui Yu and Yong Tang DesTeller: a system for destination prediction based on trajectories with privacy protection . . . . . . . . . . . 1198--1201 Zhe Chen and Michael Cafarella and Jun Chen and Daniel Prevo and Junfeng Zhuang Senbazuru: a prototype spreadsheet database management system . . . . . . . 1202--1205 Grégory Smits and Olivier Pivert and Thomas Girault ReqFlex: fuzzy queries for everyone . . 1206--1209 Martin Kaufmann and Panagiotis Vagenas and Peter M. Fischer and Donald Kossmann and Franz Färber Comprehensive and interactive temporal query processing with SAP HANA . . . . . 1210--1213 Torsten Grust and Nils Schweinsberg and Alexander Ulrich Functions are data too: defunctionalization for PL/SQL . . . . . 1214--1217 Amr Ebaid and Ahmed Elmagarmid and Ihab F. Ilyas and Mourad Ouzzani and Jorge-Arnulfo Quiane-Ruiz and Nan Tang and Si Yin NADEEF: a generalized data cleaning system . . . . . . . . . . . . . . . . . 1218--1221 Sonia Bergamaschi and Francesco Guerra and Matteo Interlandi and Raquel Trillo-Lado and Yannis Velegrakis QUEST: a keyword search system for relational data based on semantic and machine learning techniques . . . . . . 1222--1225 Kenneth S. Bògh and Anders Skovsgaard and Christian S. Jensen GroupFinder: a new approach to top-$k$ point-of-interest group retrieval . . . 1226--1229 Ahmed Eldawy and Mohamed F. Mokbel A demonstration of SpatialHadoop: an efficient MapReduce framework for spatial data . . . . . . . . . . . . . . 1230--1233 Mehmet Ali Abbasoglu and Bugra Gedik and Hakan Ferhatosmanoglu Aggregate profile clustering for telco analytics . . . . . . . . . . . . . . . 1234--1237 Luying Chen and Stefano Ortona and Giorgio Orsi and Michael Benedikt ROSeAnn: reconciling opinions of semantic annotators . . . . . . . . . . 1238--1241 Mohamed Sarwat and James Avery and Mohamed F. Mokbel RecDB in action: recommendation made easy in relational databases . . . . . . 1242--1245 Marina Drosou and Evaggelia Pitoura POIKILO: a tool for evaluating the results of diversification models and algorithms . . . . . . . . . . . . . . . 1246--1249 Yael Amsterdamer and Yael Grossman and Tova Milo and Pierre Senellart CrowdMiner: mining association rules from the crowd . . . . . . . . . . . . . 1250--1253 Chen Chen and Hongzhi Yin and Junjie Yao and Bin Cui TeRec: a temporal recommender system over tweet stream . . . . . . . . . . . 1254--1257 Alexander Shkapsky and Kai Zeng and Carlo Zaniolo Graph queries in a next-generation Datalog system . . . . . . . . . . . . . 1258--1261 Abdeltawab M. Hendawi and Jie Bao and Mohamed F. Mokbel iRoad: a framework for scalable predictive query processing on road networks . . . . . . . . . . . . . . . . 1262--1265 Mithila Nagendra and K. Selçuk Candan SkySuite: a framework of skyline-join operators for static and stream environments . . . . . . . . . . . . . . 1266--1269 Jianlong Zhong and Bingsheng He Parallel graph processing on graphics processors made easy . . . . . . . . . . 1270--1273 Stefan Richter and Jens Dittrich and Stefan Schuh and Tobias Frey Mosquito: another one bites the data upload stream . . . . . . . . . . . . . 1274--1277 Sergej Hardock and Ilia Petrov and Robert Gottstein and Alejandro Buchmann NoFTL: database systems on FTL-less flash storage . . . . . . . . . . . . . 1278--1281 Dimitrios Kotsakos and Panos Sakkos and Vana Kalogeraki and Dimitirios Gunopulos SmartMonitor: using smart devices to perform structural health monitoring . . 1282--1285 Yagíz Kargín and Milena Ivanova and Ying Zhang and Stefan Manegold and Martin Kersten Lazy ETL in action: ETL technology dates scientific data . . . . . . . . . . . . 1286--1289 Niv Dayan and Martin Kjær Svendsen and Matias Bjòrling and Philippe Bonnet and Luc Bouganim EagleTree: exploring the design space of SSD-based algorithms . . . . . . . . . . 1290--1293 Saket Sathe and Arthur Oviedo and Dipanjan Chakraborty and Karl Aberer EnviroMeter: a platform for querying community-sensed data . . . . . . . . . 1294--1297 Alper Okcan and Mirek Riedewald and Biswanath Panda and Daniel Fink Scolopax: exploratory analysis of scientific data . . . . . . . . . . . . 1298--1301 Daniel Deutch and Yuval Moskovitch and Val Tannen PROPOLIS: provisioned analysis of data-centric processes . . . . . . . . . 1302--1305 Pradap Konda and Arun Kumar and Christopher Ré and Vaishnavi Sashikanth Feature selection in enterprise analytics: a demonstration using an R-based data analytics system . . . . . 1306--1309 Mohammadreza Najafi and Mohammad Sadoghi and Hans-Arno Jacobsen Flexible query processor on FPGAs . . . 1310--1313 Cristina Civili and Marco Console and Giuseppe De Giacomo and Domenico Lembo and Maurizio Lenzerini and Lorenzo Lepore and Riccardo Mancini and Antonella Poggi and Riccardo Rosati and Marco Ruzzi and Valerio Santarelli and Domenico Fabio Savo MASTRO STUDIO: managing ontology-based data access applications . . . . . . . . 1314--1317 David Fuhry and Yang Zhang and Venu Satuluri and Arnab Nandi and Srinivasan Parthasarathy PLASMA-HD: probing the lattice structure and makeup of high-dimensional data . . 1318--1321 Matthew Moyers and Emad Soroush and Spencer C. Wallace and Simon Krughoff and Jake Vanderplas and Magdalena Balazinska and Andrew Connolly A demonstration of iterative parallel array processing in support of telescope image analysis . . . . . . . . . . . . . 1322--1325 Hamed Abdelhaq and Christian Sengstock and Michael Gertz EvenTweet: online localized event detection from Twitter . . . . . . . . . 1326--1329 Hamid Mousavi and Shi Gao and Carlo Zaniolo IBminer: a text mining tool for constructing and populating InfoBox databases and knowledge bases . . . . . 1330--1333 Nicholas L. Farnan and Adam J. Lee and Panos K. Chrysanthis and Ting Yu PAQO: a preference-aware query optimizer for PostgreSQL . . . . . . . . . . . . . 1334--1337 Suvarna Bothe and Panagiotis Karras and Akrivi Vlachou eSkyline: processing skyline queries over encrypted data . . . . . . . . . . 1338--1341 Lilong Jiang and Michael Mandel and Arnab Nandi GestureQuery: a multitouch database query interface . . . . . . . . . . . . 1342--1345 Di Yang and Kaiyu Zhao and Maryam Hasan and Hanyuan Lu and Elke Rundensteiner and Matthew Ward Mining and linking patterns across live data streams and stream archives . . . . 1346--1349 Hanan Samet and Marco D. Adelfio and Brendan C. Fruin and Michael D. Lieberman and Jagan Sankaranarayanan PhotoStand: a map query interface for a database of news photos . . . . . . . . 1350--1353 K. Ashwin Kumar and Jonathan Gluck and Amol Deshpande and Jimmy Lin Hone: ``Scaling down'' Hadoop on shared-memory systems . . . . . . . . . 1354--1357 Dolan Antenucci and Erdong Li and Shaobo Liu and Bochun Zhang and Michael J. Cafarella and Christopher Ré Ringtail: a generalized nowcasting system . . . . . . . . . . . . . . . . . 1358--1361 Min Xie and Laks V. S. Lakshmanan and Peter T. Wood IPS: an interactive package configuration system for trip planning 1362--1365 Jingbo Zhou and Anthony K. H. Tung and Wei Wu and Wee Siong Ng R2-D2: a system to support probabilistic path prediction in dynamic environments via ``Semi-lazy'' learning . . . . . . . 1366--1369 Byung-Gon Chun and Tyson Condie and Carlo Curino and Chris Douglas and Sergiy Matusevych and Brandon Myers and Shravan Narayanamurthy and Raghu Ramakrishnan and Sriram Rao and Josh Rosen and Russell Sears and Markus Weimer REEF: retainable evaluator execution framework . . . . . . . . . . . . . . . 1370--1373 Shuhao Zhang and Jiong He and Bingsheng He and Mian Lu OmniDB: towards portable and efficient query processing on parallel CPU/GPU architectures . . . . . . . . . . . . . 1374--1377 Ognjen Savkovi\'c and Paramita Mirza and Alex Tomasi and Werner Nutt Complete approximations of incomplete queries . . . . . . . . . . . . . . . . 1378--1381 Georgia Koutrika and Qian Lin and Jerry Liu User analytics with UbeOne: insights into web printing . . . . . . . . . . . 1382--1385 Ivo Santos and Marcel Tilly and Badrish Chandramouli and Jonathan Goldstein DiAl: distributed streaming analytics anywhere, anytime . . . . . . . . . . . 1386--1389 Rada Chirkova and Jun Yang Big and useful: what's in the data for me? . . . . . . . . . . . . . . . . . . 1390--1391 Tomás Bartos Universal indexing of arbitrary similarity models . . . . . . . . . . . 1392--1397 Sebastian Breß and Gunter Saake Why it is time for a HyPE: a hybrid query processing engine for efficient GPU coprocessing in DBMS . . . . . . . . 1398--1403 Alireza Rezaei Mahdiraji and Peter Baumann Database support for unstructured meshes 1404--1409 Aastha Madaan and Subhash Bhalla Domain specific multistage query language for medical document repositories . . . . . . . . . . . . . . 1410--1415 Io Taxidou and Peter Fischer Realtime analysis of information diffusion in social media . . . . . . . 1416--1421 Luca Bonomi and Li Xiong Mining frequent patterns with differential privacy . . . . . . . . . . 1422--1427 Anett Hoppe and C. Nicolle and A. Roxin Automatic ontology-based user profile learning from heterogeneous Web resources in a big data context . . . . 1428--1433 Akon Dey and Alan Fekete and Uwe Röhm Scalable transactions across heterogeneous NoSQL key--value data stores . . . . . . . . . . . . . . . . . 1434--1439 Nhung Ngo and Enrico Franconi Getting unique solution in data exchange 1440--1443 Martin Kaufmann and Donald Kossmann Storing and processing temporal data in a main memory column store . . . . . . . 1444--1449 Stepan Kozak and Pavel Zezula Efficiency and security in similarity cloud services . . . . . . . . . . . . . 1450--1455 Thibault Sellam and Martin Kersten Fast cartography for data explorers . . 1456--1461
Gonçalo Simões and Helena Galhardas and Luis Gravano When speed has a price: fast information extraction using approximate algorithms 1462--1473 Craig Chasseur and Jignesh M. Patel Design and evaluation of storage organizations for read-optimized main memory databases . . . . . . . . . . . . 1474--1485 Luying Chen and Stefano Ortona and Giorgio Orsi and Michael Benedikt Aggregating semantic annotators . . . . 1486--1497 Xu Chu and Ihab F. Ilyas and Paolo Papotti Discovering denial constraints . . . . . 1498--1509 Wenfei Fan and Xin Wang and Yinghui Wu Diversified top-$k$ graph pattern matching . . . . . . . . . . . . . . . . 1510--1521 Weixiong Rao and Lei Chen and Pan Hui and Sasu Tarkoma Bitlist: new full-text index for low space cost and efficient keyword search 1522--1533 Sebastian Wandelt and Johannes Starlinger and Marc Bux and Ulf Leser RCSI: scalable similarity search in thousand(s) of genomes . . . . . . . . . 1534--1545 Yufei Tao and Xiaocheng Hu and Dong-Wan Choi and Chin-Wan Chung Approximate MaxRS in spatial databases 1546--1557 Benny Kimelfeld and Jan Vondrák and David P. Woodruff Multi-tuple deletion propagation: approximations and complexity . . . . . 1558--1569 Badrish Chandramouli and Suman Nath and Wenchao Zhou Supporting distributed feed-following apps over edge devices . . . . . . . . . 1570--1581 Saravanan Thirumuruganathan and Nan Zhang and Gautam Das Rank discovery from web databases . . . 1582--1593 Theodoros Rekatsinas and Amol Deshpande and Ashwin Machanavajjhala SPARSI: partitioning sensitive data amongst multiple adversaries . . . . . . 1594--1605 Dong Deng and Yu Jiang and Guoliang Li and Jian Li and Cong Yu Scalable column concept determination for Web tables using large knowledge bases . . . . . . . . . . . . . . . . . 1606--1617 Xin Huang and Hong Cheng and Rong-Hua Li and Lu Qin and Jeffrey Xu Yu top-$k$ structural diversity search in large networks . . . . . . . . . . . . . 1618--1629 Federico Cavalieri and Alessandro Solimando and Giovanna Guerrini Synthetising changes in XML documents as PULs . . . . . . . . . . . . . . . . . . 1630--1641
Lei Zhang and Thanh Tran and Achim Rettinger Probabilistic query rewriting for efficient and effective keyword search on graph data . . . . . . . . . . . . . 1642--1653 Martin Schäler and Alexander Grebhahn and Reimar Schröter and Sandro Schulze and Veit Köppen and Gunter Saake QuEval: beyond high-dimensional indexing \`a la carte . . . . . . . . . . . . . . 1654--1665 Yuhong Li and Leong Hou U. and Man Lung Yiu and Zhiguo Gong Discovering longest-lasting correlation in sequence databases . . . . . . . . . 1666--1677 Adrian Daniel Popescu and Andrey Balmin and Vuk Ercegovac and Anastasia Ailamaki PREDIcT: towards predicting the runtime of large scale iterative analytics . . . 1678--1689 Xiaohan Zhao and Adelbert Chang and Atish Das Sarma and Haitao Zheng and Ben Y. Zhao On the embeddability of random walk distances . . . . . . . . . . . . . . . 1690--1701 Tobias Mühlbauer and Wolf Rödiger and Robert Seilbeck and Angelika Reiser and Alfons Kemper and Thomas Neumann Instant loading for main memory databases . . . . . . . . . . . . . . . 1702--1713 Karolina Alexiou and Donald Kossmann and Per-Åke Larson Adaptive range filters for cold data: avoiding trips to Siberia . . . . . . . 1714--1725 Badrish Chandramouli and Jonathan Goldstein and Abdul Quamar Scalable progressive analytics on big data in the Cloud . . . . . . . . . . . 1726--1737 Peter Ogden and David Thomas and Peter Pietzuch Scalable XML query processing using parallel pushdown transducers . . . . . 1738--1749 Yin Huai and Siyuan Ma and Rubao Lee and Owen O'Malley and Xiaodong Zhang Understanding insights into the basic structure and essential issues of table placement methods in clusters . . . . . 1750--1761 Davide Mottin and Alice Marascu and Senjuti Basu Roy and Gautam Das and Themis Palpanas and Yannis Velegrakis A probabilistic optimization framework for the empty-answer problem . . . . . . 1762--1773 Yinghui Wu and Shengqi Yang and Mudhakar Srivatsa and Arun Iyengar and Xifeng Yan Summarizing answer graphs induced by keyword queries . . . . . . . . . . . . 1774--1785 Huizhong Duan and ChengXiang Zhai and Jinxing Cheng and Abhishek Gattani Supporting keyword search in product database: a probabilistic approach . . . 1786--1797 Supriya Nirkhiwale and Alin Dobra and Christopher Jermaine A sampling algebra for aggregate estimation . . . . . . . . . . . . . . . 1798--1809 Maximilian Dylla and Iris Miliaraki and Martin Theobald A temporal-probabilistic database model for information extraction . . . . . . . 1810--1821 Pit Fender and Guido Moerkotte Counter strike: generic top-down join enumeration for hypergraphs . . . . . . 1822--1833 Daniar Achakeev and Bernhard Seeger Efficient bulk updates on multiversion B-trees . . . . . . . . . . . . . . . . 1834--1845 Hotham Altwaijry and Dmitri V. Kalashnikov and Sharad Mehrotra Query-driven approach to entity resolution . . . . . . . . . . . . . . . 1846--1857 Jaros\law Szlichta and Parke Godfrey and Jarek Gryz and Calisto Zuzarte Expressiveness and complexity of order dependencies . . . . . . . . . . . . . . 1858--1869 A. Pavan and Kanat Tangwongsan and Srikanta Tirthapura and Kun-Lung Wu Counting and sampling triangles from a graph stream . . . . . . . . . . . . . . 1870--1881 Benjamin Sowell and Marcos Vaz Salles and Tuan Cao and Alan Demers and Johannes Gehrke An experimental analysis of iterated spatial joins in main memory . . . . . . 1882--1893 Kisung Lee and Ling Liu Scaling queries over big RDF graphs with semantic hash partitioning . . . . . . . 1894--1905 Jiwon Seo and Jongsoo Park and Jaeho Shin and Monica S. Lam Distributed socialite: a datalog-based language for large-scale graph analysis 1906--1917 Mohamed Sarwat and Sameh Elnikety and Yuxiong He and Mohamed F. Mokbel Horton+: a distributed system for processing declarative reachability queries over partitioned graphs . . . . 1918--1929 Narayanan Sundaram and Aizana Turmukhametova and Nadathur Satish and Todd Mostak and Piotr Indyk and Samuel Madden and Pradeep Dubey Streaming similarity search over one billion tweets using parallel locality-sensitive hashing . . . . . . . 1930--1941 Justin DeBrabant and Andrew Pavlo and Stephen Tu and Michael Stonebraker and Stan Zdonik Anti-caching: a new approach to database management system architecture . . . . . 1942--1953 Wahbeh Qardaji and Weining Yang and Ninghui Li Understanding hierarchical methods for differentially private histograms . . . 1954--1965 Rui Li and Shengjie Wang and Kevin Chen-Chuan Chang Towards social data platform: automatic topic-focused monitor for Twitter stream 1966--1977 Ruoming Jin and Guan Wang Simple, fast, and scalable reachability oracle . . . . . . . . . . . . . . . . . 1978--1989 Nurzhan Bakibayev and Tomás Kociský and Dan Olteanu and Jakub Závodný Aggregation and ordering in factorised databases . . . . . . . . . . . . . . . 1990--2001 Yoonjae Park and Jun-Ki Min and Kyuseok Shim Parallel computation of skyline and reverse skyline queries using MapReduce 2002--2013 Wenlei Xie and Guozhang Wang and David Bindel and Alan Demers and Johannes Gehrke Fast iterative graph computation with block updates . . . . . . . . . . . . . 2014--2025
Xiaoli Wang and Xiaofeng Ding and Anthony K. H. Tung and Zhenjie Zhang Efficient and effective KNN sequence search with approximate $n$-grams . . . 1--12 Weiren Yu and Xuemin Lin and Wenjie Zhang and Lijun Chang and Jian Pei More is simpler: effectively and efficiently assessing node-pair similarities based on hyperlinks . . . . 13--24 Marc Gyssens and Jan Paredaens and Dirk Van Gucht and Jef Wijsen and Yuqing Wu An approach towards the study of symmetric queries . . . . . . . . . . . 25--36 Sudipto Das and Vivek R. Narasayya and Feng Li and Manoj Syamala CPU sharing techniques for performance isolation in multi-tenant relational database-as-a-service . . . . . . . . . 37--48 Qian Chen and Haibo Hu and Jianliang Xu Authenticating top-$k$ queries in location-based services with confidentiality . . . . . . . . . . . . 49--60 Zichao Qi and Yanghua Xiao and Bin Shao and Haixun Wang Toward a distance oracle for billion-node graphs . . . . . . . . . . 61--72 Manohar Kaul and Raymond Chi-Wing Wong and Bin Yang and Christian S. Jensen Finding shortest paths on terrains by killing two birds with one stone . . . . 73--84 Cagri Balkesen and Gustavo Alonso and Jens Teubner and M. Tamer Özsu Multi-core, main-memory joins: sort vs. hash revisited . . . . . . . . . . . . . 85--96
Felix Martin Schuhknecht and Alekh Jindal and Jens Dittrich The uncracked pieces in database cracking . . . . . . . . . . . . . . . . 97--108 Bahaeddin Eravci and Hakan Ferhatosmanoglu Diversity based relevance feedback for time series search . . . . . . . . . . . 109--120 Steven Pelley and Thomas F. Wenisch and Brian T. Gold and Bill Bridge Storage management in the NVRAM era . . 121--132
Mariam Salloum and Xin Luna Dong and Divesh Srivastava and Vassilis J. Tsotras Online ordering of overlapping data sources . . . . . . . . . . . . . . . . 133--144 Guoping Wang and Chee-Yong Chan Multi-query optimization in MapReduce framework . . . . . . . . . . . . . . . 145--156 Zhenhui Li and Bolin Ding and Fei Wu and Tobias Kin Hou Lei and Roland Kays and Margaret C. Crofoot Attraction and avoidance detection from movements . . . . . . . . . . . . . . . 157--168 Xiang Zhao and Chuan Xiao and Xuemin Lin and Qing Liu and Wenjie Zhang A partition-based approach to structure similarity search . . . . . . . . . . . 169--180 Peter Bailis and Aaron Davidson and Alan Fekete and Ali Ghodsi and Joseph M. Hellerstein and Ion Stoica Highly available transactions: virtues and limitations . . . . . . . . . . . . 181--192 Yuanyuan Tian and Andrey Balmin and Severin Andreas Corsten and Shirish Tatikonda and John McPherson From ``think like a vertex'' to ``think like a graph'' . . . . . . . . . . . . . 193--204 Johannes Niedermayer and Andreas Züfle and Tobias Emrich and Matthias Renz and Nikos Mamoulis and Lei Chen and Hans-Peter Kriegel Probabilistic nearest neighbor queries on uncertain moving object trajectories 205--216
Konstantinos Karanasos and Asterios Katsifodimos and Ioana Manolescu Delta: scalable data dissemination under capacity constraints . . . . . . . . . . 217--228 Ceren Budak and Theodore Georgiou and Divyakant Agrawal and Amr El Abbadi GeoScope: online detection of geo-correlated information trends in social networks . . . . . . . . . . . . 229--240 Makoto Onizuka and Hiroyuki Kato and Soichiro Hidaka and Keisuke Nakano and Zhenjiang Hu Optimization for iterative queries on MapReduce . . . . . . . . . . . . . . . 241--252 Hong-Han Shuai and De-Nian Yang and Philip S. Yu and Ming-Syan Chen Willingness optimization for social group activity . . . . . . . . . . . . . 253--264 Lei Cao and Elke A. Rundensteiner High performance stream query processing with correlation-aware partitioning . . 265--276 Djellel Eddine Difallah and Andrew Pavlo and Carlo Curino and Philippe Cudre-Mauroux OLTP-Bench: an extensible testbed for benchmarking relational databases . . . 277--288 Arnab Nandi and Lilong Jiang and Michael Mandel Gestural query specification . . . . . . 289--300 Arvid Heise and Jorge-Arnulfo Quiané-Ruiz and Ziawasch Abedjan and Anja Jentzsch and Felix Naumann Scalable discovery of unique column combinations . . . . . . . . . . . . . . 301--312 Yu Tang and Leong Hou U. and Yilun Cai and Nikos Mamoulis and Reynold Cheng Earth Mover's Distance based similarity search at scale . . . . . . . . . . . . 313--324 Aditya Parameswaran and Neoklis Polyzotis and Hector Garcia-Molina SeeDB: visualizing database queries efficiently . . . . . . . . . . . . . . 325--328
Hatem A. Mahmoud and Vaibhav Arora and Faisal Nawab and Divyakant Agrawal and Amr El Abbadi MaaT: effective and scalable coordination of distributed transactions in the cloud . . . . . . . . . . . . . . 329--340 Chao Li and Michael Hay and Gerome Miklau and Yue Wang A data- and workload-aware algorithm for range queries under differential privacy 341--352 Sergio Greco and Fabian Pijcke and Jef Wijsen Certain query answering in partially consistent databases . . . . . . . . . . 353--364 Davide Mottin and Matteo Lissandrini and Yannis Velegrakis and Themis Palpanas Exemplar queries: give me an example of what you need . . . . . . . . . . . . . 365--376 Nitish Korula and Silvio Lattanzi An efficient reconciliation algorithm for social networks . . . . . . . . . . 377--388 Sean Chester and Alex Thomo and S. Venkatesh and Sue Whitesides Computing $k$-regret minimizing sets . . 389--400 Adams Wei Yu and Nikos Mamoulis and Hao Su Reverse top-$k$ search using random walk with restart . . . . . . . . . . . . . . 401--412 Stratis D. Viglas Write-limited sorts and joins for persistent memory . . . . . . . . . . . 413--424 N. Anciaux and L. Bouganim and T. Delot and S. Ilarri and L. Kloul and N. Mitton and P. Pucheral Folk-IS: opportunistic data services in least developed countries . . . . . . . 425--428
Georgios Giannikis and Darko Makreshanski and Gustavo Alonso and Donald Kossmann Shared workload optimization . . . . . . 429--440 Mohammed Elseidy and Abdallah Elguindy and Aleksandar Vitorovic and Christoph Koch Scalable and adaptive online joins . . . 441--452 Kristi Morton and Magdalena Balazinska and Dan Grossman and Jock Mackinlay Support the data enthusiast: challenges for next-generation data-analysis systems . . . . . . . . . . . . . . . . 453--456 Daniel Deutch and Yuval Moskovitch and Val Tannen A provenance framework for data-dependent process analysis . . . . 457--468 Yueh-Hsuan Chiang and AnHai Doan and Jeffrey F. Naughton Tracking entities in the dynamic world: a fast algorithm for matching temporal records . . . . . . . . . . . . . . . . 469--480 Neil Conway and Peter Alvaro and Emily Andrews and Joseph M. Hellerstein Edelweiss: automatic storage reclamation for distributed programming . . . . . . 481--492
Nikos Ntarmos and Ioannis Patlakas and Peter Triantafillou Rank join queries in NoSQL databases . . 493--504 Rahul Gupta and Alon Halevy and Xuezhi Wang and Steven Euijong Whang and Fei Wu Biperpedia: an ontology for search applications . . . . . . . . . . . . . . 505--516 Mohammed Elseidy and Ehab Abdelhamid and Spiros Skiadopoulos and Panos Kalnis GraMi: frequent subgraph and pattern mining in a single large graph . . . . . 517--528 Sheng Wang and David Maier and Beng Chin Ooi Lightweight indexing of observational data in log-structured storage . . . . . 529--540 Dawei Jiang and Gang Chen and Beng Chin Ooi and Kian-Lee Tan and Sai Wu epiC: an extensible and scalable system for processing big data . . . . . . . . 541--552 Matthias Boehm and Shirish Tatikonda and Berthold Reinwald and Prithviraj Sen and Yuanyuan Tian and Douglas R. Burdick and Shivakumar Vaithyanathan Hybrid parallelization strategies for large-scale machine learning in SystemML 553--564 Shengqi Yang and Yinghui Wu and Huan Sun and Xifeng Yan Schemaless and structureless graph querying . . . . . . . . . . . . . . . . 565--576 Semih Salihoglu and Jennifer Widom Optimizing graph algorithms on Pregel-like systems . . . . . . . . . . 577--588 You Wu and Pankaj K. Agarwal and Chengkai Li and Jun Yang and Cong Yu Toward computational fact-checking . . . 589--600
Marcelo Arenas and Gonzalo Díaz and Achille Fokoue and Anastasios Kementsietsidis and Kavitha Srinivas A principled approach to bridging the gap between graph data and their schemas 601--612 Dongxiang Zhang and Chee-Yong Chan and Kian-Lee Tan An efficient publish/subscribe index for e-commerce databases . . . . . . . . . . 613--624 Yu Jiang and Guoliang Li and Jianhua Feng and Wen-Syan Li String similarity joins: an experimental evaluation . . . . . . . . . . . . . . . 625--636 Davide Proserpio and Sharon Goldberg and Frank McSherry Calibrating data to sensitivity in private data analysis: a platform for differentially-private analysis of weighted datasets . . . . . . . . . . . 637--648 Wei Wang and Beng Chin Ooi and Xiaoyan Yang and Dongxiang Zhang and Yueting Zhuang Effective multi-modal retrieval based on stacked auto-encoders . . . . . . . . . 649--660
Renchu Song and Weiwei Sun and Baihua Zheng and Yu Zheng PRESS: a novel framework of trajectory compression in road networks . . . . . . 661--672 Yajun Yang and Hong Gao and Jeffrey Xu Yu and Jianzhong Li Finding the cost-optimal path with time constraint over time-dependent graphs 673--684 Aditya Parameswaran and Stephen Boyd and Hector Garcia-Molina and Ashish Gupta and Neoklis Polyzotis and Jennifer Widom Optimal crowd-powered rating and filtering algorithms . . . . . . . . . . 685--696 Anja Gruenheid and Xin Luna Dong and Divesh Srivastava Incremental record linkage . . . . . . . 697--708 Pratanu Roy and Jens Teubner and Rainer Gemulla Low-latency handshake join . . . . . . . 709--720 Huanhuan Wu and James Cheng and Silu Huang and Yiping Ke and Yi Lu and Yanyan Xu Path problems in temporal graphs . . . . 721--732 Xin Cao and Gao Cong and Christian S. Jensen and Man Lung Yiu Retrieving regions of interest for user exploration . . . . . . . . . . . . . . 733--744 Yingfan Liu and Jiangtao Cui and Zi Huang and Hui Li and Heng Tao Shen SK--LSH: an efficient index structure for approximate nearest neighbor search 745--756 Bing-Rong Lin and Daniel Kifer On arbitrage-free pricing for general data queries . . . . . . . . . . . . . . 757--768 Chao Zhang and Jiawei Han and Lidan Shou and Jiajun Lu and Thomas La Porta Splitter: mining fine-grained sequential patterns in semantic trajectories . . . 769--780 Avrilia Floratou and Frank Bertsch and Jignesh M. Patel and Georgios Laskaris Towards building wind tunnels for data center design . . . . . . . . . . . . . 781--784
Zhao Zhang and Cheqing Jin and Qiangqiang Kang Reverse $k$-ranks query . . . . . . . . 785--796 Uwe Jugel and Zbigniew Jerzak and Gregor Hackenbroich and Gregor Hackenbroich and Volker Markl M4: a visualization-oriented time series data aggregation . . . . . . . . . . . . 797--808 Mina Ghashami and Jeff M. Phillips and Feifei Li Continuous matrix approximation on distributed data . . . . . . . . . . . . 809--820 Kun Ren and Alexander Thomson and Daniel J. Abadi An evaluation of the advantages and disadvantages of deterministic database systems . . . . . . . . . . . . . . . . 821--832 Hao Zhang and Bogdan Marius Tudor and Gang Chen and Beng Chin Ooi Efficient in-memory data management: an analysis . . . . . . . . . . . . . . . . 833--836 Günes Aluç and M. Tamer Özsu and Khuzaima Daudjee Workload matters: why RDF databases need a new design . . . . . . . . . . . . . . 837--840 Sattam Alsubaiee and Alexander Behm and Vinayak Borkar and Zachary Heilbron and Young-Seok Kim and Michael J. Carey and Markus Dreseler and Chen Li Storage management in AsterixDB . . . . 841--852 Yannis Klonatos and Christoph Koch and Tiark Rompf and Hassan Chafi Building efficient query engines in a high-level language . . . . . . . . . . 853--864 Tianzheng Wang and Ryan Johnson Scalable logging through emerging non-volatile memory . . . . . . . . . . 865--876 Bingsheng He When data management systems meet approximate hardware: challenges and opportunities . . . . . . . . . . . . . 877--880 Xin Luna Dong and Evgeniy Gabrilovich and Geremy Heitz and Wilko Horn and Kevin Murphy and Shaohua Sun and Wei Zhang From data fusion to knowledge fusion . . 881--892 Stefan Funke and André Nusser and Sabine Storandt On $k$-path covers and their applications . . . . . . . . . . . . . . 893--902 Eugene Wu and Leilani Battle and Samuel R. Madden The case for data visualization management systems: vision paper . . . . 903--906 Yinan Li and Jignesh M. Patel WideTable: an accelerator for analytical data processing . . . . . . . . . . . . 907--918 Hien To and Gabriel Ghinita and Cyrus Shahabi A framework for protecting worker location privacy in spatial crowdsourcing . . . . . . . . . . . . . 919--930
Ahmed Eldawy and Justin Levandoski and Per-Åke Larson Trekking through Siberia: managing cold data in a memory-optimized database . . 931--942 Jennie Duggan The case for personal data-driven decision making . . . . . . . . . . . . 943--946 Prima Chairunnanda and Khuzaima Daudjee and M. Tamer Özsu ConfluxDB: multi-master replication for partitioned snapshot isolation databases 947--958 Bernardo Gonçalves and Fabio Porto $ \gamma $-DB: managing scientific hypotheses as uncertain data . . . . . . 959--962 Louis Woods and Zsolt István and Gustavo Alonso Ibex: an intelligent storage engine with support for advanced SQL offloading . . 963--974 Hyokun Yun and Hsiang-Fu Yu and Cho-Jui Hsieh and S. V. N. Vishwanathan and Inderjit Dhillon NOMAD: non-locking, stochastic multi-machine algorithm for asynchronous and decentralized matrix completion . . 975--986 Shaoxu Song and Hong Cheng and Jeffrey Xu Yu and Lei Chen Repairing vertex labels under neighborhood constraints . . . . . . . . 987--998 Yasser Altowim and Dmitri V. Kalashnikov and Sharad Mehrotra Progressive approach to relational entity resolution . . . . . . . . . . . 999--1010 Kaibo Wang and Kai Zhang and Yuan Yuan and Siyuan Ma and Rubao Lee and Xiaoning Ding and Xiaodong Zhang Concurrent analytical query processing with GPUs . . . . . . . . . . . . . . . 1011--1022
Takanori Maehara and Takuya Akiba and Yoichi Iwata and Ken-ichi Kawarabayashi Computing personalized PageRank quickly by exploiting graph structures . . . . . 1023--1034 Marco Serafini and Essam Mansour and Ashraf Aboulnaga and Kenneth Salem and Taha Rafiq and Umar Farooq Minhas Accordion: elastic scalability for database systems supporting distributed transactions . . . . . . . . . . . . . . 1035--1046 Minyang Han and Khuzaima Daudjee and Khaled Ammar and M. Tamer Özsu and Xingfang Wang and Tianqi Jin An experimental comparison of Pregel-like graph processing systems . . 1047--1058 Akash Das Sarma and Yeye He and Surajit Chaudhuri ClusterJoin: a similarity joins framework using map-reduce . . . . . . . 1059--1070 Norases Vesdapunt and Kedar Bellare and Nilesh Dalvi Crowdsourcing algorithms for entity resolution . . . . . . . . . . . . . . . 1071--1082 Wenfei Fan and Xin Wang and Yinghui Wu and Dong Deng Distributed graph simulation: impossibility and possibility . . . . . 1083--1094 Fabian Nagel and Gavin Bierman and Stratis D. Viglas Code generation for efficient query processing in managed runtimes . . . . . 1095--1106 Weimo Liu and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das Aggregate estimation over dynamic hidden web databases . . . . . . . . . . . . . 1107--1118 Manos Karpathiotakis and Miguel Branco and Ioannis Alagiannis and Anastasia Ailamaki Adaptive query processing on RAW data 1119--1130 Foto N. Afrati and Dan Delorey and Mosha Pasumansky and Jeffrey D. Ullman Storing and querying tree-structured records in Dremel . . . . . . . . . . . 1131--1142 Johannes Starlinger and Bryan Brancotte and Sarah Cohen-Boulakia and Ulf Leser Similarity search for scientific workflows . . . . . . . . . . . . . . . 1143--1154 Georgios Kellaris and Stavros Papadopoulos and Xiaokui Xiao and Dimitris Papadias Differentially private event sequences over infinite streams . . . . . . . . . 1155--1166 Nikhil Londhe and Vishrawas Gopalakrishnan and Aidong Zhang and Hung Q. Ngo and Rohini Srihari Matching titles with cross title web-search enrichment and community detection . . . . . . . . . . . . . . . 1167--1178 Shaoxu Song and Lei Chen and Hong Cheng On concise set of relative candidate keys . . . . . . . . . . . . . . . . . . 1179--1190 Hao Wei and Jeffrey Xu Yu and Can Lu and Ruoming Jin Reachability querying: an independent permutation labeling approach . . . . . 1191--1202 Minhao Jiang and Ada Wai-Chee Fu and Raymond Chi-Wing Wong and Yanyan Xu Hop doubling label indexing for point-to-point distance querying on scale-free networks . . . . . . . . . . 1203--1214 Fabian M. Suchanek and Nicoleta Preda Semantic culturomics . . . . . . . . . . 1215--1218 Jörn Kuhlenkamp and Markus Klems and Oliver Röss Benchmarking scalability and elasticity of distributed database systems . . . . 1219--1230 Yang Cao and Wenfei Fan and Tianyu Wo and Wenyuan Yu Bounded conjunctive queries . . . . . . 1231--1242 Anil Shanbhag and S. Sudarshan Optimizing join enumeration in transformation-based query optimizers 1243--1254 Marie Jacob and Benny Kimelfeld and Julia Stoyanovich A system for management and analysis of preference data . . . . . . . . . . . . 1255--1258 Ashish Gupta and Fan Yang and Jason Govig and Adam Kirsch and Kelvin Chan and Kevin Lai and Shuo Wu and Sandeep Govind Dhoot and Abhilash Rajesh Kumar and Ankur Agiwal and Sanjay Bhansali and Mingsheng Hong and Jamie Cameron and Masood Siddiqi and David Jones and Jeff Shute and Andrey Gubarev and Shivakumar Venkataraman and Divyakant Agrawal Mesa: geo-replicated, near real-time, scalable data warehousing . . . . . . . 1259--1270 John Liagouris and Nikos Mamoulis and Panagiotis Bouros and Manolis Terrovitis An effective encoding scheme for spatial RDF data . . . . . . . . . . . . . . . . 1271--1282 Ce Zhang and Christopher Ré DimmWitted: a study of main-memory statistical analytics . . . . . . . . . 1283--1294 Avrilia Floratou and Umar Farooq Minhas and Fatma Özcan SQL-on-Hadoop: full circle back to shared-nothing database architectures 1295--1306 Marco Guarnieri and David Basin Optimal security-aware query processing 1307--1318
Juwei Shi and Jia Zou and Jiaheng Lu and Zhao Cao and Shiqiang Li and Chen Wang MRTuner: a toolkit to enable holistic optimization for MapReduce jobs . . . . 1319--1330 Mohammad Sadoghi and Mustafa Canim and Bishwaranjan Bhattacharjee and Fabian Nagel and Kenneth A. Ross Reducing database locking contention through multi-version concurrency . . . 1331--1342 Xueyuan Su and Garret Swart and Brian Goetz and Brian Oliver and Paul Sandoz Changing engines in midstream: a Java stream computational model for big data processing . . . . . . . . . . . . . . . 1343--1354 Jae-Gil Lee and Gopi Attaluri and Ronald Barber and Naresh Chainani and Oliver Draese and Frederick Ho and Stratos Idreos and Min-Soo Kim and Sam Lightstone and Guy Lohman and Konstantinos Morfonios and Keshava Murthy and Ippokratis Pandis and Lin Qiao and Vijayshankar Raman and Vincent Kulandai Samy and Richard Sidle and Knut Stolze and Liping Zhang Joins on encoded and partitioned data 1355--1366 Meikel Poess and Tilmann Rabl and Hans-Arno Jacobsen and Brian Caufield TPC--DI: the first industry benchmark for data integration . . . . . . . . . . 1367--1378 Pankaj Gupta and Venu Satuluri and Ajeet Grewal and Siva Gurumurthy and Volodymyr Zhabiuk and Quannan Li and Jimmy Lin Real-time Twitter recommendation: online motif detection in large dynamic graphs 1379--1380 Sang K. Cha and Kunsoo Park and Changbin Song and Kihong Kim and Cheol Ryu and Sunho Lee Interval disaggregate: a new operator for business planning . . . . . . . . . 1381--1392 Zhuo Zhang and Chao Li and Yangyu Tao and Renyu Yang and Hong Tang and Jie Xu Fuxi: a fault-tolerant resource management and job scheduling system at Internet scale . . . . . . . . . . . . . 1393--1404 David Simmen and Karl Schnaitter and Jeff Davis and Yingjie He and Sangeet Lohariwala and Ajay Mysore and Vinayak Shenoi and Mingfeng Tan and Yu Xiao Large-scale graph analytics in Aster 6: bringing context to big data discovery 1405--1416 Zhimin Chen and Vivek Narasayya and Surajit Chaudhuri Fast foreign-key detection in Microsoft SQL server PowerPivot for Excel . . . . 1417--1428 Meng-Chieh Yu and Tong Yu and Shao-Chen Wang and Chih-Jen Lin and Edward Y. Chang Big data small footprint: the design of a low-power classifier for detecting transportation modes . . . . . . . . . . 1429--1440 Oscar Boykin and Sam Ritchie and Ian O'Connell and Jimmy Lin Summingbird: a framework for integrating batch and online MapReduce computations 1441--1451 Rafi Ahmed and Rajkumar Sen and Meikel Poess and Sunil Chakkappen Of snowstorms and bushy trees . . . . . 1452--1461 Srinivas Vemuri and Maneesh Varshney and Krishna Puttaswamy and Rui Liu Execution primitives for scalable joins and aggregations in MapReduce . . . . . 1462--1473 Javier Arauz CAP limits in telecom subscriber database design . . . . . . . . . . . . 1474--1483 Nicolas Bruno and YongChul Kwon and Ming-Chuan Wu Advanced join strategies for large-scale distributed computation . . . . . . . . 1484--1495 Yue Liu and Songlin Hu and Tilmann Rabl and Wantao Liu and Hans-Arno Jacobsen and Kaifeng Wu and Jian Chen and Jintao Li DGFIndex for smart grid: enhancing Hive with a cost-effective multidimensional range index . . . . . . . . . . . . . . 1496--1507 Ying Yan and Liang Jeff Chen and Zheng Zhang Error-bounded sampling for analytics on big sparse data . . . . . . . . . . . . 1508--1519 Vinitha Reddy Gankidi and Nikhil Teletia and Jignesh M. Patel and Alan Halverson and David J. DeWitt Indexing HDFS data in PDW: splitting the data from the index . . . . . . . . . . 1520--1528 Chong Sun and Narasimhan Rampalli and Frank Yang and AnHai Doan Chimera: large-scale classification using machine learning, rules, and crowdsourcing . . . . . . . . . . . . . 1529--1540 Angela Bonifati and Radu Ciucanu and S\LAwek Staworko Interactive join query inference with JIM . . . . . . . . . . . . . . . . . . 1541--1544 Yuxin Zheng and Zhifeng Bao and Lidan Shou and Anthony K. H. Tung MESA: a map service to support fuzzy type-ahead search over geo-textual data 1545--1548 Henan Wang and Guoliang Li and Huiqi Hu and Shuo Chen and Bingwen Shen and Hao Wu and Wen-Syan Li and Kian-Lee Tan R3: a real-time route recommendation system . . . . . . . . . . . . . . . . . 1549--1552 Michael Benedikt and Julien Leblay and Efthymia Tsamoura PDQ: proof-driven query answering over Web-based data . . . . . . . . . . . . . 1553--1556 Naeemul Hassan and Afroza Sultana and You Wu and Gensheng Zhang and Chengkai Li and Jun Yang and Cong Yu Data in, fact out: automated monitoring of facts by FactWatcher . . . . . . . . 1557--1560 Mingxuan Yuan and Ke Deng and Jia Zeng and Yanhua Li and Bing Ni and Xiuqiang He and Fei Wang and Wenyuan Dai and Qiang Yang OceanST: a distributed analytic system for large-scale spatiotemporal mobile broadband data . . . . . . . . . . . . . 1561--1564 Floris Geerts and Giansalvatore Mecca and Paolo Papotti and Donatello Santoro That's all folks!: Llunatic goes open source . . . . . . . . . . . . . . . . . 1565--1568 Weimo Liu and Saad Bin Suhaim and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das and Ali Jaoua HDBTracker: monitoring the aggregates on dynamic hidden web databases . . . . . . 1569--1572 Fan Xia and Ye Li and Chengcheng Yu and Haixin Ma and Weining Qian BSMA: a benchmark for analytical queries over social media data . . . . . . . . . 1573--1576 André Petermann and Martin Junghanns and Robert Müller and Erhard Rahm Graph-based data integration and business intelligence with BIIIG . . . . 1577--1580 Manasi Vartak and Samuel Madden and Aditya Parameswaran and Neoklis Polyzotis SeeDB: automatically generating query visualizations . . . . . . . . . . . . . 1581--1584 Anshuman Dutt and Sumit Neelam and Jayant R. Haritsa QUEST: an exploratory approach to robust query processing . . . . . . . . . . . . 1585--1588 Chuan Lei and Zhongfang Zhuang and Elke A. Rundensteiner and Mohamed Y. Eltabakh Redoop infrastructure for recurring big data queries . . . . . . . . . . . . . . 1589--1592 Matteo Brucato and Rahul Ramakrishna and Azza Abouzied and Alexandra Meliou PackageBuilder: from tuples to packages 1593--1596 Yael Amsterdamer and Susan B. Davidson and Tova Milo and Slava Novgorodov and Amit Somech Ontology assisted crowd mining . . . . . 1597--1600 Lisi Chen and Yan Cui and Gao Cong and Xin Cao SOPS: a system for efficient processing of spatial-keyword publish/subscribe . . 1601--1604 Masumi Shirakawa and Takahiro Hara and Shojiro Nishio MLJ: language-independent real-time search of tweets reported by media outlets and journalists . . . . . . . . 1605--1608 Sebastian Breß and Bastian Köcher and Max Heimel and Volker Markl and Michael Saecker and Gunter Saake Ocelot/HyPE: optimized data processing on heterogeneous hardware . . . . . . . 1609--1612 Fei Wu and Tobias Kin Hou Lei and Zhenhui Li and Jiawei Han MoveMine 2.0: mining object relationships from movement data . . . . 1613--1616 Liwen Sun and Sanjay Krishnan and Reynold S. Xin and Michael J. Franklin A partitioning framework for aggressive data skipping . . . . . . . . . . . . . 1617--1620 Lei Cao and Qingyang Wang and Elke A. Rundensteiner Interactive outlier exploration in big data streams . . . . . . . . . . . . . . 1621--1624 Quoc-Cuong To and Benjamin Nguyen and Philippe Pucheral SQL/AA: executing SQL on an asymmetric architecture . . . . . . . . . . . . . . 1625--1628 Zhao Chen and Rui Fu and Ziyuan Zhao and Zheng Liu and Leihao Xia and Lei Chen and Peng Cheng and Caleb Chen Cao and Yongxin Tong and Chen Jason Zhang gMission: a general spatial crowdsourcing platform . . . . . . . . . 1629--1632 Ugur Cetintemel and Jiang Du and Tim Kraska and Samuel Madden and David Maier and John Meehan and Andrew Pavlo and Michael Stonebraker and Erik Sutherland and Nesime Tatbul and Kristin Tufte and Hao Wang and Stanley Zdonik S-Store: a streaming NewSQL system for big velocity applications . . . . . . . 1633--1636 Runquan Xie and Feida Zhu and Hui Ma and Wei Xie and Chen Lin CLEar: a real-time online observatory for bursty and viral events . . . . . . 1637--1640 Young-Kyoon Suh and Richard T. Snodgrass and Rui Zhang AZDBLab: a laboratory information system for large-scale empirical DBMS studies 1641--1644 Qi Wang and Manohar Kaul and Cheng Long and Raymond Chi-Wing Wong Terrain-Toolkit: a multi-functional tool for terrain data . . . . . . . . . . . . 1645--1648 Yupeng Fu and Kian Win Ong and Yannis Papakonstantinou and Erick Zamora Forward: data-centric ULS using declarative templates that efficiently wrap third-party JavaScript components 1649--1652 Xika Lin and Abhishek Mukherji and Elke A. Rundensteiner and Matthew O. Ward SPIRE: supporting parameter-driven interactive rule mining and exploration 1653--1656 Michael R. Anderson and Michael Cafarella and Yixing Jiang and Guan Wang and Bochun Zhang An integrated development environment for faster feature engineering . . . . . 1657--1660 Pengcheng Xiong and Hakan Hacigümüs Pronto: a software-defined networking based system for performance management of analytical queries on distributed data stores . . . . . . . . . . . . . . 1661--1664 Rui Zhang and Reshu Jain and Prasenjit Sarkar and Lukas Rupprecht Getting your big data priorities straight: a demonstration of priority-based QoS using social-network-driven stock recommendation . . . . . . . . . . . . . 1665--1668 Alekh Jindal and Praynaa Rawlani and Eugene Wu and Samuel Madden and Amol Deshpande and Mike Stonebraker Vertexica: your relational friend for graph analytics! . . . . . . . . . . . . 1669--1672 Abdul Quamar and Amol Deshpande and Jimmy Lin NScale: neighborhood-centric analytics on large graphs . . . . . . . . . . . . 1673--1676 Haoran Li and Li Xiong and Lifan Zhang and Xiaoqian Jiang DPSynthesizer: differentially private data synthesizer for privacy preserving data sharing . . . . . . . . . . . . . . 1677--1680 Longbo Kong and Zhi Liu and Yan Huang SPOT: locating social media users based on social network context . . . . . . . 1681--1684 Zohreh Alavi and Lu Zhou and James Powers and Keke Chen RASP-QS: efficient and confidential query services in the cloud . . . . . . 1685--1688 Mayuresh Kunjir and Prajakta Kalmegh and Shivnath Babu Thoth: towards managing a multi-system cluster . . . . . . . . . . . . . . . . 1689--1692 Lei Zhang and Achim Rettinger X-LiSA: cross-lingual semantic annotation . . . . . . . . . . . . . . . 1693--1696 Prasanth Jayachandran and Karthik Tunga and Niranjan Kamat and Arnab Nandi Combining user interaction, speculative query execution and sampling in the DICE system . . . . . . . . . . . . . . . . . 1697--1700 Han Su and Kai Zheng and Kai Zeng and Jiamin Huang and Xiaofang Zhou STMaker: a system to make sense of trajectory data . . . . . . . . . . . . 1701--1704 Uwe Jugel and Zbigniew Jerzak and Gregor Hackenbroich and Volker Markl Faster visual analytics through pixel-perfect aggregation . . . . . . . 1705--1708 Arijit Khan and Sameh Elnikety Systems for big-graphs . . . . . . . . . 1709--1710 Avigdor Gal Uncertain entity resolution: re-evaluating entity resolution in the big data era: tutorial . . . . . . . . . 1711--1712 Fabian M. Suchanek and Gerhard Weikum Knowledge bases in the age of big data analytics . . . . . . . . . . . . . . . 1713--1714 Alexandra Meliou and Sudeepa Roy and Dan Suciu Causality and explanations in databases 1715--1716 Yunyao Li and Ziyang Liu and Huaiyu Zhu Enterprise search in the big data era: recent developments and open challenges 1717--1718 Yunyao Li and Erich Neuhold VLDB 2014 Ph.D. workshop: an overview 1719--1719 Shivakumar Venkataraman and Divyakant Agrawal Datacenters as computers: Google engineering & database research perspectives . . . . . . . . . . . . . . 1720--1721 Hasso Plattner The impact of columnar in-memory databases on enterprise systems: implications of eliminating transaction-maintained aggregates . . . 1722--1729 Volker Markl Breaking the chains: on declarative data analysis and data independence in the big data era . . . . . . . . . . . . . . 1730--1733 Thomas Neumann Engineering high-performance database engines . . . . . . . . . . . . . . . . 1734--1741 Wei Cao and Feng Yu and Jiasen Xie Realization of the low cost and high performance MySQL cloud database . . . . 1742--1747 An Qin and Dianming Hu and Jun Liu and Wenjun Yang and Dai Tan Fatman: cost-saving and reliable archival storage based on volunteer resources . . . . . . . . . . . . . . . 1748--1753 Shiming Zhang and Yin Yang and Wei Fan and Marianne Winslett Design and implementation of a real-time interactive analytics system for large spatio-temporal data . . . . . . . . . . 1754--1759 Chaoyue Dai and Feng Qian and Wei Jiang and Zhoutian Wang and Zenghong Wu A personalized recommendation system for NetEase dating site . . . . . . . . . . 1760--1765 Zheng Jye Ling and Quoc Trung Tran and Ju Fan and Gerald C. H. Koh and Thi Nguyen and Chuen Seng Tan and James W. L. Yip and Meihui Zhang GEMINI: an integrative healthcare analytics system . . . . . . . . . . . . 1766--1771 Yongqiang Zou and Xing Jin and Yi Li and Zhimao Guo and Eryu Wang and Bin Xiao Mariana: Tencent deep learning platform and its applications . . . . . . . . . . 1772--1777 Sai Wu and Chun Chen and Gang Chen and Ke Chen and Lidan Shou and Hui Cao and He Bai YZStack: provisioning customizable solution for big data . . . . . . . . . 1778--1783 Yannis Klonatos and Christoph Koch and Tiark Rompf and Hassan Chafi Errata for ``Building efficient query engines in a high-level language'': PVLDB \bf 7(10):853--864 . . . . . . . . 1784--1784
Wei Lu and Shanshan Chen and Keqian Li and Laks V. S. Lakshmanan Show me the money: dynamic recommendations for revenue maximization 1785--1796 Peng Lu and Gang Chen and Beng Chin Ooi and Hoang Tam Vo and Sai Wu ScalaGiST: scalable generalized search trees for MapReduce systems [innovative systems paper] . . . . . . . . . . . . . 1797--1808 Mohan Yang and Bolin Ding and Surajit Chaudhuri and Kaushik Chakrabarti Finding patterns in a knowledge base using keywords to compose table answers 1809--1820 Da Yan and James Cheng and Kai Xing and Yi Lu and Wilfred Ng and Yingyi Bu Pregel algorithms for graph connectivity problems with performance guarantees . . 1821--1832 Zechao Shang and Jeffrey Xu Yu Auto-approximation of graph computing 1833--1844 Tim Furche and Georg Gottlob and Giovanni Grasso and Xiaonan Guo and Giorgio Orsi and Christian Schallhart and Cheng Wang DIADEM: thousands of websites to a single database . . . . . . . . . . . . 1845--1856 Wentao Wu and Xi Wu and Hakan Hacigümüs and Jeffrey F. Naughton Uncertainty aware query execution time prediction . . . . . . . . . . . . . . . 1857--1868 George Konstantinidis and José Luis Ambite Optimizing the chase: scalable data integration under constraints . . . . . 1869--1880 Manos Athanassoulis and Anastasia Ailamaki BF-tree: approximate tree indexing . . . 1881--1892 Pinar Tözün and Islam Atta and Anastasia Ailamaki and Andreas Moshovos ADDICT: advanced instruction chasing for transactions . . . . . . . . . . . . . . 1893--1904 Sattam Alsubaiee and Yasser Altowim and Hotham Altwaijry and Alexander Behm and Vinayak Borkar and Yingyi Bu and Michael Carey and Inci Cetindil and Madhusudan Cheelangi and Khurram Faraaz and Eugenia Gabrielova and Raman Grover and Zachary Heilbron and Young-Seok Kim and Chen Li and Guangqiang Li and Ji Mahn Ok and Nicola Onose and Pouria Pirzadeh and Vassilis Tsotras and Rares Vernica and Jian Wen and Till Westmann AsterixDB: a scalable, open source BDMS 1905--1916 Ning Xu and Lei Chen and Bin Cui LogGP: a log-based dynamic graph partitioning method . . . . . . . . . . 1917--1928 George Papadakis and George Papastefanatos and Georgia Koutrika Supervised meta-blocking . . . . . . . . 1929--1940 Min Xie and Laks V. S. Lakshmanan and Peter T. Wood Generating top-$k$ packages via preference elicitation . . . . . . . . . 1941--1952 Rui Li and Alex X. Liu and Ann L. Wang and Bezawada Bruhadeshwar Fast range query processing with strong privacy protection for cloud computing 1953--1964 Yihan Gao and Aditya Parameswaran Finish them!: pricing algorithms for human computation . . . . . . . . . . . 1965--1976 Michele Catasta and Alberto Tonon and Djellel Eddine Difallah and Gianluca Demartini and Karl Aberer and Philippe Cudre-Mauroux TransactiveDB: tapping into collective human memories . . . . . . . . . . . . . 1977--1980 Da Yan and James Cheng and Yi Lu and Wilfred Ng Blogel: a block-centric framework for distributed computation on real-world graphs . . . . . . . . . . . . . . . . . 1981--1992 John Liagouris and Manolis Terrovitis Efficient identification of implicit facts in incomplete OWL2-EL knowledge bases . . . . . . . . . . . . . . . . . 1993--2004 Chen Jason Zhang and Yongxin Tong and Lei Chen Where to: crowd-aided path selection . . 2005--2016 Yan Huang and Favyen Bastani and Ruoming Jin and Xiaoyang Sean Wang Large scale real-time ridesharing with service guarantee on road networks . . . 2017--2028
Yifang Sun and Wei Wang and Jianbin Qin and Ying Zhang and Xuemin Lin SRS: solving $c$-approximate nearest neighbor queries in high dimensional Euclidean space with a tiny index . . . 1--12 Michele Dallachiesa and Themis Palpanas and Ihab F. Ilyas Top-$k$ nearest neighbor search in uncertain data series . . . . . . . . . 13--24 Jiexing Li and Jeffrey Naughton and Rimma V. Nehme Resource bricolage for parallel database systems . . . . . . . . . . . . . . . . 25--36 Goetz Graefe and Haris Volos and Hideaki Kimura and Harumi Kuno and Joseph Tucek and Mark Lillibridge and Alistair Veitch In-memory performance for big data . . . 37--48 Cheng Long and Raymond Chi-Wing Wong and H. V. Jagadish Trajectory simplification: on minimizing the direction-based error . . . . . . . 49--60 Kareem El Gebaly and Parag Agrawal and Lukasz Golab and Flip Korn and Divesh Srivastava Interpretable and informative explanations of outcomes . . . . . . . . 61--72 Fei Li and H. V. Jagadish Constructing an interactive natural language interface for relational databases . . . . . . . . . . . . . . . 73--84 Yuanyuan Zhu and Jeffrey Xu Yu and Lu Qin Leveraging graph dimensions in online graph search . . . . . . . . . . . . . . 85--96 Darius Sidlauskas and Christian S. Jensen Spatial joins in main memory: implementation matters! . . . . . . . . 97--100
Xiaoyang Wang and Ying Zhang and Wenjie Zhang and Xuemin Lin and Wei Wang Selectivity estimation on streaming spatio-textual data using local correlations . . . . . . . . . . . . . . 101--112 Chuanwen Li and Yu Gu and Jianzhong Qi and Ge Yu and Rui Zhang and Wang Yi Processing moving $k$ NN queries using influential neighbor sets . . . . . . . 113--124 Barzan Mozafari and Purna Sarkar and Michael Franklin and Michael Jordan and Samuel Madden Scaling up crowd-sourcing to very large datasets: a case for active learning . . 125--136 Dingyu Yang and Dongxiang Zhang and Kian-Lee Tan and Jian Cao and Frédéric Le Mouël CANDS: continuous optimal navigation via distributed stream processing . . . . . 137--148 Nurjahan Begum and Eamonn Keogh Rare time series motif discovery from unbounded streams . . . . . . . . . . . 149--160 Yingyi Bu and Vinayak Borkar and Jianfeng Jia and Michael J. Carey and Tyson Condie Pregelix: Big(ger) graph analytics on a dataflow engine . . . . . . . . . . . . 161--172 Shriram Sridharan and Jignesh M. Patel Profiling R on a contemporary processor 173--184
Peter Bailis and Alan Fekete and Michael J. Franklin and Ali Ghodsi and Joseph M. Hellerstein and Ion Stoica Coordination avoidance in database systems . . . . . . . . . . . . . . . . 185--196 Qiang Zeng and Jignesh M. Patel and David Page QuickFOIL: scalable inductive logic programming . . . . . . . . . . . . . . 197--208 Xiangyao Yu and George Bezerra and Andrew Pavlo and Srinivas Devadas and Michael Stonebraker Staring into the abyss: an evaluation of concurrency control with one thousand cores . . . . . . . . . . . . . . . . . 209--220 Immanuel Trummer and Christoph Koch Multi-objective parametric query optimization . . . . . . . . . . . . . . 221--232 Jana Giceva and Gustavo Alonso and Timothy Roscoe and Tim Harris Deployment of query plans on multicores 233--244 Rebecca Taft and Essam Mansour and Marco Serafini and Jennie Duggan and Aaron J. Elmore and Ashraf Aboulnaga and Andrew Pavlo and Michael Stonebraker E-store: fine-grained elastic partitioning for distributed transaction processing systems . . . . . . . . . . . 245--256 Saravanan Thirumuruganathan and Habibur Rahman and Sofiane Abbar and Gautam Das Beyond itemsets: mining frequent featuresets over structured items . . . 257--268 Jun Zhang and Chaokun Wang and Jianmin Wang and Jeffrey Xu Yu Inferring continuous dynamic social influence and personal preference for temporal behavior prediction . . . . . . 269--280 Yi Lu and James Cheng and Da Yan and Huanhuan Wu Large-scale distributed graph computing systems: an experimental evaluation . . 281--292 Hiroshi Inoue and Moriyoshi Ohara and Kenjiro Taura Faster set intersection with SIMD instructions by reducing branch mispredictions . . . . . . . . . . . . . 293--304 Ahmed El-Kishky and Yanglei Song and Chi Wang and Clare R. Voss and Jiawei Han Scalable topical phrase mining from text corpora . . . . . . . . . . . . . . . . 305--316 Wenbo Tao and Minghe Yu and Guoliang Li Efficient top-$k$ simrank-based similarity join . . . . . . . . . . . . 317--328
Jiong He and Shuhao Zhang and Bingsheng He In-cache query co-processing on coupled CPU--GPU architectures . . . . . . . . . 329--340 Yasuhiro Fujiwara and Go Irie and Shari Kuroyama and Makoto Onizuka Scaling manifold ranking based image retrieval . . . . . . . . . . . . . . . 341--352 R. Barber and G. Lohman and I. Pandis and V. Raman and R. Sidle and G. Attaluri and N. Chainani and S. Lightstone and D. Sharpe Memory-efficient hash joins . . . . . . 353--364 Bogdan Alexe and Mary Roth and Wang-Chiew Tan Preference-aware integration of temporal data . . . . . . . . . . . . . . . . . . 365--376 Chang Zhou and Jun Gao and Binbin Sun and Jeffrey Xu Yu MOCgraph: scalable distributed graph processing using message online computing . . . . . . . . . . . . . . . 377--388 Jian Huang and Karsten Schwan and Moinuddin K. Qureshi NVRAM-aware logging in transaction systems . . . . . . . . . . . . . . . . 389--400 Badrish Chandramouli and Jonathan Goldstein and Mike Barnett and Robert DeLine and Danyel Fisher and John C. Platt and James F. Terwilliger and John Wernsing Trill: a high-performance incremental query processor for diverse analytics 401--412 Chunyao Song and Tingjian Ge and Cindy Chen and Jie Wang Event pattern matching over graph streams . . . . . . . . . . . . . . . . 413--424 Qi Li and Yaliang Li and Jing Gao and Lu Su and Bo Zhao and Murat Demirbas and Wei Fan and Jiawei Han A confidence-aware approach for truth discovery on long-tail data . . . . . . 425--436 Yanyan Shen and Gang Chen and H. V. Jagadish and Wei Lu and Beng Chin Ooi and Bogdan Marius Tudor Fast failure recovery in distributed graph processing systems . . . . . . . . 437--448 Manuel Then and Moritz Kaufmann and Fernando Chirigati and Tuan-Anh Hoang-Vu and Kien Pham and Alfons Kemper and Thomas Neumann and Huy T. Vo The more the merrier: efficient multi-source graph traversal . . . . . . 449--460
Sebastian Wandelt and Ulf Leser MRCSI: compressing and searching string collections with multiple references . . 461--472 Rui Ding and Qiang Wang and Yingnong Dang and Qiang Fu and Haidong Zhang and Dongmei Zhang YADING: fast clustering of large-scale time series data . . . . . . . . . . . . 473--484 Ting Wu and Lei Chen and Pan Hui and Chen Jason Zhang and Weikai Li Hear the whole story: towards the diversity of opinion in crowdsourcing markets . . . . . . . . . . . . . . . . 485--496 Andreas Chatzistergiou and Marcelo Cintra and Stratis D. Viglas REWIND: recovery write-ahead system for in-memory non-volatile data-structures 497--508 Rong-Hua Li and Lu Qin and Jeffrey Xu Yu and Rui Mao Influential community search in large networks . . . . . . . . . . . . . . . . 509--520 Albert Kim and Eric Blais and Aditya Parameswaran and Piotr Indyk and Sam Madden and Ronitt Rubinfeld Rapid sampling for visualizations with ordering guarantees . . . . . . . . . . 521--532 Lijun Chang and Xuemin Lin and Wenjie Zhang and Jeffrey Xu Yu and Ying Zhang and Lu Qin Optimal enumeration: efficient top-$k$ tree matching . . . . . . . . . . . . . 533--544 Arnon Lazerson and Izchak Sharfman and Daniel Keren and Assaf Schuster and Minos Garofalakis and Vasilis Samoladas Monitoring distributed streams using convex decompositions . . . . . . . . . 545--556 Kun Li and Daisy Zhe Wang and Alin Dobra and Christopher Dudley UDA-GIST: an in-database framework to unify data-parallel and state-parallel analytics . . . . . . . . . . . . . . . 557--568 Weiren Yu and Julie A. McCann Efficient partial-pairs SimRank search on large networks . . . . . . . . . . . 569--580 Wolfgang Gatterbauer and Stephan Günnemann and Danai Koutra and Christos Faloutsos Linearized and single-pass belief propagation . . . . . . . . . . . . . . 581--592 Loc Do and Hady W. Lauw and Ke Wang Mining revenue-maximizing bundling configuration . . . . . . . . . . . . . 593--604 Shiyu Yang and Muhammad Aamir Cheema and Xuemin Lin and Wei Wang Reverse $k$ nearest neighbors query processing: experiments and analysis . . 605--616 Xuguang Ren and Junhu Wang Exploiting vertex relationships in speeding up subgraph isomorphism over large graphs . . . . . . . . . . . . . . 617--628 Wolfgang Gatterbauer and Dan Suciu Approximate lifted inference with probabilistic databases . . . . . . . . 629--640 Norases Vesdapunt and Kedar Bellare and Nilesh Dalvi Errata for ``Crowdsourcing algorithms for entity resolution'': (PVLDB \bf 7(12): 1071--1082) . . . . . . . . . . . 641--641
Saurabh Jha and Bingsheng He and Mian Lu and Xuntao Cheng and Huynh Phung Huynh Improving main memory hash joins on Intel Xeon Phi processors: an experimental approach . . . . . . . . . 642--653 Mohammad Hammoud and Dania Abed Rabbou and Reza Nouri and Seyed-Mehdi-Reza Beheshti and Sherif Sakr DREAM: distributed RDF engine with adaptive query planner and minimal communication . . . . . . . . . . . . . 654--665 Shuo Chen and Ju Fan and Guoliang Li and Jianhua Feng and Kian-lee Tan and Jinhui Tang Online topic-aware influence maximization . . . . . . . . . . . . . . 666--677 Azade Nazi and Zhuojie Zhou and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das Walk, not wait: faster sampling over online social networks . . . . . . . . . 678--689 Michael Benedikt and Julien Leblay and Efthymia Tsamoura Querying with access patterns and integrity constraints . . . . . . . . . 690--701
Kanat Tangwongsan and Martin Hirzel and Scott Schneider and Kun-Lung Wu General incremental sliding-window aggregation . . . . . . . . . . . . . . 702--713 Chuan Lei and Zhongfang Zhuang and Elke A. Rundensteiner and Mohamed Eltabakh Shared execution of recurring workloads in MapReduce . . . . . . . . . . . . . . 714--725 Vivek Narasayya and Ishai Menache and Mohit Singh and Feng Li and Manoj Syamala and Surajit Chaudhuri Sharing buffer pool memory in multi-tenant relational database-as-a-service . . . . . . . . . 726--737 Yunjun Gao and Qing Liu and Gang Chen and Baihua Zheng and Linlin Zhou Answering why-not questions on reverse top-$k$ queries . . . . . . . . . . . . 738--749 Dimitrios Papadopoulos and Charalampos Papamanthou and Roberto Tamassia and Nikos Triandopoulos Practical authenticated pattern matching with optimal proof size . . . . . . . . 750--761 Dumitrel Loghin and Bogdan Marius Tudor and Hao Zhang and Beng Chin Ooi and Yong Meng Teo A performance study of big data on small nodes . . . . . . . . . . . . . . . . . 762--773 Thorsten Papenbrock and Sebastian Kruse and Jorge-Arnulfo Quiané-Ruiz and Felix Naumann Divide & conquer-based inclusion dependency discovery . . . . . . . . . . 774--785 Shimin Chen and Qin Jin Persistent B+-trees in non-volatile main memory . . . . . . . . . . . . . . . . . 786--797 Yubao Wu and Ruoming Jin and Jing Li and Xiang Zhang Robust local community detection: on free rider effect and its elimination 798--809 Hua Fan and Aditya Ramaraju and Marlon McKenzie and Wojciech Golab and Bernard Wong Understanding the causes of consistency anomalies in Apache Cassandra . . . . . 810--813 Cigdem Aslay and Wei Lu and Francesco Bonchi and Amit Goyal and Laks V. S. Lakshmanan Viral marketing meets social advertising: ad allocation with minimum regret . . . . . . . . . . . . . . . . . 814--825
Lingyang Chu and Shuhui Wang and Siyuan Liu and Qingming Huang and Jian Pei ALID: scalable dominant cluster detection . . . . . . . . . . . . . . . 826--837 Yingxia Shao and Bin Cui and Lei Chen and Mingming Liu and Xing Xie An efficient similarity search framework for SimRank over large dynamic graphs 838--849 Muhammad Yousuf Ahmad and Bettina Kemme Compaction management in distributed key--value datastores . . . . . . . . . 850--861 Rachid Guerraoui and Anne-Marie Kermarrec and Rhicheek Patra and Mahsa Taziki D2P: distance-based differential privacy in recommenders . . . . . . . . . . . . 862--873 Ioannis Mitliagkas and Michael Borokhovich and Alexandros G. Dimakis and Constantine Caramanis FrogWild!: fast PageRank approximations on graph engines . . . . . . . . . . . . 874--885 Andrea Vattani and Flavio Chierichetti and Keegan Lowenstein Optimal probabilistic cache stampede prevention . . . . . . . . . . . . . . . 886--897
Navneet Potti and Jignesh M. Patel DAQ: a new paradigm for approximate query processing . . . . . . . . . . . . 898--909 Nicolas Anciaux and Saliha Lallali and Iulian Sandu Popa and Philippe Pucheral A scalable search engine for mass storage smart objects . . . . . . . . . 910--921 Lanjun Wang and Shuo Zhang and Juwei Shi and Limei Jiao and Oktie Hassanzadeh and Jia Zou and Chen Wangz Schema management for document stores 922--933 Felix Martin Schuhknecht and Pankaj Khanchandani and Jens Dittrich On the surprising difficulty of simple things: the case of radix partitioning 934--937 Xin Luna Dong and Evgeniy Gabrilovich and Kevin Murphy and Van Dang and Wilko Horn and Camillo Lugaresi and Shaohua Sun and Wei Zhang Knowledge-based trust: estimating the trustworthiness of web sources . . . . . 938--949 Minyang Han and Khuzaima Daudjee Giraph unchained: barrierless asynchronous parallel execution in Pregel-like graph processing systems . . 950--961 Kenneth S. Bògh and Sean Chester and Ira Assent Work-efficient parallel skyline computation for the GPU . . . . . . . . 962--973
Longbin Lai and Lu Qin and Xuemin Lin and Lijun Chang Scalable subgraph enumeration in MapReduce . . . . . . . . . . . . . . . 974--985 Jan Finis and Robert Brunel and Alfons Kemper and Thomas Neumann and Norman May and Franz Faerber Indexing highly dynamic hierarchical data . . . . . . . . . . . . . . . . . . 986--997 Meng Wang and Chaokun Wang and Jeffrey Xu Yu and Jun Zhang Community detection in social networks: an in-depth benchmarking study with a procedure-oriented framework . . . . . . 998--1009 Ehsan Kazemi and S. Hamed Hassani and Matthias Grossglauser Growing a graph matching from a handful of seeds . . . . . . . . . . . . . . . . 1010--1021 Peng Cheng and Xiang Lian and Zhao Chen and Rui Fu and Lei Chen and Jinsong Han and Jizhong Zhao Reliable diversity-based spatial crowdsourcing by moving workers . . . . 1022--1033 Zhuojie Zhou and Nan Zhang and Gautam Das Leveraging history for faster sampling of online social networks . . . . . . . 1034--1045 Yufei Ding and Xipeng Shen and Madanlal Musuvathi and Todd Mytkowicz TOP: a framework for enabling algorithmic optimizations for distance-related problems . . . . . . . 1046--1057 Viktor Leis and Kan Kundhikanjana and Alfons Kemper and Thomas Neumann Efficient processing of window functions in analytical SQL queries . . . . . . . 1058--1069 Yuchen Li and Dongxiang Zhang and Kian-Lee Tan Real-time targeted influence maximization for online advertisements 1070--1081 Thorsten Papenbrock and Jens Ehrlich and Jannik Marten and Tommy Neubert and Jan-Peer Rudolph and Martin Schönberg and Jakob Zwiener and Felix Naumann Functional dependency discovery: an experimental evaluation of seven algorithms . . . . . . . . . . . . . . . 1082--1093 Alexander Kalinin and Ugur Cetintemel and Stan Zdonik Searchlight: enabling integrated search and exploration over large multidimensional data . . . . . . . . . 1094--1105 Md Farhadur Rahman and Weimo Liu and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das Privacy implications of database ranking 1106--1117
Henning Köhler and Sebastian Link and Xiaofang Zhou Possible and certain SQL keys . . . . . 1118--1129 Yu Tang and Yilun Cai and Nikos Mamoulis Scaling similarity joins over tree-structured data . . . . . . . . . . 1130--1141 Habibur Rahman and Saravanan Thirumuruganathan and Senjuti Basu Roy and Sihem Amer-Yahia and Gautam Das Worker skill estimation in team-based tasks . . . . . . . . . . . . . . . . . 1142--1153 Xi He and Graham Cormode and Ashwin Machanavajjhala and Cecilia M. Procopiuc and Divesh Srivastava DPT: differentially private trajectory synthesis using hierarchical reference systems . . . . . . . . . . . . . . . . 1154--1165 Boduo Li and Yanlei Diao and Prashant Shenoy Supporting scalable analytics with latency constraints . . . . . . . . . . 1166--1177 Hiroaki Shiokawa and Yasuhiro Fujiwara and Makoto Onizuka SCAN++: efficient algorithm for finding clusters, hubs and outliers on large-scale graphs . . . . . . . . . . . 1178--1189 Jose M. Faleiro and Daniel J. Abadi Rethinking serializable multiversion concurrency control . . . . . . . . . . 1190--1201 Bryan Brancotte and Bo Yang and Guillaume Blin and Sarah Cohen-Boulakia and Alain Denise and Sylvie Hamel Rank aggregation with ties: experiments and analysis . . . . . . . . . . . . . . 1202--1213 Narayanan Sundaram and Nadathur Satish and Md Mostofa Ali Patwary and Subramanya R. Dulloor and Michael J. Anderson and Satya Gautam Vadlamudi and Dipankar Das and Pradeep Dubey GraphMat: high performance graph analytics made productive . . . . . . . 1214--1225 Kai Zhang and Kaibo Wang and Yuan Yuan and Lei Guo and Rubao Lee and Xiaodong Zhang Mega-KV: a case for GPUs to maximize the throughput of in-memory key--value stores . . . . . . . . . . . . . . . . . 1226--1237 Jinha Kim and Hyungyu Shin and Wook-Shin Han and Sungpack Hong and Hassan Chafi Taming subgraph isomorphism for RDF query processing . . . . . . . . . . . . 1238--1249 Lilong Jiang and Arnab Nandi SnapToQuery: providing interactive feedback during exploratory query specification . . . . . . . . . . . . . 1250--1261 Yang Zhou and Ling Liu and Kisung Lee and Qi Zhang GraphTwist: fast iterative graph computation with two-tier optimizations 1262--1273 Hiroshi Inoue and Kenjiro Taura SIMD- and cache-friendly algorithm for sorting an array of structures . . . . . 1274--1285 Shaoxu Song and Aoqian Zhang and Lei Chen and Jianmin Wang Enriching data imputation with extensive similarity neighbors . . . . . . . . . . 1286--1297 Darko Makreshanski and Justin Levandoski and Ryan Stutsman To lock, swap, or elide: on the interplay of hardware transactional memory and lock-free indexing . . . . . 1298--1309 Jaeho Shin and Sen Wu and Feiran Wang and Christopher De Sa and Ce Zhang and Christopher Ré Incremental knowledge base construction using DeepDive . . . . . . . . . . . . . 1310--1321 Li Qian and Jinyang Gao and H. V. Jagadish Learning user preferences by adaptive pairwise comparison . . . . . . . . . . 1322--1333
Weimo Liu and Md Farhadur Rahman and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das Aggregate estimations over location based services . . . . . . . . . . . . . 1334--1345 Souvik Bhattacherjee and Amit Chavan and Silu Huang and Amol Deshpande and Aditya Parameswaran Principles of dataset versioning: exploring the recreation/storage tradeoff . . . . . . . . . . . . . . . . 1346--1357 Yeye He and Kris Ganjam and Xu Chu SEMA--JOIN: joining semantically-related tables using big table corpora . . . . . 1358--1369 Sanjay Krishnan and Jiannan Wang and Michael J. Franklin and Ken Goldberg and Tim Kraska Stale view cleaning: getting fresh answers from stale materialized views 1370--1381 Parth Nagarkar and K. Selçuk Candan and Aneesha Bhat Compressed spatial hierarchical bitmap (cSHB) indexes for efficiently processing spatial range query workloads 1382--1393 Daniel Deutch and Amir Gilad and Yuval Moskovitch Selective provenance for datalog programs using top-$k$ queries . . . . . 1394--1405 Yoonjae Park and Jun-Ki Min and Kyuseok Shim Processing of probabilistic skyline queries using MapReduce . . . . . . . . 1406--1417 Xiaofei Zhang and Hong Cheng and Lei Chen Bonding vertex sets over distributed graph: a betweenness aware approach . . 1418--1429 Yael Amsterdamer and Anna Kukliansky and Tova Milo A natural language interface for querying general and individual knowledge . . . . . . . . . . . . . . . 1430--1441 Iraklis Psaroudakis and Tobias Scheuer and Norman May and Abdelkader Sellami and Anastasia Ailamaki Scaling up concurrent main-memory column-store scans: towards adaptive NUMA-aware data and task placement . . . 1442--1453 Gihwan Oh and Sangchul Kim and Sang-Won Lee and Bongki Moon SQLite optimization with phase change memory for mobile applications . . . . . 1454--1465 Andrew Crotty and Alex Galakatos and Kayhan Dursun and Tim Kraska and Carsten Binnig and Ugur Cetintemel and Stan Zdonik An architecture for compiling UDF-centric workflows . . . . . . . . . 1466--1477 Daniel Margo and Margo Seltzer A scalable distributed graph partitioner 1478--1489 Artyom Sharov and Alexander Shraer and Arif Merchant and Murray Stokely Take me to your leader!: online optimization of distributed storage configurations . . . . . . . . . . . . . 1490--1501 Wenfei Fan and Xin Wang and Yinghui Wu and Jingbo Xu Association rules with graph patterns 1502--1513 Ben Kimmett and Venkatesh Srinivasan and Alex Thomo Fuzzy joins in MapReduce: an experimental study . . . . . . . . . . . 1514--1517 Minsik Cho and Daniel Brand and Rajesh Bordawekar and Ulrich Finkler and Vincent Kulandaisamy and Ruchir Puri PARADIS: an efficient parallel algorithm for in-place radix sort . . . . . . . . 1518--1529 David Vengerov and Andre Cavalheiro Menck and Mohamed Zait and Sunil P. Chakkappen Join size estimation subject to filter conditions . . . . . . . . . . . . . . . 1530--1541 Jingjing Wang and Magdalena Balazinska and Daniel Halperin Asynchronous and fault-tolerant recursive datalog evaluation in shared-nothing engines . . . . . . . . . 1542--1553 Kyriakos Mouratidis and Jilian Zhang and HweeHwa Pang Maximum rank query . . . . . . . . . . . 1554--1565 Foteini Katsarou and Nikos Ntarmos and Peter Triantafillou Performance and scalability of indexed subgraph query processing methods . . . 1566--1577 Ying Yang and Niccol\`o Meneghetti and Ronny Fehling and Zhen Hua Liu and Oliver Kennedy Lenses: an on-demand approach to ETL . . 1578--1589 Wenfei Fan and Zhe Fan and Chao Tian and Xin Luna Dong Keys for graphs . . . . . . . . . . . . 1590--1601 Ahmed Eldawy and Louai Alarabi and Mohamed F. Mokbel Spatial partitioning techniques in SpatialHadoop . . . . . . . . . . . . . 1602--1605 Tomohiro Manabe and Keishi Tajima Extracting logical hierarchical structure of HTML documents based on headings . . . . . . . . . . . . . . . . 1606--1617 Bilegsaikhan Naidan and Leonid Boytsov and Eric Nyberg Permutation search methods are efficient, yet faster search is possible 1618--1629 Niloy Mukherjee and Shasank Chavan and Maria Colgan and Dinesh Das and Mike Gleeson and Sanket Hase and Allison Holloway and Hui Jin and Jesse Kamp and Kartik Kulkarni and Tirthankar Lahiri and Juan Loaiza and Neil Macnaughton and Vineet Marwah and Atrayee Mullick and Andy Witkowski and Jiaqi Yan and Mohamed Zait Distributed architecture of Oracle database in-memory . . . . . . . . . . . 1630--1641 Daniel Haas and Jason Ansel and Lydia Gu and Adam Marcus Argonaut: macrotask crowdsourcing for complex data processing . . . . . . . . 1642--1653 Guozhang Wang and Joel Koshy and Sriram Subramanian and Kartik Paramasivam and Mammad Zadeh and Neha Narkhede and Jun Rao and Jay Kreps and Joe Stein Building a replicated logging system with Apache Kafka . . . . . . . . . . . 1654--1655 Alessandra Loro and Anja Gruenheid and Donald Kossmann and Damien Profeta and Philippe Beaudequin Indexing and selecting hierarchical business logic . . . . . . . . . . . . . 1656--1667 Dharma Shukla and Shireesh Thota and Karthik Raman and Madhan Gajendran and Ankur Shah and Sergii Ziuzin and Krishnan Sundaram and Miguel Gonzalez Guajardo and Anna Wawrzyniak and Samer Boshra and Renato Ferreira and Mohamed Nassar and Michael Koltachev and Ji Huang and Sudipta Sengupta and Justin Levandoski and David Lomet Schema-agnostic indexing with Azure DocumentDB . . . . . . . . . . . . . . . 1668--1679 Eric Boutin and Paul Brett and Xiaoyu Chen and Jaliya Ekanayake and Tao Guan and Anna Korsun and Zhicheng Yin and Nan Zhang and Jingren Zhou JetScope: reliable and interactive analytics at cloud scale . . . . . . . . 1680--1691 Xueyang Hu and Mingxuan Yuan and Jianguo Yao and Yu Deng and Lei Chen and Qiang Yang and Haibing Guan and Jia Zeng Differential privacy in telco big data platform . . . . . . . . . . . . . . . . 1692--1703 Amr El-Helw and Venkatesh Raghavan and Mohamed A. Soliman and George Caragea and Zhongxian Gu and Michalis Petropoulos Optimization of common table expressions in MPP database systems . . . . . . . . 1704--1715 Anil K. Goel and Jeffrey Pound and Nathan Auch and Peter Bumbulis and Scott MacLean and Franz Färber and Francis Gropengiesser and Christian Mathis and Thomas Bodner and Wolfgang Lehner Towards scalable real-time analytics: an architecture for scale-out of OLxP workloads . . . . . . . . . . . . . . . 1716--1727 Tamraparni Dasu and Vladislav Shkapenyuk and Divesh Srivastava and Deborah F. Swayne FIT to monitor feed quality . . . . . . 1728--1739 Per-Åke Larson and Adrian Birka and Eric N. Hanson and Weiyun Huang and Michal Nowakiewicz and Vassilis Papadimos Real-time analytical processing with SQL server . . . . . . . . . . . . . . . . . 1740--1751 You Wu and Boulos Harb and Jun Yang and Cong Yu Efficient evaluation of object-centric exploration queries for visualization 1752--1763 Lin Qiao and Yinan Li and Sahil Takiar and Ziyang Liu and Narasimha Veeramreddy and Min Tu and Ying Dai and Issac Buenrostro and Kapil Surlaker and Shirshanka Das and Chavdar Botev Gobblin: unifying data ingestion for Hadoop . . . . . . . . . . . . . . . . . 1764--1769 Dinesh Das and Jiaqi Yan and Mohamed Zait and Satyanarayana R. Valluri and Nirav Vyas and Ramarajan Krishnamachari and Prashant Gaharwar and Jesse Kamp and Niloy Mukherjee Query optimization in Oracle 12c database in-memory . . . . . . . . . . . 1770--1781 Todd J. Green and Dan Olteanu and Geoffrey Washburn Live programming in the LogicBlox system: a MetaLogiQL approach . . . . . 1782--1791 Tyler Akidau and Robert Bradshaw and Craig Chambers and Slava Chernyak and Rafael J. Fernández-Moctezuma and Reuven Lax and Sam McVeety and Daniel Mills and Frances Perry and Eric Schmidt and Sam Whittle The dataflow model: a practical approach to balancing correctness, latency, and cost in massive-scale, unbounded, out-of-order data processing . . . . . . 1792--1803 Avery Ching and Sergey Edunov and Maja Kabiljo and Dionysios Logothetis and Sambavi Muthukrishnan One trillion edges: graph processing at Facebook-scale . . . . . . . . . . . . . 1804--1815 Tuomas Pelkonen and Scott Franklin and Justin Teller and Paul Cavallaro and Qi Huang and Justin Meza and Kaushik Veeraraghavan Gorilla: a fast, scalable, in-memory time series database . . . . . . . . . . 1816--1827 Rahul Potharaju and Joseph Chan and Luhui Hu and Cristina Nita-Rotaru and Mingshi Wang and Liyuan Zhang and Navendu Jain ConfSeer: leveraging customer support knowledge bases for automated misconfiguration detection . . . . . . . 1828--1839 Michael Armbrust and Tathagata Das and Aaron Davidson and Ali Ghodsi and Andrew Or and Josh Rosen and Ion Stoica and Patrick Wendell and Reynold Xin and Matei Zaharia Scaling spark in the real world: performance and usability . . . . . . . 1840--1843 Majed Sahli and Essam Mansour and Panos Kalnis StarDB: a large-scale DBMS for strings 1844--1847 Razen Harbi and Ibrahim Abdelaziz and Panos Kalnis and Nikos Mamoulis Evaluating SPARQL queries on massive RDF datasets . . . . . . . . . . . . . . . . 1848--1851 Ngai Meng Kou and Leong Hou U. and Nikos Mamoulis and Yuhong Li and Ye Li and Zhiguo Gong A topic-based reviewer assignment system 1852--1855 Miguel Liroz-Gistau and Reza Akbarinia and Patrick Valduriez FP--Hadoop: efficient execution of parallel jobs over skewed data . . . . . 1856--1859 Thorsten Papenbrock and Tanja Bergmann and Moritz Finke and Jakob Zwiener and Felix Naumann Data profiling with Metanome . . . . . . 1860--1863 Arun Kumar and Mona Jalal and Boqun Yan and Jeffrey Naughton and Jignesh M. Patel Demonstration of Santoku: optimizing machine learning over normalized data 1864--1867 Boon Siew Seah and Sourav S. Bhowmick and Aixin Sun PRISM: concept-preserving summarization of top-$k$ social image search results 1868--1871 Tobias Müller and Torsten Grust Provenance for SQL through abstract interpretation: value-less, but worthwhile . . . . . . . . . . . . . . . 1872--1875 Zhian He and Wai Kit Wong and Ben Kao and David Wai Lok Cheung and Rongbin Li and Siu Ming Yiu and Eric Lo SDB: a secure query processing system with data interoperability . . . . . . . 1876--1879 Ibrahim Abdelaziz and Razen Harbi and Semih Salihoglu and Panos Kalnis and Nikos Mamoulis SPARTex: a vertex-centric framework for RDF data analytics . . . . . . . . . . . 1880--1883 Lu Chen and Yunjun Gao and Zhihao Xing and Christian S. Jensen and Gang Chen I2RS: a distributed geo-textual image retrieval and recommendation system . . 1884--1887 Damian Bursztyn and François Goasdoué and Ioana Manolescu Reformulation-based query answering in RDF: alternatives and performance . . . 1888--1891 Marc Bux and Jörgen Brandt and Carsten Lipka and Kamal Hakimzadeh and Jim Dowling and Ulf Leser SAASFEE: scalable scientific workflow execution engine . . . . . . . . . . . . 1892--1895 Ahmed Eldawy and Mohamed F. Mokbel and Christopher Jonathan A demonstration of HadoopViz: an extensible MapReduce system for visualizing big spatial data . . . . . . 1896--1899 Moria Bergman and Tova Milo and Slava Novgorodov and Wang-Chiew Tan QOCO: a query oriented data cleaning system with oracles . . . . . . . . . . 1900--1903 Shanshan Ying and Flip Korn and Barna Saha and Divesh Srivastava TreeScope: finding structural anomalies in semi-structured data . . . . . . . . 1904--1907 A. Elmore and J. Duggan and M. Stonebraker and M. Balazinska and U. Cetintemel and V. Gadepally and J. Heer and B. Howe and J. Kepner and T. Kraska and S. Madden and D. Maier and T. Mattson and S. Papadopoulos and J. Parkhurst and N. Tatbul and M. Vartak and S. Zdonik A demonstration of the BigDAWG polystore system . . . . . . . . . . . . . . . . . 1908--1911 Kostas Zoumpatianos and Stratos Idreos and Themis Palpanas RINSE: interactive data series exploration with ADS+ . . . . . . . . . 1912--1915 Anant Bhardwaj and Amol Deshpande and Aaron J. Elmore and David Karger and Sam Madden and Aditya Parameswaran and Harihar Subramanyam and Eugene Wu and Rebecca Zhang Collaborative data analytics with DataHub . . . . . . . . . . . . . . . . 1916--1919 Jaeho Shin and Christopher Ré and Michael Cafarella Mindtagger: a demonstration of data labeling in knowledge base construction 1920--1923 Danai Koutra and Di Jin and Yuanchi Ning and Christos Faloutsos Perseus: an interactive large-scale graph mining and visualization tool . . 1924--1927 Manas Joglekar and Hector Garcia-Molina and Aditya Parameswaran Smart drill-down: a new data exploration operator . . . . . . . . . . . . . . . . 1928--1931 Curtis E. Dyreson and Sourav S. Bhowmick and Ryan Grapp Virtual eXist-db: liberating hierarchical queries from the shackles of access path dependence . . . . . . . 1932--1935 Eli Cortez and Philip A. Bernstein and Yeye He and Lev Novik Annotating database schemas to help enterprise search . . . . . . . . . . . 1936--1939 Nandish Jayaram and Sidharth Goyal and Chengkai Li VIIQ: auto-suggestion enabled visual interface for interactive graph query formulation . . . . . . . . . . . . . . 1940--1943 Qingyuan Liu and Eduard C. Dragut and Arjun Mukherjee and Weiyi Meng FLORIN: a system to support (near) real-time applications on user generated content on daily news . . . . . . . . . 1944--1947 Yunyao Li and Elmer Kim and Marc A. Touchette and Ramiya Venkatachalam and Hao Wang VINERy: a visual IDE for information extraction . . . . . . . . . . . . . . . 1948--1951 Xu Chu and John Morcos and Ihab F. Ilyas and Mourad Ouzzani and Paolo Papotti and Nan Tang and Yin Ye KATARA: reliable data cleaning with knowledge bases and crowdsourcing . . . 1952--1955 Foteini Alvanaki and Romulo Goncalves and Milena Ivanova and Martin Kersten and Kostis Kyzirakos GIS navigation boosted by column stores 1956--1959 Patricia C. Arocena and Radu Ciucanu and Boris Glavic and Renée J. Miller Gain control over your integration evaluations . . . . . . . . . . . . . . 1960--1963 Yanlei Diao and Kyriaki Dimitriadou and Zhan Li and Wenzhao Liu and Olga Papaemmanouil and Kemi Peng and Liping Peng AIDE: an automatic user navigation system for interactive data exploration 1964--1967 Ahmed M. Aly and Ahmed S. Abdelhamid and Ahmed R. Mahmood and Walid G. Aref and Mohamed S. Hassan and Hazem Elmeleegy and Mourad Ouzzani A demonstration of AQWA: adaptive query-workload-aware partitioning of big spatial data . . . . . . . . . . . . . . 1968--1971 Jens Dittrich and Patrick Bender Janiform intra-document analytics for reproducible research . . . . . . . . . 1972--1975 Erich Schubert and Alexander Koos and Tobias Emrich and Andreas Züfle and Klaus Arthur Schmid and Arthur Zimek A framework for clustering uncertain data . . . . . . . . . . . . . . . . . . 1976--1979 Nicole Bidoit and Melanie Herschel and Katerina Tzompanaki EFQ: why-not answer polynomials in action . . . . . . . . . . . . . . . . . 1980--1983 Xiaolan Wang and Mary Feng and Yue Wang and Xin Luna Dong and Alexandra Meliou Error diagnosis and data profiling with DataXRay . . . . . . . . . . . . . . . . 1984--1987 Quan Pham and Severin Thaler and Tanu Malik and Ian Foster and Boris Glavic Sharing and reproducing database applications . . . . . . . . . . . . . . 1988--1991 Marcin Wylot and Philippe Cudré-Mauroux and Paul Groth A demonstration of TripleProv: tracking and querying provenance over Web data 1992--1995 Stefano Ortona and Giorgio Orsi and Marcello Buoncristiano and Tim Furche WADaR: joint wrapper and data repair . . 1996--1999 Mangesh Bendre and Bofan Sun and Ding Zhang and Xinyan Zhou and Kevin Chen-Chuan Chang and Aditya Parameswaran DataSpread: unifying databases and spreadsheets . . . . . . . . . . . . . . 2000--2003 Daniel Haas and Sanjay Krishnan and Jiannan Wang and Michael J. Franklin and Eugene Wu Wisteria: nurturing scalable data cleaning infrastructure . . . . . . . . 2004--2007 Ashoke S. and Jayant R. Haritsa CODD: a dataless approach to big data testing . . . . . . . . . . . . . . . . 2008--2011 Sejla Cebiri\'c and François Goasdoué and Ioana Manolescu Query-oriented summarization of RDF graphs . . . . . . . . . . . . . . . . . 2012--2015 Yodsawalai Chodpathumwan and Amirhossein Aleyasen and Arash Termehchy and Yizhou Sun Universal-DB: towards representation independent graph analytics . . . . . . 2016--2019 Ahmed R. Mahmood and Ahmed M. Aly and Thamir Qadah and El Kindi Rezig and Anas Daghistani and Amgad Madkour and Ahmed S. Abdelhamid and Mohamed S. Hassan and Walid G. Aref and Saleh Basalamah Tornado: a distributed spatio-textual stream processing system . . . . . . . . 2020--2023 Andrew Crotty and Alex Galakatos and Emanuel Zgraggen and Carsten Binnig and Tim Kraska Vizdom: interactive analytics through pen and touch . . . . . . . . . . . . . 2024--2027 Mariano P. Consens and Valeria Fionda and Shahan Khatchadourian and Giuseppe Pirr\`o S+EPPs: construct and explore bisimulation summaries, plus optimize navigational queries; all on existing SPARQL systems . . . . . . . . . . . . . 2028--2031 Konstantinos Xirogiannopoulos and Udayan Khurana and Amol Deshpande GraphGen: exploring interesting graphs in relational data . . . . . . . . . . . 2032--2035 Dong Young Yoon and Barzan Mozafari and Douglas P. Brown DBSeer: pain-free database administration through workload intelligence . . . . . . . . . . . . . . 2036--2039 Arun Kejariwal and Sanjeev Kulkarni and Karthik Ramasamy Real time analytics: algorithms and systems . . . . . . . . . . . . . . . . 2040--2041 Arijit Khan and Lei Chen On uncertain graphs modeling and queries 2042--2043 Xin Luna Dong and Wang-Chiew Tan A time machine for information: looking back to look forward . . . . . . . . . . 2044--2045 Mahashweta Das and Gautam Das Structured analytics in social media . . 2046--2047 Jing Gao and Qi Li and Bo Zhao and Wei Fan and Jiawei Han Truth discovery and crowdsourcing aggregation: a unified perspective . . . 2048--2049 Daniel Abadi and Shivnath Babu and Fatma Özcan and Ippokratis Pandis SQL-on-Hadoop systems: tutorial . . . . 2050--2051 Juan Loaiza Engineering database hardware and software together . . . . . . . . . . . 2052--2052 Magdalena Balazinska Big data research: will industry solve all the problems? . . . . . . . . . . . 2053--2056 Todd Walter Big plateaus of Big Data on the big island . . . . . . . . . . . . . . . . . 2057--2057 Anastasia Ailamaki Databases and hardware: the beginning and sequel of a beautiful friendship . . 2058--2061
Ahmed M. Aly and Ahmed R. Mahmood and Mohamed S. Hassan and Walid G. Aref and Mourad Ouzzani and Hazem Elmeleegy and Thamir Qadah AQWA: adaptive query workload aware partitioning of big spatial data . . . . 2062--2073 Zuhair Khayyat and William Lucia and Meghna Singh and Mourad Ouzzani and Paolo Papotti and Jorge-Arnulfo Quiané-Ruiz and Nan Tang and Panos Kalnis Lightning fast and space efficient inequality joins . . . . . . . . . . . . 2074--2085 Jinfei Liu and Li Xiong and Jian Pei and Jun Luo and Haoyu Zhang Finding Pareto optimal groups: group-based skyline . . . . . . . . . . 2086--2097 Taylor Kessler Faulkner and Will Brackenbury and Ashwin Lall $k$-regret queries with nonlinear utilities . . . . . . . . . . . . . . . 2098--2109 Juwei Shi and Yunjie Qiu and Umar Farooq Minhas and Limei Jiao and Chen Wang and Berthold Reinwald and Fatma Özcan Clash of the titans: MapReduce vs. Spark for large scale data analytics . . . . . 2110--2121 Yu Liu and Jiaheng Lu and Hua Yang and Xiaokui Xiao and Zhewei Wei Towards maximum independent sets on massive graphs . . . . . . . . . . . . . 2122--2133 John Meehan and Nesime Tatbul and Stan Zdonik and Cansu Aslantas and Ugur Cetintemel and Jiang Du and Tim Kraska and Samuel Madden and David Maier and Andrew Pavlo and Michael Stonebraker and Kristin Tufte and Hao Wang S-Store: streaming meets transaction processing . . . . . . . . . . . . . . . 2134--2145 Justin Levandoski and David Lomet and Sudipta Sengupta and Ryan Stutsman and Rui Wang Multi-version range concurrency control in Deuteronomy . . . . . . . . . . . . . 2146--2157 Hao Li and Chee-Yong Chan and David Maier Query from examples: an iterative, data-driven approach to query construction . . . . . . . . . . . . . . 2158--2169 Sainyam Galhotra and Amitabha Bagchi and Srikanta Bedathur and Maya Ramanath and Vidit Jain Tracking the conductance of rapidly evolving topic-subgraphs . . . . . . . . 2170--2181 Manasi Vartak and Sajjadur Rahman and Samuel Madden and Aditya Parameswaran and Neoklis Polyzotis SeeDB: efficient data-driven visualization recommendations to support visual analytics . . . . . . . . . . . . 2182--2193 Disheng Qiu and Luciano Barbosa and Xin Luna Dong and Yanyan Shen and Divesh Srivastava Dexter: large-scale discovery and extraction of product specifications on the web . . . . . . . . . . . . . . . . 2194--2205
Qiang Huang and Jianlin Feng and Yikai Zhang and Qiong Fang and Wilfred Ng Query-aware locality-sensitive hashing for approximate nearest neighbor search 1--12 Wissam Khaouid and Marina Barsky and Venkatesh Srinivasan and Alex Thomo $K$-core decomposition of large networks on a single PC . . . . . . . . . . . . . 13--23 Zhenguo Li and Yixiang Fang and Qin Liu and Jiefeng Cheng and Reynold Cheng and John C. S. Lui Walking in the cloud: parallel SimRank at scale . . . . . . . . . . . . . . . . 24--35
Patricia C. Arocena and Boris Glavic and Giansalvatore Mecca and Renée J. Miller and Paolo Papotti and Donatello Santoro Messing up with BART: error generation for evaluating data-cleaning algorithms 36--47 Takanori Hayashi and Takuya Akiba and Yuichi Yoshida Fully dynamic betweenness centrality maintenance on massive networks . . . . 48--59 Wei Lu and Wei Chen and Laks V. S. Lakshmanan From competition to complementarity: comparative influence diffusion and maximization . . . . . . . . . . . . . . 60--71 Konstantinos Kloudas and Margarida Mamede and Nuno Preguiça and Rodrigo Rodrigues Pixida: optimizing data parallel jobs in wide-area data analytics . . . . . . . . 72--83
Lu Wang and Robert Christensen and Feifei Li and Ke Yi Spatial online sampling and aggregation 84--95 Stefan Richter and Victor Alvarez and Jens Dittrich A seven-dimensional analysis of hashing methods and its implications on query processing . . . . . . . . . . . . . . . 96--107 Patricia C. Arocena and Boris Glavic and Radu Ciucanu and Renée J. Miller The iBench integration metadata generator . . . . . . . . . . . . . . . 108--119 Hotham Altwaijry and Sharad Mehrotra and Dmitri V. Kalashnikov QuERy: a framework for integrating entity resolution with query processing 120--131 Taesung Lee and Jin-woo Park and Sanghoon Lee and Seung-Won Hwang and Sameh Elnikety and Yuxiong He Processing and optimizing main memory spatial-keyword queries . . . . . . . . 132--143 Yongjoo Park and Michael Cafarella and Barzan Mozafari Neighbor-sensitive hashing . . . . . . . 144--155 Botong Huang and Nicholas W. D. Jarrett and Shivnath Babu and Sayan Mukherjee and Jun Yang Cümülön: matrix-based data analytics in the cloud with spot instances . . . . . . . 156--167 Manohar Kaul and Raymond Chi-Wing Wong and Christian S. Jensen New lower and upper bounds for shortest distance queries on terrains . . . . . . 168--179 Cibele Freire and Wolfgang Gatterbauer and Neil Immerman and Alexandra Meliou The complexity of resilience and responsibility for self-join-free conjunctive queries . . . . . . . . . . 180--191 Hao Huang and Shiva Prasad Kasiviswanathan Streaming anomaly detection using randomized matrix sketching . . . . . . 192--203 Viktor Leis and Andrey Gubichev and Atanas Mirchev and Peter Boncz and Alfons Kemper and Thomas Neumann How good are query optimizers, really? 204--215 Matteo Interlandi and Kshitij Shah and Sai Deep Tetali and Muhammad Ali Gulzar and Seunghyun Yoo and Miryung Kim and Todd Millstein and Tyson Condie Titian: data provenance support in Spark 216--227
Wolf Rödiger and Tobias Mühlbauer and Alfons Kemper and Thomas Neumann High-speed query processing over high-speed networks . . . . . . . . . . 228--239 Bo Zong and Xusheng Xiao and Zhichun Li and Zhenyu Wu and Zhiyun Qian and Xifeng Yan and Ambuj K. Singh and Guofei Jiang Behavior query discovery in system-generated temporal graphs . . . . 240--251 Onur Kocberber and Babak Falsafi and Boris Grot Asynchronous memory access chaining . . 252--263 Samuel Haney and Ashwin Machanavajjhala and Bolin Ding Design of policy-aware differentially private algorithms . . . . . . . . . . . 264--275 Xin Huang and Laks V. S. Lakshmanan and Jeffrey Xu Yu and Hong Cheng Approximate closest community search in networks . . . . . . . . . . . . . . . . 276--287 Fabien André and Anne-Marie Kermarrec and Nicolas Le Scouarnec Cache locality is not enough: high-performance nearest neighbor search with product quantization fast scan . . 288--299 Nataliya Prokoshyna and Jaros\law Szlichta and Fei Chiang and Renée J. Miller and Divesh Srivastava Combining quantitative and logical data cleaning . . . . . . . . . . . . . . . . 300--311 George Papadakis and George Alexiou and George Papastefanatos and Georgia Koutrika Schema-agnostic vs schema-based configurations for blocking methods on homogeneous data . . . . . . . . . . . . 312--323 Alessandro Epasto and Silvio Lattanzi and Vahab Mirrokni and Ismail Oner Sebe and Ahmed Taei and Sunita Verma Ego-net community mining applied to friend suggestion . . . . . . . . . . . 324--335 Ziawasch Abedjan and Cuneyt G. Akcora and Mourad Ouzzani and Paolo Papotti and Michael Stonebraker Temporal rules discovery for web data cleaning . . . . . . . . . . . . . . . . 336--347 Sudeepa Roy and Laurel Orr and Dan Suciu Explaining query answers with explanation-ready databases . . . . . . 348--359 Dong Deng and Guoliang Li and He Wen and Jianhua Feng An efficient partition based method for exact set similarity joins . . . . . . . 360--371 Daniel Haas and Jiannan Wang and Eugene Wu and Michael J. Franklin CLAMShell: speeding up crowds for low-latency data labeling . . . . . . . 372--383
Donatella Firmani and Barna Saha and Divesh Srivastava Online entity resolution using an oracle 384--395 Marco Calautti and Sergio Greco and Cristian Molinaro and Irina Trubitsyna Exploiting equality generating dependencies in checking chase termination . . . . . . . . . . . . . . 396--407 Tong Yang and Alex X. Liu and Muhammad Shahzad and Yuankun Zhong and Qiaobin Fu and Zi Li and Gaogang Xie and Xiaoming Li A shifting Bloom filter framework for set queries . . . . . . . . . . . . . . 408--419 Fan Yang and Jinfeng Li and James Cheng Husky: towards a more efficient and expressive distributed computing framework . . . . . . . . . . . . . . . 420--431 Zeyu Li and Hongzhi Wang and Wei Shao and Jianzhong Li and Hong Gao Repairing data through regular expressions . . . . . . . . . . . . . . 432--443 Cong Yan and Alvin Cheung Leveraging lock contention to improve OLTP application performance . . . . . . 444--455
Farhana M. Choudhury and J. Shane Culpepper and Timos Sellis and Xin Cao Maximizing bichromatic reverse spatial and textual $k$ nearest neighbor queries 456--467 Julien Subercaze and Christophe Gravier and Jules Chevalier and Frederique Laforest Inferray: fast in-memory RDF inference 468--479 Darko Makreshanski and Georgios Giannikis and Gustavo Alonso and Donald Kossmann MQJoin: efficient shared execution of main-memory joins . . . . . . . . . . . 480--491 Tenindra Abeywickrama and Muhammad Aamir Cheema and David Taniar $k$-nearest neighbors on road networks: a journey in experimentation and in-memory implementation . . . . . . . . 492--503 Yuan Yuan and Kaibo Wang and Rubao Lee and Xiaoning Ding and Jing Xing and Spyros Blanas and Xiaodong Zhang BCC: reducing false aborts in optimistic concurrency control with low cost for in-memory databases . . . . . . . . . . 504--515
Long Yuan and Lu Qin and Xuemin Lin and Lijun Chang and Wenjie Zhang I/O efficient ECC graph decomposition via graph reduction . . . . . . . . . . 516--527 Carsten Binnig and Andrew Crotty and Alex Galakatos and Tim Kraska and Erfan Zamanian The end of slow networks: it's time for a redesign . . . . . . . . . . . . . . . 528--539 Jiewen Huang and Daniel J. Abadi Leopard: lightweight edge-oriented partitioning and replication for dynamic graphs . . . . . . . . . . . . . . . . . 540--551 Eric Gribkoff and Dan Suciu SlimShot: in-database probabilistic inference for knowledge bases . . . . . 552--563 Da Yan and James Cheng and M. Tamer Özsu and Fan Yang and Yi Lu and John C. S. Lui and Qizhen Zhang and Wilfred Ng A general-purpose query-centric framework for querying big graphs . . . 564--575 Matteo Brucato and Juan Felipe Beltran and Azza Abouzied and Alexandra Meliou Scalable package queries in relational database systems . . . . . . . . . . . . 576--587 Xiang Wang and Ying Zhang and Wenjie Zhang and Xuemin Lin and Zengfeng Huang Skype: top-$k$ spatial-keyword publish/subscribe over sliding window 588--599 Abolfazl Asudeh and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das Discovering the skyline of web databases 600--611
Xiaohang Zhang and Guoliang Li and Jianhua Feng Crowdsourced top-$k$ algorithms: an experimental evaluation . . . . . . . . 612--623
Michael Maddox and David Goehring and Aaron J. Elmore and Samuel Madden and Aditya Parameswaran and Amol Deshpande Decibel: the relational dataset branching system . . . . . . . . . . . . 624--635 Willi Mann and Nikolaus Augsten and Panagiotis Bouros An empirical evaluation of set similarity join techniques . . . . . . . 636--647 Immanuel Trummer and Christoph Koch Multiple query optimization on the D-Wave 2X adiabatic quantum computer . . 648--659 Immanuel Trummer and Christoph Koch Parallelizing query optimization on shared-nothing architectures . . . . . . 660--671 Vasiliki Kalavri and Tiago Simas and Dionysios Logothetis The shortest path is not always a straight line: leveraging semi-metricity in graph analysis . . . . . . . . . . . 672--683 George Papadakis and Jonathan Svirsky and Avigdor Gal and Themis Palpanas Comparative analysis of approximate blocking techniques for entity resolution . . . . . . . . . . . . . . . 684--695 Yiran Zhao and Shen Li and Shaohan Hu and Hongwei Wang and Shuochao Yao and Huajie Shao and Tarek Abdelzaher An experimental evaluation of datacenter workloads on low-power embedded micro servers . . . . . . . . . . . . . . . . 696--707
Shaoxu Song and Yue Cao and Jianmin Wang Cleaning timestamps with temporal constraints . . . . . . . . . . . . . . 708--719 Zilong Tan and Shivnath Babu Tempo: robust and self-tuning resource management in multi-tenant parallel databases . . . . . . . . . . . . . . . 720--731 Jonny Daenen and Frank Neven and Tony Tan and Stijn Vansummeren Parallel evaluation of multi-semi-joins 732--743 Jianfei Chen and Kaiwei Li and Jun Zhu and Wenguang Chen WarpLDA: a cache efficient O(1) algorithm for latent Dirichlet allocation . . . . . . . . . . . . . . . 744--755 Marius Eich and Pit Fender and Guido Moerkotte Faster plan generation through consideration of functional dependencies and keys . . . . . . . . . . . . . . . . 756--767 Felix Martin Schuhknecht and Jens Dittrich and Ankur Sharma RUMA has it: rewired user-space memory access is possible! . . . . . . . . . . 768--779 Ryan Marcus and Olga Papaemmanouil WiSeDB: a learning-based workload management advisor for cloud databases 780--791 Gianmarco De Francisci Morales and Aristides Gionis Streaming similarity self-join . . . . . 792--803 Alexander Schätzle and Martin Przyjaciel-Zablocki and Simon Skilevic and Georg Lausen S2RDF: RDF querying with SPARQL on spark 804--815 Rishabh Singh BlinkFill: semi-supervised programming by example for syntactic string transformations . . . . . . . . . . . . 816--827 Dong Deng and Guoliang Li and He Wen and H. V. Jagadish and Jianhua Feng META: an efficient matching-based method for error-tolerant autocompletion . . . 828--839
Weiguo Zheng and Lei Zou and Wei Peng and Xifeng Yan and Shaoxu Song and Dongyan Zhao Semantic SPARQL similarity search over RDF knowledge graphs . . . . . . . . . . 840--851 Ayush Dubey and Greg D. Hill and Robert Escriva and Emin Gün Sirer Weaver: a high-performance, transactional graph database based on refinable timestamps . . . . . . . . . . 852--863 Xu Chu and Ihab F. Ilyas and Paraschos Koutris Distributed data deduplication . . . . . 864--875 Marcelo Arenas and Francisco Maturana and Cristian Riveros and Domagoj Vrgoc A framework for annotating CSV-like data 876--887 Abolfazl Asudeh and Nan Zhang and Gautam Das Query reranking as a service . . . . . . 888--899
Hongbin Ma and Bin Shao and Yanghua Xiao and Liang Jeff Chen and Haixun Wang G-SQL: fast query processing via graph exploration . . . . . . . . . . . . . . 900--911 Mingxing Zhang and Yongwei Wu and Kang Chen and Teng Ma and Weimin Zheng Measuring and optimizing distributed array programs . . . . . . . . . . . . . 912--923 Insoon Jo and Duck-Ho Bae and Andre S. Yoon and Jeong-Uk Kang and Sangyeun Cho and Daniel D. G. Lee and Jaeheon Jeong YourSQL: a high-performance database system leveraging in-storage computing 924--935 Lu Lu and Xuanhua Shi and Yongluan Zhou and Xiong Zhang and Hai Jin and Cheng Pei and Ligang He and Yuanzhen Geng Lifetime-based memory management for distributed data processing systems . . 936--947 Sanjay Krishnan and Jiannan Wang and Eugene Wu and Michael J. Franklin and Ken Goldberg ActiveClean: interactive data cleaning for statistical modeling . . . . . . . . 948--959 Ahmed Elgohary and Matthias Boehm and Peter J. Haas and Frederick R. Reiss and Berthold Reinwald Compressed linear algebra for large-scale machine learning . . . . . . 960--971 Manos Karpathiotakis and Ioannis Alagiannis and Anastasia Ailamaki Fast queries over heterogeneous data through engine customization . . . . . . 972--983 Sourav S. Bhowmick and Byron Choi and Curtis Dyreson Data-driven visual graph query interface construction and maintenance: challenges and opportunities . . . . . . . . . . . 984--992 Ziawasch Abedjan and Xu Chu and Dong Deng and Raul Castro Fernandez and Ihab F. Ilyas and Mourad Ouzzani and Paolo Papotti and Michael Stonebraker and Nan Tang Detecting data errors: where are we and what needs to be done? . . . . . . . . . 993--1004 Hai Liu and Dongqing Xiao and Pankaj Didwania and Mohamed Y. Eltabakh Exploiting soft and hard correlations in big data query optimization . . . . . . 1005--1016 Minsuk Kahng and Shamkant B. Navathe and John T. Stasko and Duen Horng Polo Chau Interactive browsing and navigation in relational databases . . . . . . . . . . 1017--1028 Renata Borovica-Gaji\'c and Raja Appuswamy and Anastasia Ailamaki Cheap data analytics using cold storage devices . . . . . . . . . . . . . . . . 1029--1040 Julian Shun and Farbod Roosta-Khorasani and Kimon Fountoulakis and Michael W. Mahoney Parallel local graph clustering . . . . 1041--1052 Yongxin Tong and Jieying She and Bolin Ding and Lei Chen and Tianyu Wo and Ke Xu Online minimum matching in real-time spatial data: experiments and analysis 1053--1064 Robert Brunel and Norman May and Alfons Kemper Index-assisted hierarchical computations in main-memory RDBMS . . . . . . . . . . 1065--1076 Naoto Ohsaka and Takuya Akiba and Yuichi Yoshida and Ken-ichi Kawarabayashi Dynamic influence analysis in evolving networks . . . . . . . . . . . . . . . . 1077--1088 Luan Tran and Liyue Fan and Cyrus Shahabi Distance-based outlier detection in data streams . . . . . . . . . . . . . . . . 1089--1100 Michalis Mountantonakis and Yannis Tzitzikas On measuring the lattice of commonalities among several linked datasets . . . . . . . . . . . . . . . . 1101--1112 Zhao Chang and Dong Xie and Feifei Li Oblivious RAM: a dissection and experimental evaluation . . . . . . . . 1113--1124 Fisnik Kastrati and Guido Moerkotte Optimization of conjunctive predicates for main memory column stores . . . . . 1125--1136 Zaheer Chothia and John Liagouris and Frank McSherry and Timothy Roscoe Explaining outputs in modern data analytics . . . . . . . . . . . . . . . 1137--1148 Peter Buneman and Slawek Staworko RDF graph alignment with bisimulation 1149--1160 Damian Bursztyn and François Goasdoué and Ioana Manolescu Teaching an RDBMS about ontological constraints . . . . . . . . . . . . . . 1161--1172 Giovanni Simonini and Sonia Bergamaschi and H. V. Jagadish BLAST: a loosely schema-aware meta-blocking approach for entity resolution . . . . . . . . . . . . . . . 1173--1184 Erkang Zhu and Fatemeh Nargesian and Ken Q. Pu and Renée J. Miller LSH ensemble: Internet-scale domain search . . . . . . . . . . . . . . . . . 1185--1196 Pradap Konda and Sanjib Das and Paul Suganthan G. C. and AnHai Doan and Adel Ardalan and Jeffrey R. Ballard and Han Li and Fatemah Panahi and Haojun Zhang and Jeff Naughton and Shishir Prasad and Ganesh Krishnan and Rohit Deep and Vijay Raghavendra Magellan: toward building entity matching management systems . . . . . . 1197--1208 Diptikalyan Saha and Avrilia Floratou and Karthik Sankaranarayanan and Umar Farooq Minhas and Ashish R. Mittal and Fatma Özcan ATHENA: an ontology-driven system for natural language querying over relational data stores . . . . . . . . . 1209--1220 Richard Wesley and Fei Xu Incremental computation of common windowed holistic aggregates . . . . . . 1221--1232 Yixiang Fang and Reynold Cheng and Siqiang Luo and Jiafeng Hu Effective community search for large attributed graphs . . . . . . . . . . . 1233--1244
Willis Lang and Karthik Ramachandra and David J. DeWitt and Shize Xu and Qun Guo and Ajay Kalhan and Peter Carlin Not for the timid: on the impact of aggressive over-booking in the cloud . . 1245--1256 Martin Sevenich and Sungpack Hong and Oskar van Rest and Zhe Wu and Jayanta Banerjee and Hassan Chafi Using domain-specific languages for analytic graph databases . . . . . . . . 1257--1268 Shaosu Liu and Bin Song and Sriharsha Gangam and Lawrence Lo and Khaled Elmeleegy Kodiak: leveraging materialized views for very low-latency analytics over high-dimensional web-scale data . . . . 1269--1280 Aneesh Sharma and Jerry Jiang and Praveen Bommannavar and Brian Larson and Jimmy Lin GraphJet: real-time content recommendations at Twitter . . . . . . . 1281--1292 Edward Ma and Vishrut Gupta and Meichun Hsu and Indrajit Roy \pkgdmapply: a functional primitive to express distributed machine learning algorithms in R . . . . . . . . . . . . 1293--1304 Pedro Pedreira and Chris Croswhite and Luis Bona Cubrick: indexing millions of records per second for interactive analytics . . 1305--1316 Alexandru Iosup and Tim Hegeman and Wing Lung Ngai and Stijn Heldens and Arnau Prat-Pérez and Thomas Manhardto and Hassan Chafio and Mihai Capota and Narayanan Sundaram and Michael Anderson and Ilie Gabriel Tanase and Yinglong Xia and Lifeng Nai and Peter Boncz LDBC graphalytics: a benchmark for large-scale graph analysis on parallel and distributed platforms . . . . . . . 1317--1328 Hermano Lustosa and Fabio Porto and Patrick Valduriez and Pablo Blanco Database system support of simulation data . . . . . . . . . . . . . . . . . . 1329--1340 Gabriela Jacques-Silva and Fang Zheng and Daniel Debrunner and Kun-Lung Wu and Victor Dogaru and Eric Johnson and Michael Spicer and Ahmet Erdem Sariyüce Consistent regions: guaranteed tuple processing in IBM streams . . . . . . . 1341--1352 Mohammed Al-Kateb and Paul Sinclair and Grace Au and Carrie Ballinger Hybrid row-column partitioning in Teradata\reg . . . . . . . . . . . . . . 1353--1364 Ricardo Fernandes and Piotr Zaczkowski and Bernd Göttler and Conor Ettinoffe and Anis Moussa TrafficDB: HERE's high performance shared-memory data store . . . . . . . . 1365--1376 Alex Scotti and Mark Hannum and Michael Ponomarenko and Dorin Hogea and Akshat Sikarwar and Mohit Khullar and Adi Zaimi and James Leddy and Rivers Zhang and Fabio Angius and Lingzhi Deng Comdb2: Bloomberg's highly available relational database system . . . . . . . 1377--1388 V. Srinivasan and Brian Bulkowski and Wei-Ling Chu and Sunil Sayyaparaju and Andrew Gooding and Rajkumar Iyer and Ashish Shinde and Thomas Lopatic Aerospike: architecture of a real-time operational DBMS . . . . . . . . . . . . 1389--1400 Jack Chen and Samir Jindel and Robert Walzer and Rajkumar Sen and Nika Jimsheleishvilli and Michael Andrews The MemSQL query optimizer: a modern optimizer for real-time analytics in a distributed database . . . . . . . . . . 1401--1412 Sarath Lakshman and Sriram Melkote and John Liang and Ravi Mayuram Nitro: a fast, scalable in-memory storage engine for NoSQL global secondary index . . . . . . . . . . . . 1413--1424 Matthias Boehm and Michael W. Dusenberry and Deron Eriksson and Alexandre V. Evfimievski and Faraz Makari Manshadi and Niketan Pansare and Berthold Reinwald and Frederick R. Reiss and Prithviraj Sen and Arvind C. Surve and Shirish Tatikonda SystemML: declarative machine learning on spark . . . . . . . . . . . . . . . . 1425--1436 Aurosish Mishra and Shasank Chavan and Allison Holloway and Tirthankar Lahiri and Zhen Hua Liu and Sunil Chakkappen and Dennis Lui and Vinita Subramanian and Ramesh Kumar and Maria Colgan and Jesse Kamp and Niloy Mukherjee and Vineet Marwah Accelerating analytics with dynamic in-memory expressions . . . . . . . . . 1437--1448 Satyajit Bhadange and Akhil Arora and Arnab Bhattacharya GARUDA: a system for large-scale mining of statistically significant connected subgraphs . . . . . . . . . . . . . . . 1449--1452 Huan Li and Hua Lu and Xin Chen and Gang Chen and Ke Chen and Lidan Shou Vita: a versatile toolkit for generating indoor mobility data for real-world buildings . . . . . . . . . . . . . . . 1453--1456 Guillaume Bagan and Angela Bonifati and Radu Ciucanu and George H. L. Fletcher and Aurélien Lemay and Nicky Advokaat Generating flexible workloads for graph databases . . . . . . . . . . . . . . . 1457--1460 Xiaofeng Zhou and Yang Chen and Daisy Zhe Wang ArchimedesOne: query processing over probabilistic knowledge bases . . . . . 1461--1464 Tova Milo and Slava Novgorodov and Wang-Chiew Tan Rudolf: interactive rule refinement system for fraud detection . . . . . . . 1465--1468 Antonio Maccioni and Matteo Collina Graph databases in the browser: using LevelGraph to explore New Delhi . . . . 1469--1472 Thibault Sellam and Martin Kersten Ziggy: characterizing query results for data explorers . . . . . . . . . . . . . 1473--1476 Thibault Sellam and Robin Cijvat and Richard Koopmanschap and Martin Kersten Blaeu: mapping and navigating large tables with cluster analysis . . . . . . 1477--1480 Ahmed El-Roby and Khaled Ammar and Ashraf Aboulnaga and Jimmy Lin Sapphire: querying RDF data made simple 1481--1484 Yael Amsterdamer and Tova Milo and Amit Somech and Brit Youngmann December: a declarative tool for crowd member selection . . . . . . . . . . . . 1485--1488 Xi He and Nisarg Raval and Ashwin Machanavajjhala A demonstration of VisDPT: visual exploration of differentially private trajectories . . . . . . . . . . . . . . 1489--1492 Tobias Scheuer and Norman May and Alexander Böhm and Daniel Scheibli JexLog: a sonar for the abyss . . . . . 1493--1496 Kosetsu Ikeda and Atsuyuki Morishima and Habibur Rahman and Senjuti Basu Roy and Saravanan Thirumuruganathan and Sihem Amer-Yahia and Gautam Das Collaborative crowdsourcing with Crowd4u 1497--1500 Lei Chen and Jianliang Xu and Christian S. Jensen and Yafei Li YASK: a why-not question answering engine for spatial keyword query services . . . . . . . . . . . . . . . . 1501--1504 Peipei Yi and Byron Choi and Sourav S. Bhowmick and Jianliang Xu AutoG: a visual query autocompletion framework for graph databases . . . . . 1505--1508 Xiaoye Miao and Yunjun Gao and Gang Chen and Huiyong Cui and Chong Guo and Weida Pan Si$^2$ p: a restaurant recommendation system using preference queries over incomplete information . . . . . . . . . 1509--1512 R. Bonaque and T. D. Cao and B. Cautis and F. Goasdoué and J. Letelier and I. Manolescu and O. Mendoza and S. Ribeiro and X. Tannier Mixed-instance querying: a lightweight integration architecture for data journalism . . . . . . . . . . . . . . . 1513--1516 Dennis Butterstein and Torsten Grust Precision performance surgery for CostgreSQL: LLVM-based Expression Compilation, Just in Time . . . . . . . 1517--1520 Mohamed Yahya and Klaus Berberich and Maya Ramanath and Gerhard Weikum Exploratory querying of extended knowledge graphs . . . . . . . . . . . . 1521--1524 Kiril Panev and Sebastian Michel and Evica Milchevski and Koninika Pal Exploring databases via reverse engineering ranking queries with PALEO 1525--1528 Ramon Bespinyowong and Wei Chen and H. V. Jagadish and Yuxin Ma ExRank: an exploratory ranking interface 1529--1532 Gonzalo Diaz and Marcelo Arenas and Michael Benedikt SPARQLByE: querying RDF data by example 1533--1536 Daniel Deutch and Nave Frost and Amir Gilad NLProv: natural language provenance . . 1537--1540 Bikash Chandra and Mathew Joseph and Bharath Radhakrishnan and Shreevidhya Acharya and S. Sudarshan Partial marking for automated grading of SQL queries . . . . . . . . . . . . . . 1541--1544 Kaiqi Zhao and Yiding Liu and Quan Yuan and Lisi Chen and Zhida Chen and Gao Cong Towards personalized maps: mining user preferences from geo-textual data . . . 1545--1548 Kaiyu Feng and Kaiqi Zhao and Yiding Liu and Gao Cong A system for region search and exploration . . . . . . . . . . . . . . 1549--1552 Aleksandar Vitorovic and Mohammed Elseidy and Khayyam Guliyev and Khue Vu Minh and Daniel Espino and Mohammad Dashti and Yannis Klonatos and Christoph Koch Squall: scalable real-time analytics . . 1553--1556 Udayan Khurana and Srinivasan Parthasarathy and Deepak Turaga Graph-based exploration of non-graph datasets . . . . . . . . . . . . . . . . 1557--1560 Minjian Liu and Qing Wang Rogas: a declarative framework for network analytics . . . . . . . . . . . 1561--1564 Mingjie Tang and Yongyang Yu and Qutaibah M. Malluhi and Mourad Ouzzani and Walid G. Aref LocationSpark: a distributed in-memory data management system for big spatial data . . . . . . . . . . . . . . . . . . 1565--1568 Anil Shanbhag and Alekh Jindal and Yi Lu and Samuel Madden Amoeba: a shape changing storage system for big data . . . . . . . . . . . . . . 1569--1572 Dan Olteanu and Maximilian Schleich F: regression models over factorized views . . . . . . . . . . . . . . . . . 1573--1576 Miguel Rodríguez and Sean Goldberg and Daisy Zhe Wang SigmaKB: multiple probabilistic knowledge base fusion . . . . . . . . . 1577--1580 Pradap Konda and Sanjib Das and Paul Suganthan G. C. and AnHai Doan and Adel Ardalan and Jeffrey R. Ballard and Han Li and Fatemah Panahi and Haojun Zhang and Jeff Naughton and Shishir Prasad and Ganesh Krishnan and Rohit Deep and Vijay Raghavendra Magellan: toward building entity matching management systems over data science stacks . . . . . . . . . . . . . 1581--1584 Wail Y. Alkowaileet and Sattam Alsubaiee and Michael J. Carey and Till Westmann and Yingyi Bu Large-scale complex analytics on semi-structured datasets using AsterixDB and Spark . . . . . . . . . . . . . . . 1585--1588 Jose Picado and Parisa Ataei and Arash Termehchy and Alan Fern Schema independent and scalable relational learning by Castor . . . . . 1589--1592 Rajeshkumar Kannapalli and Azade Nazi and Mahashweta Das and Gautam Das AD-WIRE: add-on for Web item reviewing system . . . . . . . . . . . . . . . . . 1593--1596 Vineet Chaoji and Rajeev Rastogi and Gourav Roy Machine learning in the real world . . . 1597--1600 Alexander Böhm and Jens Dittrich and Niloy Mukherjee and Ippokratis Pandis and Rajkumar Sen Operational analytics data management systems . . . . . . . . . . . . . . . . 1601--1604 Xu Chu and Ihab F. Ilyas Qualitative data cleaning . . . . . . . 1605--1608 Per-Åke Larson and Justin Levandoski Modern main-memory database systems . . 1609--1610 Ashwin Machanavajjhala and Xi He and Michael Hay Differential privacy in the wild: a tutorial on current practices & open challenges . . . . . . . . . . . . . . . 1611--1614 Sihem Amer-Yahia and Senjuti Basu Roy Human factors in crowdsourcing . . . . . 1615--1618 Ion Stoica Trends and challenges in big data processing . . . . . . . . . . . . . . . 1619--1619 Anand Rajaraman Data-driven disruption: the view from Silicon Valley . . . . . . . . . . . . . 1620--1620 Xin Luna Dong Leave no valuable data behind: the crazy ideas and the business . . . . . . . . . 1621--1621 Mohamed Mokbel and Chi-Yin Chow and Walid Aref Location data management: a tale of two systems and the ``next destination''! 1622--1622
Badrish Chandramouli and Raul Castro Fernandez and Jonathan Goldstein and Ahmed Eldawy and Abdul Quamar Quill: efficient, transferable, and rich analytics at scale . . . . . . . . . . . 1623--1634 Brett Walenz and Jun Yang Perturbation analysis of database queries . . . . . . . . . . . . . . . . 1635--1646 Jing Li and Hung-Wei Tseng and Chunbin Lin and Yannis Papakonstantinou and Steven Swanson HippogriffDB: balancing I/O and GPU bandwidth in big data analytics . . . . 1647--1658 Steffen Zeuch and Holger Pirk and Johann-Christoph Freytag Non-invasive progressive optimization for in-memory databases . . . . . . . . 1659--1670 J. W. Zhang and Y. C. Tay Dscaler: synthetically scaling a given relational database . . . . . . . . . . 1671--1682 Sheng Wang and David Maier and Beng Chin Ooi Fast and adaptive indexing of multi-dimensional observational data . . 1683--1694 Prasang Upadhyaya and Magdalena Balazinska and Dan Suciu Price-optimal querying with data APIs 1695--1706 Holger Pirk and Oscar Moll and Matei Zaharia and Sam Madden Voodoo --- a vector algebra for portable database performance on modern hardware 1707--1718
Dawei Jiang and Qingchao Cai and Gang Chen and H. V. Jagadish and Beng Chin Ooi and Kian-Lee Tan and Anthony K. H. Tung Cohort query processing . . . . . . . . 1--12 Yubao Wu and Yuchen Bian and Xiang Zhang Remember where you came from: on the second-order random walk based proximity measures . . . . . . . . . . . . . . . . 13--24 Lars George and Bruno Cadonna and Matthias Weidlich IL-Miner: instance-level discovery of complex event patterns . . . . . . . . . 25--36
Iraklis Psaroudakis and Tobias Scheuer and Norman May and Abdelkader Sellami and Anastasia Ailamaki Adaptive NUMA-aware data placement and task scheduling for analytical workloads in main-memory column-stores . . . . . . 37--48 Tianzheng Wang and Hideaki Kimura Mostly-optimistic concurrency control for highly contended dynamic workloads on a thousand cores . . . . . . . . . . 49--60 Sibo Wang and Xiaokui Xiao and Yin Yang and Wenqing Lin Effective indexing for approximate constrained shortest path queries on large road networks . . . . . . . . . . 61--72
Qun Huang and Patrick P. C. Lee Toward high-performance distributed stream processing via approximate fault tolerance . . . . . . . . . . . . . . . 73--84 Jian Dai and Bin Yang and Chenjuan Guo and Christian S. Jensen and Jilin Hu Path cost distribution estimation using trajectory data . . . . . . . . . . . . 85--96 Ahmet Erdem Sariyüce and Ali Pinar Fast hierarchy construction for dense subgraphs . . . . . . . . . . . . . . . 97--108 Xuhong Zhang and Jun Wang and Jiangling Yin Sapprox: enabling efficient and accurate approximations on sub-datasets with distribution-aware online sampling . . . 109--120 Xuguang Ren and Junhu Wang Multi-query optimization for subgraph isomorphism search . . . . . . . . . . . 121--132 Michael Simpson and Venkatesh Srinivasan and Alex Thomo Efficient computation of feedback arc set at web-scale . . . . . . . . . . . . 133--144 Dolan Antenucci and Michael R. Anderson and Michael Cafarella A declarative query processing system for nowcasting . . . . . . . . . . . . . 145--156 Alessandro Lulli and Matteo Dell'Amico and Pietro Michiardi and Laura Ricci NG-DBSCAN: scalable density-based clustering for arbitrary data . . . . . 157--168 Rodica Neamtu and Ramoza Ahsan and Elke Rundensteiner and Gabor Sarkozy Interactive time series exploration powered by the marriage of similarity distances . . . . . . . . . . . . . . . 169--180 Youhuan Li and Lei Zou and Huaming Zhang and Dongyan Zhao Computing longest increasing subsequences over sequential data streams . . . . . . . . . . . . . . . . 181--192 Fernando Chirigati and Jialu Liu and Flip Korn and You (Will) Wu and Cong Yu and Hao Zhang Knowledge exploration using tables on the web . . . . . . . . . . . . . . . . 193--204 Sibo Wang and Youze Tang and Xiaokui Xiao and Yin Yang and Zengxiang Li HubPPR: effective indexing for approximate Personalized Pagerank . . . 205--216 Longbin Lai and Lu Qin and Xuemin Lin and Ying Zhang and Lijun Chang and Shiyu Yang Scalable distributed subgraph enumeration . . . . . . . . . . . . . . 217--228 Yasuhiro Fujiwara and Yasutoshi Ida and Junya Arai and Mai Nishimura and Sotetsu Iwamura Fast algorithm for the lasso based $ L_1 $-graph construction . . . . . . . . . . 229--240 Ennan Zhai and Zhenhua Li and Zhenyu Li and Fan Wu and Guihai Chen Resisting tag spam by leveraging implicit user behaviors . . . . . . . . 241--252 Xiaowei Chen and Yongkun Li and Pinghui Wang and John C. S. Lui A general framework for estimating graphlet statistics via random walk . . 253--264 Chunbin Lin and Benjamin Mandel and Yannis Papakonstantinou and Matthias Springer Fast in-memory SQL analytics on typed graphs . . . . . . . . . . . . . . . . . 265--276 Zheng Li and Tingjian Ge Stochastic data acquisition for answering queries as time goes by . . . 277--288
Haipeng Dai and Muhammad Shahzad and Alex X. Liu and Yuankun Zhong Finding persistent items in data streams 289--300 Shuotao Xu and Sungjin Lee and Sang-Woo Jun and Ming Liu and Jamey Hicks and Arvind Bluecache: a scalable distributed flash-based key--value store . . . . . . 301--312 Qi Fan and Dongxiang Zhang and Huayu Wu and Kian-Lee Tan A general and parallel platform for mining co-movement patterns over large-scale trajectories . . . . . . . . 313--324 Zhou Shao and Muhammad Aamir Cheema and David Taniar and Hua Lu VIP-Tree: an effective index for indoor spatial queries . . . . . . . . . . . . 325--336 Joy Arulraj and Matthew Perron and Andrew Pavlo Write-behind logging . . . . . . . . . . 337--348 Stavros Papadopoulos and Kushal Datta and Samuel Madden and Timothy Mattson The TileDB array data storage manager 349--360 Yudian Zheng and Guoliang Li and Reynold Cheng DOCS: a domain-aware crowdsourcing system using knowledge bases . . . . . . 361--372 Yue Wang and Alexandra Meliou and Gerome Miklau Lifting the haze off the cloud: a consumer-centric market for database computation in the cloud . . . . . . . . 373--384 Jia Yu and Mohamed Sarwat Two birds, one stone: a fast, yet lightweight, indexing scheme for modern database systems . . . . . . . . . . . . 385--396 Zheng Li and Tingjian Ge History is a mirror to the future: best-effort approximate complex event matching with insufficient resources . . 397--408 Hillel Avni and Trevor Brown Persistent hybrid transactional memory for databases . . . . . . . . . . . . . 409--420 Liwen Sun and Michael J. Franklin and Jiannan Wang and Eugene Wu Skipping-oriented partitioning for columnar layouts . . . . . . . . . . . . 421--432 Sneha Aman Singh and Divesh Srivastava and Srikanta Tirthapura Estimating quantiles from the union of historical and streaming data . . . . . 433--444 Marco Serafini and Rebecca Taft and Aaron J. Elmore and Andrew Pavlo and Ashraf Aboulnaga and Michael Stonebraker Clay: fine-grained adaptive partitioning for general database schemas . . . . . . 445--456 Tarique Siddiqui and Albert Kim and John Lee and Karrie Karahalios and Aditya Parameswaran Effortless data exploration with zenvisage: an expressive and interactive visual analytics system . . . . . . . . 457--468
Matteo Ceccarello and Andrea Pietracaprina and Geppino Pucci and Eli Upfal MapReduce and streaming algorithms for diversity maximization in metric spaces of bounded doubling dimension . . . . . 469--480 Vincent Bindschaedler and Reza Shokri and Carl A. Gunter Plausible deniability for privacy-preserving data synthesis . . . 481--492 Shiv Verma and Luke M. Leslie and Yosub Shin and Indranil Gupta An experimental comparison of partitioning strategies in distributed graph processing . . . . . . . . . . . . 493--504 Badrish Chandramouli and Jonathan Goldstein Shrink: prescribing resiliency solutions for streaming . . . . . . . . . . . . . 505--516 Claude Barthels and Ingo Müller and Timo Schneider and Gustavo Alonso and Torsten Hoefler Distributed join algorithms on thousands of cores . . . . . . . . . . . . . . . . 517--528 Junling Liu and Ke Deng and Huanliang Sun and Yu Ge and Xiaofang Zhou and Christian S. Jensen Clue-based spatio-textual query . . . . 529--540 Yudian Zheng and Guoliang Li and Yuanbing Li and Caihua Shan and Reynold Cheng Truth inference in crowdsourcing: is the problem solved? . . . . . . . . . . . . 541--552 Rachael Harding and Dana Van Aken and Andrew Pavlo and Michael Stonebraker An evaluation of distributed concurrency control . . . . . . . . . . . . . . . . 553--564 Wanyun Cui and Yanghua Xiao and Haixun Wang and Yangqiu Song and Seung-won Hwang and Wei Wang KBQA: learning question answering over QA corpora and knowledge bases . . . . . 565--576 Daniel Deutch and Nave Frost and Amir Gilad Provenance for natural language queries 577--588 Yi Lu and Anil Shanbhag and Alekh Jindal and Samuel Madden AdaptDB: adaptive partitioning for distributed joins . . . . . . . . . . . 589--600 Zhipeng Zhang and Yingxia Shao and Bin Cui and Ce Zhang An experimental evaluation of SimRank-based similarity search algorithms . . . . . . . . . . . . . . . 601--612 Jose M. Faleiro and Daniel J. Abadi and Joseph M. Hellerstein High performance transactions via early write visibility . . . . . . . . . . . . 613--624 Dhivya Eswaran and Stephan Günnemann and Christos Faloutsos and Disha Makhija and Mohit Kumar ZooBP: belief propagation for heterogeneous networks . . . . . . . . . 625--636
Min Lyu and Dong Su and Ninghui Li Understanding the sparse vector technique for differential privacy . . . 637--648 Fan Zhang and Wenjie Zhang and Ying Zhang and Lu Qin and Xuemin Lin OLAK: an efficient algorithm to prevent unraveling in social networks . . . . . 649--660 Meraj Khan and Larry Xu and Arnab Nandi and Joseph M. Hellerstein Data tweening: incremental visualization of data transforms . . . . . . . . . . . 661--672 Johes Bater and Gregory Elliott and Craig Eggen and Satyender Goel and Abel Kho and Jennie Rogers SMCQL: secure querying for federated databases . . . . . . . . . . . . . . . 673--684 Erfan Zamanian and Carsten Binnig and Tim Harris and Tim Kraska The end of a myth: distributed transactions can scale . . . . . . . . . 685--696 Haohan Zhu and Xianrui Meng and George Kollios NED: an inter-graph node metric based on edit distance . . . . . . . . . . . . . 697--708 Yixiang Fang and Reynold Cheng and Xiaodong Li and Siqiang Luo and Jiafeng Hu Effective community search over large spatial graphs . . . . . . . . . . . . . 709--720
Jaros\law Szlichta and Parke Godfrey and Lukasz Golab and Mehdi Kargar and Divesh Srivastava Effective and complete discovery of order dependencies via set-based axiomatization . . . . . . . . . . . . . 721--732 Tomas Karnagel and Dirk Habich and Wolfgang Lehner Adaptive work placement for query processing on heterogeneous computing resources . . . . . . . . . . . . . . . 733--744 Fan Yang and Fanhua Shang and Yuzhen Huang and James Cheng and Jinfeng Li and Yunjian Zhao and Ruihao Zhao LFTF: a framework for efficient tensor analytics at scale . . . . . . . . . . . 745--756 Shalmoli Gupta and Ravi Kumar and Kefu Lu and Benjamin Moseley and Sergei Vassilvitskii Local search methods for $k$-means with outliers . . . . . . . . . . . . . . . . 757--768 Guillaume Casanova and Elias Englmeier and Michael E. Houle and Peer Kröger and Michael Nett and Erich Schubert and Arthur Zimek Dimensional testing for reverse $k$-nearest neighbor search . . . . . . 769--780 Yingjun Wu and Joy Arulraj and Jiexi Lin and Ran Xian and Andrew Pavlo An empirical evaluation of in-memory multi-version concurrency control . . . 781--792 You Wu and Junyang Gao and Pankaj K. Agarwal and Jun Yang Finding diverse, high-value representatives on a surface of answers 793--804 Yanhao Wang and Qi Fan and Yuchen Li and Kian-Lee Tan Real-time influence maximization on dynamic social streams . . . . . . . . . 805--816 Hongyun Cai and Vincent W. Zheng and Fanwei Zhu and Kevin Chen-Chuan Chang and Zi Huang From community detection to community profiling . . . . . . . . . . . . . . . 817--828 Ayush Jain and Akash Das Sarma and Aditya Parameswaran and Jennifer Widom Understanding workers, developing effective tasks, and enhancing marketplace dynamics: a study of a large crowdsourcing marketplace . . . . . . . 829--840 Xuelian Lin and Shuai Ma and Han Zhang and Tianyu Wo and Jinpeng Huai One-pass error bounded trajectory simplification . . . . . . . . . . . . . 841--852
Jianguo Wang and Chunbin Lin and Ruining He and Moojin Chae and Yannis Papakonstantinou and Steven Swanson MILC: inverted list compression in memory . . . . . . . . . . . . . . . . . 853--864 Botong Huang and Jun Yang Cümülön--D: data analytics in a dynamic spot market . . . . . . . . . . . . . . 865--876 Manuel Then and Timo Kersten and Stephan Günnemann and Alfons Kemper and Thomas Neumann Automatic algorithm transformation for efficient multi-snapshot analytics on temporal graphs . . . . . . . . . . . . 877--888 Jianqiao Zhu and Navneet Potti and Saket Saurabh and Jignesh M. Patel Looking ahead makes query plans robust: making the initial case with in-memory star schema data warehouse workloads . . 889--900 Michael Anderson and Shaden Smith and Narayanan Sundaram and Mihai Capota and Zheguang Zhao and Subramanya Dulloor and Nadathur Satish and Theodore L. Willke Bridging the gap between HPC and big data frameworks . . . . . . . . . . . . 901--912
Keke Huang and Sibo Wang and Glenn Bevilacqua and Xiaokui Xiao and Laks V. S. Lakshmanan Revisiting the stop-and-stare algorithms for influence maximization . . . . . . . 913--924 Xubo Wang and Lu Qin and Xuemin Lin and Ying Zhang and Lijun Chang Leveraging set relations in exact set similarity join . . . . . . . . . . . . 925--936 Minhao Jiang and Ada Wai-Chee Fu and Raymond Chi-Wing Wong READS: a random walk approach for efficient and accurate dynamic SimRank 937--948 Xin Huang and Laks V. S. Lakshmanan Attribute-driven community search . . . 949--960 Jiecao Chen and Qin Zhang Bias-aware sketches . . . . . . . . . . 961--972 Yang Cao and Wenfei Fan Data driven approximation with bounded resources . . . . . . . . . . . . . . . 973--984 Zuhair Khayyat and William Lucia and Meghna Singh and Mourad Ouzzani and Paolo Papotti and Jorge-Arnulfo Quiané-Ruiz and Nan Tang and Panos Kalnis Errata for ``Lightning Fast and Space Efficient Inequality Joins'' (PVLDB 8(13): 2074--2085) . . . . . . . . . . . 985--985
Chengjie Qin and Martin Torres and Florin Rusu Scalable asynchronous gradient descent optimization for out-of-core models . . 986--997 Fan Zhang and Ying Zhang and Lu Qin and Wenjie Zhang and Xuemin Lin When engagement meets similarity: efficient $ (k, r)$-core computation on social networks . . . . . . . . . . . . 998--1009 Yiding Liu and Tuan-Anh Nguyen Pham and Gao Cong and Quan Yuan An experimental evaluation of point-of-interest recommendation in location-based social networks . . . . . 1010--1021 Mark Raasveldt and Hannes Mühleisen Don't hold my data hostage: a case for client protocol redesign . . . . . . . . 1022--1033 Erkang Zhu and Yeye He and Surajit Chaudhuri Auto-join: joining tables by leveraging transformations . . . . . . . . . . . . 1034--1045 Aoqian Zhang and Shaoxu Song and Jianmin Wang and Philip S. Yu Time series data cleaning: from anomaly detection to anomaly repairing . . . . . 1046--1057 Lu Chen and Yunjun Gao and Baihua Zheng and Christian S. Jensen and Hanyu Yang and Keyu Yang Pivot-based metric indexing . . . . . . 1058--1069 Rachid Guerraoui and Anne-Marie Kermarrec and Tao Lin and Rhicheek Patra Heterogeneous recommendations: what you might like to read after watching interstellar . . . . . . . . . . . . . . 1070--1081 Dong Deng and Albert Kim and Samuel Madden and Michael Stonebraker SilkMoth: an efficient method for finding related sets with maximum matching constraints . . . . . . . . . . 1082--1093 Yeounoh Chung and Sanjay Krishnan and Tim Kraska A data quality metric (DQM): how to estimate the number of undetected errors in data sets . . . . . . . . . . . . . . 1094--1105 Matthaios Olma and Manos Karpathiotakis and Ioannis Alagiannis and Manos Athanassoulis and Anastasia Ailamaki Slalom: coasting through raw data via adaptive partitioning and indexing . . . 1106--1117 Yinan Li and Nikos R. Katsipoulakis and Badrish Chandramouli and Jonathan Goldstein and Donald Kossmann Mison: a fast JSON parser for data analytics . . . . . . . . . . . . . . . 1118--1129 Silu Huang and Liqi Xu and Jialin Liu and Aaron J. Elmore and Aditya Parameswaran OrpheusDB: bolt-on versioning for relational databases . . . . . . . . . . 1130--1141 Alex Galakatos and Andrew Crotty and Emanuel Zgraggen and Carsten Binnig and Tim Kraska Revisiting reuse for approximate query processing . . . . . . . . . . . . . . . 1142--1153 Laurel Orr and Magdalena Balazinska and Dan Suciu Probabilistic database summarization for interactive data exploration . . . . . . 1154--1165
Ismail Oukid and Daniel Booss and Adrien Lespinasse and Wolfgang Lehner and Thomas Willhalm and Grégoire Gomes Memory management techniques for large-scale persistent-main-memory systems . . . . . . . . . . . . . . . . 1166--1177 Shuo Shang and Lisi Chen and Zhewei Wei and Christian S. Jensen and Kai Zheng and Panos Kalnis Trajectory similarity join in spatial networks . . . . . . . . . . . . . . . . 1178--1189 Theodoros Rekatsinas and Xu Chu and Ihab F. Ilyas and Christopher Ré HoloClean: holistic data repairs with probabilistic inference . . . . . . . . 1190--1201 Zsolt István and David Sidler and Gustavo Alonso Caribou: intelligent distributed storage 1202--1213 Lingjiao Chen and Arun Kumar and Jeffrey Naughton and Jignesh M. Patel Towards linear algebra over normalized data . . . . . . . . . . . . . . . . . . 1214--1225 Parmita Mehta and Sven Dorkenwald and Dongfang Zhao and Tomer Kaftan and Alvin Cheung and Magdalena Balazinska and Ariel Rokem and Andrew Connolly and Jacob Vanderplas and Yusra AlSayyad Comparative evaluation of big-data systems on scientific image analytics workloads . . . . . . . . . . . . . . . 1226--1237 Cigdem Aslay and Francesco Bonchi and Laks V. S. Lakshmanan and Wei Lu Revenue maximization in incentivized social advertising . . . . . . . . . . . 1238--1249 Lukas Rupprecht and William Culhane and Peter Pietzuch SquirrelJoin: network-aware distributed join processing with lazy partitioning 1250--1261 Sajjadur Rahman and Maryam Aliakbarpour and Ha Kyung Kong and Eric Blais and Karrie Karahalios and Aditya Parameswaran and Ronitt Rubinfield I've seen ``enough'': incrementally improving visualizations to support rapid decision making . . . . . . . . . 1262--1273 Lei Li and Wen Hua and Xingzhong Du and Xiaofang Zhou Minimal on-road time route scheduling on time-dependent graphs . . . . . . . . . 1274--1285 Nikos R. Katsipoulakis and Alexandros Labrinidis and Panos K. Chrysanthis A holistic view of stream partitioning costs . . . . . . . . . . . . . . . . . 1286--1297 Esra Akbas and Peixiang Zhao Truss-based community search: a truss-equivalence based indexing approach . . . . . . . . . . . . . . . . 1298--1309 José Cambronero and John K. Feser and Micah J. Smith and Samuel Madden Query optimization for dynamic imputation . . . . . . . . . . . . . . . 1310--1321 Neil G. Marchant and Benjamin I. P. Rubinstein In search of an entity resolution OASIS: optimal asymptotic sequential importance sampling . . . . . . . . . . . . . . . . 1322--1333 Yongxin Tong and Libin Wang and Zimu Zhou and Bolin Ding and Lei Chen and Jieping Ye and Ke Xu Flexible online task assignment in real-time spatial data . . . . . . . . . 1334--1345 Panagiotis Bouros and Nikos Mamoulis A forward scan based plane sweep algorithm for parallel interval joins 1346--1357 Kexin Rong and Peter Bailis ASAP: prioritizing attention via time series smoothing . . . . . . . . . . . . 1358--1369 Furong Li and Xin Luna Dong and Anno Langen and Yang Li Knowledge verification for long-tail verticals . . . . . . . . . . . . . . . 1370--1381 Shiladitya Pande and Sayan Ranu and Arnab Bhattacharya SkyGraph: retrieving regions of interest using skyline subgraph queries . . . . . 1382--1393 Wei Chit Tan and Meihui Zhang and Hazem Elmeleegy and Divesh Srivastava Reverse engineering aggregation queries 1394--1405 Lele Yut and Ce Zhang and Yingxia Shao and Bin Cui LDA*: a robust and large-scale topic modeling system . . . . . . . . . . . . 1406--1417 Igor Kabiljo and Brian Karrer and Mayank Pundir and Sergey Pupyrev and Alon Shalita Social hash partitioner: a scalable distributed hypergraph partitioner . . . 1418--1429 Nesreen K. Ahmed and Nick Duffield and Theodore L. Willke and Ryan A. Rossi On sampling from massive graph streams 1430--1441 Tong Yang and Yang Zhou and Hao Jin and Shigang Chen and Xiaoming Li Pyramid sketch: a sketch framework for frequency estimation of data streams . . 1442--1453 Paolo Ciaccia and Davide Martinenghi Reconciling skyline and ranking queries 1454--1465 Stella Giannakopoulou and Manos Karpathiotakis and Benjamin Gaidioz and Anastasia Ailamaki CleanM: an optimizable query language for unified scale-out data cleaning . . 1466--1477 Dong Xie and Feifei Li and Jeff M. Phillips Distributed trajectory similarity search 1478--1489 Bikash Chandra and S. Sudarshan Runtime optimization of join location in parallel data management systems . . . . 1490--1501 Oliver Lehmberg and Christian Bizer Stitching web tables for improving matching quality . . . . . . . . . . . . 1502--1513 Michael Shekelyan and Anton Dignös and Johann Gamper DigitHist: a histogram-based data summary with tight error bounds . . . . 1514--1525 Markus Pilman and Kevin Bocksrocker and Lucas Braun and Renato Marroquín and Donald Kossmann Fast scans on key--value stores . . . . 1526--1537 Can Lu and Jeffrey Xu Yu and Hao Wei and Yikai Zhang Finding the maximum clique in massive graphs . . . . . . . . . . . . . . . . . 1538--1549 Yuankai Zhang and Adam O'Neill and Micah Sherr and Wenchao Zhou Privacy-preserving network provenance 1550--1561 Daniel A. Garcia-Ulloa and Li Xiong and Vaidy Sunderam Truth discovery for spatio-temporal events from crowdsourced data . . . . . 1562--1573 Immanuel Trummer and Jiancheng Zhu and Mark Bryan Data vocalization: optimizing voice output of relational data . . . . . . . 1574--1585 Daniel Kang and John Emmons and Firas Abuzaid and Peter Bailis and Matei Zaharia NoScope: optimizing neural network queries over video at scale . . . . . . 1586--1597
Juchang Lee and SeungHyun Moon and Kyu Hwan Kim and Deok Hoe Kim and Sang Kyun Cha and Wook-Shin Han Parallel replication across formats in SAP HANA for scaling out mixed OLTP/OLAP workloads . . . . . . . . . . . . . . . 1598--1609 Rittika Shamsuddin and Amit Sawant and Balakrishnan Prabhakaran Developing a low dimensional patient class profile in accordance to their respiration-induced tumor motion . . . . 1610--1621 Mohamed Ziauddin and Andrew Witkowski and You Jung Kim and Dmitry Potapov and Janaki Lahorani and Murali Krishna Dimensions based data clustering and zone maps . . . . . . . . . . . . . . . 1622--1633 Shadi A. Noghabi and Kartik Paramasivam and Yi Pan and Navina Ramesh and Jon Bringhurst and Indranil Gupta and Roy H. Campbell Samza: stateful scalable stream processing at LinkedIn . . . . . . . . . 1634--1645 Eric Falk and Vijay K. Gurbani and Radu State Query-able Kafka: an agile data analytics pipeline for mobile wireless networks . . . . . . . . . . . . . . . . 1646--1657 Anisoara Nica and Reza Sherkat and Mihnea Andrei and Xun Cheng and Martin Heidel and Christian Bensberg and Heiko Gerwens Statisticum: data statistics management in SAP HANA . . . . . . . . . . . . . . 1658--1669 Felix Gessert and Michael Schaarschmidt and Wolfram Wingerath and Erik Witt and Eiko Yoneki and Norbert Ritter Quaestor: query web caching for database-as-a-service providers . . . . 1670--1681 Vaidas Gasiunas and David Dominguez-Sal and Ralph Acker and Aharon Avitzur and Ilan Bronshtein and Rushan Chen and Eli Ginot and Norbert Martinez-Bazan and Michael Müller and Alexander Nozdrin and Weijie Ou and Nir Pachter and Dima Sivov and Eliezer Levy Fiber-based architecture for NFV cloud databases . . . . . . . . . . . . . . . 1682--1693 Joos-Hendrik Böse and Valentin Flunkert and Jan Gasthaus and Tim Januschowski and Dustin Lange and David Salinas and Sebastian Schelter and Matthias Seeger and Yuyang Wang Probabilistic demand forecasting at scale . . . . . . . . . . . . . . . . . 1694--1705 Jinho Lee and Heesu Kim and Sungjoo Yoo and Kiyoung Choi and H. Peter Hofstee and Gi-Joon Nam and Mark R. Nutter and Damir Jamsek ExtraV: boosting graph processing near storage with a coherent accelerator . . 1706--1717 Paris Carbone and Stephan Ewen and Gyula Fóra and Seif Haridi and Stefan Richter and Kostas Tzoumas State management in Apache Flink\reg: consistent stateful distributed stream processing . . . . . . . . . . . . . . . 1718--1729 Jianjun Zheng and Qian Lin and Jiatao Xu and Cheng Wei and Chuwei Zeng and Pingan Yang and Yunfan Zhang PaxosStore: high-availability storage made practical in WeChat . . . . . . . . 1730--1741 Panagiotis Antonopoulos and Hanuma Kodavalla and Alex Tran and Nitish Upreti and Chaitali Shah and Mirek Sztajno Resumable online index rebuild in SQL server . . . . . . . . . . . . . . . . . 1742--1753 Mihnea Andrei and Christian Lemke and Günter Radestock and Robert Schulze and Carsten Thiel and Rolando Blanco and Akanksha Meghlan and Muhammad Sharique and Sebastian Seifert and Surendra Vishnoi and Daniel Booss and Thomas Peh and Ivan Schreter and Werner Thesing and Mehul Wagle and Thomas Willhalm SAP HANA adoption of non-volatile memory 1754--1765 Mingming Zhang and Tianyu Wo and Tao Xie and Xuelian Lin and Yaxiao Liu CarStream: an industrial system of big data processing for Internet-of-Vehicles 1766--1777 Daniele Bonetta and Matthias Brantner FAD.js: fast JSON data access using JIT-based speculative optimizations . . 1778--1789 Kareem S. Aggour and Jenny Weisenberg Williams and Justin McHugh and Vijay S. Kumar Colt: concept lineage tool for data flow metadata capture and analysis . . . . . 1790--1801 Chin-Chia Michael Yeh and Nickolas Kavantzas and Eamonn Keogh Matrix profile IV: using weakly labeled time series to predict outcomes . . . . 1802--1812 Sunil Chakkappen and Suratna Budalakoti and Ramarajan Krishnamachari and Satyanarayana R. Valluri and Alan Wood and Mohamed Zait Adaptive statistics in Oracle 12c . . . 1813--1824 Avrilia Floratou and Ashvin Agrawal and Bill Graham and Sriram Rao and Karthik Ramasamy Dhalion: self-regulating stream processing in Heron . . . . . . . . . . 1825--1836 Erkang Zhu and Ken Q. Pu and Fatemeh Nargesian and Renée J. Miller Interactive navigation of open data linkages . . . . . . . . . . . . . . . . 1837--1840 João Felipe Pimentel and Leonardo Murta and Vanessa Braganholo and Juliana Freire noWorkflow: a tool for collecting, analyzing, and managing provenance from Python scripts . . . . . . . . . . . . . 1841--1844 Chao Wang and Yihao Feng and Qi Guo and Zhaoxian Li and Kexin Liu and Zijian Tang and Anthony K. H. Tung and Lifu Wu and Yuxin Zheng ARShop: a cloud-based augmented reality system for shopping . . . . . . . . . . 1845--1848 Christopher R. Aberger and Andrew Lamb and Kunle Olukotun and Christopher Ré Mind the gap: bridging multi-domain query workloads with EmptyHeaded . . . . 1849--1852 Antonio Maccioni and Riccardo Torlone Crossing the finish line faster when paddling the data lake with KAYAK . . . 1853--1856 Xing Niu and Bahareh Sadat Arab and Seokki Lee and Su Feng and Xun Zou and Dieter Gawlick and Vasudha Krishnaswamy and Zhen Hua Liu and Boris Glavic Debugging transactions and tracking their provenance with reenactment . . . 1857--1860 Kai Huang and Sourav S. Bhowmick and Shuigeng Zhou and Byron Choi picasso: exploratory search of connected subgraph substructures in graph databases . . . . . . . . . . . . . . . 1861--1864 Ruichu Cai and Zijie Lu and Li Wang and Zhenjie Zhang and Tom Z. J. Fur and Marianne Winslett DITIR: distributed index for high throughput trajectory insertion and real-time temporal range query . . . . . 1865--1868 Zhifei Pang and Sai Wu and Gang Chen and Ke Chen and Lidan Shou FlashView: an interactive visual explorer for raw data . . . . . . . . . 1869--1872 Julien Subercaze and Christophe Gravier and Syed Gillani and Abderrahmen Kammoun and Frédérique Laforest Upsortable: programming top-$k$ queries over data streams . . . . . . . . . . . 1873--1876 Javad Chamanara and Birgitta König-Ries and H. V. Jagadish QUIS: in-situ heterogeneous data source querying . . . . . . . . . . . . . . . . 1877--1880 Abdussalam Alawini and Susan B. Davidson and Wei Hu and Yinjun Wu Automating data citation in CiteDB . . . 1881--1884 Yixiang Fang and Reynold Cheng and Siqiang Luo and Jiafeng Hu and Kai Huang C-explorer: browsing communities in large graphs . . . . . . . . . . . . . . 1885--1888 Wenfei Fan and Jingbo Xu and Yinghui Wu and Wenyuan Yu and Jiaxin Jiang GRAPE: parallelizing sequential graph computations . . . . . . . . . . . . . . 1889--1892 Alireza Khoshkbarforoushha and Rajiv Ranjan and Qing Wang and Carsten Friedrich Flower: a data analytics flow elasticity manager . . . . . . . . . . . . . . . . 1893--1896 Zhiyi Wang and Dongyan Zhou and Shimin Chen STEED: an analytical database system for tree-structured data . . . . . . . . . . 1897--1900 Yonghui Xiao and Li Xiong and Si Zhang and Yang Cao LocLok: location cloaking with differential privacy via hidden Markov model . . . . . . . . . . . . . . . . . 1901--1904 Xiangnan Ren and Olivier Curé and Li Ke and Jeremy Lhez and Badre Belabbess and Tendry Randriamalala and Yufan Zheng and Gabriel Kepeklian Strider: an adaptive, inference-enabled distributed RDF stream processing engine 1905--1908 Yan Li and Ngai Meng Kou and Hao Wang and Leong Hou U. and Zhiguo Gong A confidence-aware top-$k$ query processing toolkit on crowdsourcing . . 1909--1912 Valeria Fionda and Giuseppe Pirr\`o Explaining and querying knowledge graphs by relatedness . . . . . . . . . . . . . 1913--1916 Mayuresh Kunjir and Shivnath Babu Thoth in action: memory management in modern data analytics . . . . . . . . . 1917--1920 Maximilian E. Schüle and Pascal M. N. Schliski and Thomas Hutzelmann and Tobias Rosenberger and Viktor Leis and Dimitri Vorona and Alfons Kemper and Thomas Neumann Monopedia: staying single is good enough --- the hyper way for web scale applications . . . . . . . . . . . . . . 1921--1924 Ji Sun and Zeyuan Shang and Guoliang Li and Dong Deng and Zhifeng Bao Dima: a distributed in-memory similarity-based query processing system 1925--1928 Melisachew W. Chekol and Giuseppe Pirr\`o and Joerg Schoenfisch and Heiner Stuckenschmidt TeCoRe: temporal conflict resolution in knowledge graphs . . . . . . . . . . . . 1929--1932 Xupeng Li and Bin Cui and Yiru Chen and Wentao Wu and Ce Zhang MLog: towards declarative in-database machine learning . . . . . . . . . . . . 1933--1936 Çagatay Demiralp and Peter J. Haas and Srinivasan Parthasarathy and Tejaswini Pedapati Foresight: recommending visual insights 1937--1940 Steven Jacobs and Md Yusuf Sarwar Uddin and Michael Carey and Vagelis Hristidis and Vassilis J. Tsotras and N. Venkatasubramanian and Yao Wu and Syed Safir and Purvi Kaul and Xikui Wang and Mohiuddin Abdul Qader and Yawei Li A BAD demonstration: towards Big Active Data . . . . . . . . . . . . . . . . . . 1941--1944 Naeemul Hassan and Gensheng Zhang and Fatma Arslan and Josue Caraballo and Damian Jimenez and Siddhant Gawsane and Shohedul Hasan and Minumol Joseph and Aaditya Kulkarni and Anil Kumar Nayak and Vikas Sable and Chengkai Li and Mark Tremayne ClaimBuster: the first-ever end-to-end fact-checking system . . . . . . . . . . 1945--1948 Shaleen Deep and Paraschos Koutris and Yash Bidasaria QIRANA demonstration: real time scalable query pricing . . . . . . . . . . . . . 1949--1952 Meraj Khan and Larry Xu and Arnab Nandi and Joseph M. Hellerstein DataTweener: a demonstration of a tweening engine for incremental visualization of data transforms . . . . 1953--1956 Babak Salimi and Corey Cole and Dan R. K. Ports and Dan Suciu ZaliQL: causal inference from observational data at scale . . . . . . 1957--1960 Louai Alarabi and Mohamed F. Mokbel A demonstration of ST-Hadoop: a MapReduce framework for big spatio-temporal data . . . . . . . . . . 1961--1964 S. Bharadwaj and L. Chiticariu and M. Danilevsky and S. Dhingra and S. Divekar and A. Carreno-Fuentes and H. Gupta and N. Gupta and S.-D. Han and M. Hernández and H. Ho and P. Jain and S. Joshi and H. Karanam and S. Krishnan and R. Krishnamurthy and Y. Li and S. Manivannan and A. Mittal and F. Özcan and A. Quamar and P. Raman and D. Saha and K. Sankaranarayanan and J. Sen and P. Sen and S. Vaithyanathan and M. Vasa and H. Wang and H. Zhu Creation and interaction with large-scale domain-specific knowledge bases . . . . . . . . . . . . . . . . . 1965--1968 Christopher Jonathan and Mohamed F. Mokbel A demonstration of Stella: a crowdsourcing-based geotagging framework 1969--1972 Oscar Moll and Aaron Zalewski and Sudeep Pillai and Sam Madden and Michael Stonebraker and Vijay Gadepally Exploring big volume sensor data with Vroom . . . . . . . . . . . . . . . . . 1973--1976 Davide Mottin and Matteo Lissandrini and Yannis Velegrakis and Themis Palpanas New trends on exploratory methods for data analytics . . . . . . . . . . . . . 1977--1980 Arijit Khan and Sourav S. Bhowmick and Francesco Bonchi Summarizing static and dynamic big graphs . . . . . . . . . . . . . . . . . 1981--1984 Kyriakos Mouratidis Geometric approaches for top-$k$ queries 1985--1987 Yongxin Tong and Lei Chen and Cyrus Shahabi Spatial crowdsourcing: challenges, techniques, and applications . . . . . . 1988--1991 Ahmed Eldawy and Mohamed F. Mokbel The era of big spatial data . . . . . . 1992--1995 Nikos Giatrakos and Alexander Artikis and Antonios Deligiannakis and Minos Garofalakis Complex event recognition in the big data era . . . . . . . . . . . . . . . . 1996--1999 C. Mohan Tutorial: blockchains and databases . . 2000--2001 Victor Zakhary and Divyakant Agrawal and Amr El Abbadi Caching at the web scale . . . . . . . . 2002--2005 Guoliang Li Human-in-the-loop data integration . . . 2006--2017 Wolfgang Lehner The data center under your desk: how disruptive is modern hardware for DB system design? . . . . . . . . . . . . . 2018--2019 Tova Milo 7 secrets that my mother didn't tell me 2020--2020 Qin Lv and William Josephson and Zhe Wang and Moses Charikar and Kai Li Intelligent probing for locality sensitive hashing: multi-probe LSH and beyond . . . . . . . . . . . . . . . . . 2021--2024
Dai Qin and Angela Demke Brown and Ashvin Goel Scalable replay-based replication for fast databases . . . . . . . . . . . . . 2025--2036 Kai Ren and Qing Zheng and Joy Arulraj and Garth Gibson SlimDB: a space-efficient key--value storage engine for semi-sorted data . . 2037--2048 Ibrahim Abdelaziz and Razen Harbi and Zuhair Khayyat and Panos Kalnis A survey and experimental comparison of distributed SPARQL engines for very large RDF data . . . . . . . . . . . . . 2049--2060 Andreas Kunft and Asterios Katsifodimos and Sebastian Schelter and Tilmann Rabl and Volker Markl Blockjoin: efficient matrix partitioning through joins . . . . . . . . . . . . . 2061--2072 Dong-Wan Choi and Jian Pei and Thomas Heinis Efficient mining of regional movement patterns in semantic trajectories . . . 2073--2084 Martin Kiefer and Max Heimel and Sebastian Breß and Volker Markl Estimating join selectivities using bandwidth-optimized kernel density models . . . . . . . . . . . . . . . . . 2085--2096
Prashanth Menon and Todd C. Mowry and Andrew Pavlo Relaxed operator fusion for in-memory databases: making compilation, vectorization, and prefetching work together at last . . . . . . . . . . . . 1--13 Yu Liu and Bolong Zheng and Xiaodong He and Zhewei Wei and Xiaokui Xiao and Kai Zheng and Jiaheng Lu Probesim: scalable single-source and top-$k$ SimRank computations on dynamic graphs . . . . . . . . . . . . . . . . . 14--26 Paolo Guagliardo and Leonid Libkin A formal semantics of SQL queries, its validation, and applications . . . . . . 27--39 Jinhyun Kim and Jun-Ki Min and Kyuseok Shim Efficient Haar$^+$ synopsis construction for the maximum absolute error measure 40--52 Wenbo Tao and Dong Deng and Michael Stonebraker Approximate string joins with abbreviations . . . . . . . . . . . . . 53--65 Dat Ba Nguyen and Abdalghani Abujabal and Nam Khanh Tran and Martin Theobald and Gerhard Weikum Query-driven on-the-fly knowledge base construction . . . . . . . . . . . . . . 66--79 Olga Poppe and Chuan Lei and Elke A. Rundensteiner and David Maier GRETA: graph-based real-time event trend aggregation . . . . . . . . . . . . . . 80--92 Wentian Guo and Yuchen Li and Mo Sha and Kian-Lee Tan Parallel Personalized PageRank on dynamic graphs . . . . . . . . . . . . . 93--106 Mo Sha and Yuchen Li and Bingsheng He and Kian-Lee Tan Accelerating dynamic graph analytics on GPUs . . . . . . . . . . . . . . . . . . 107--120
Raja Appuswamy and Angelos C. Anadiotis and Danica Porobic and Mustafa K. Iman and Anastasia Ailamaki Analyzing the impact of system architecture on the scalability of OLTP engines for high-contention workloads 121--134 Hyungsoo Jung and Hyuck Han and Sooyong Kang Scalable database logging for multicores 135--148 Angela Bonifati and Wim Martens and Thomas Timm An analytical study of large SPARQL query logs . . . . . . . . . . . . . . . 149--161 Pinghui Wang and Yiyan Qi and Yu Sun and Xiangliang Zhang and Jing Tao and Xiaohong Guan Approximately counting triangles in large graph streams including edge duplicates with a fixed memory usage . . 162--175 Miao Qiao and Hao Zhang and Hong Cheng Subgraph matching: on compression and computation . . . . . . . . . . . . . . 176--188 Rohit Singh and Venkata Vamsikrishna Meduri and Ahmed Elmagarmid and Samuel Madden and Paolo Papotti and Jorge-Arnulfo Quiané-Ruiz and Armando Solar-Lezama and Nan Tang Synthesizing entity matching rules by examples . . . . . . . . . . . . . . . . 189--202 Liang He and Bin Shao and Yatao Li and Huanhuan Xia and Yanghua Xiao and Enhong Chen and Liang Jeff Chen Stylus: a strongly-typed store for serving massive RDF data . . . . . . . . 203--216 Ekaterini Ioannou and Minos Garofalakis Holistic query evaluation over information extraction pipelines . . . . 217--229 Georgios Psaropoulos and Thomas Legler and Norman May and Anastasia Ailamaki Interleaving with coroutines: a practical approach for robust index joins . . . . . . . . . . . . . . . . . 230--242
Dong Wen and Lu Qin and Ying Zhang and Lijun Chang and Xuemin Lin Efficient structural graph clustering: an index-based approach . . . . . . . . 243--255 Sabrina De Capitani di Vimercati and Sara Foresti and Sushil Jajodia and Giovanni Livraga and Stefano Paraboschi and Pierangela Samarati An authorization model for multi provider queries . . . . . . . . . . . . 256--268 Alexander Ratner and Stephen H. Bach and Henry Ehrenberg and Jason Fries and Sen Wu and Christopher Ré Snorkel: rapid training data creation with weak supervision . . . . . . . . . 269--282 Yuliang Li and Alin Deutsch and Victor Vianu VERIFAS: a practical verifier for artifact systems . . . . . . . . . . . . 283--296 Zhihao Jia and Yongkee Kwon and Galen Shipman and Pat McCormick and Mattan Erez and Alex Aiken A distributed multi-GPU system for fast graph processing . . . . . . . . . . . . 297--310 Tobias Bleifuß and Sebastian Kruse and Felix Naumann Efficient denial constraint discovery with Hydra . . . . . . . . . . . . . . . 311--323 Tahir Azim and Manos Karpathiotakis and Anastasia Ailamaki ReCache: reactive caching for fast analytics over heterogeneous data . . . 324--337 Long Yuan and Lu Qin and Xuemin Lin and Lijun Chang and Wenjie Zhang Effective and efficient dynamic graph coloring . . . . . . . . . . . . . . . . 338--351 Eleni Tzirita Zacharatou and Harish Doraiswamy and Anastasia Ailamaki and Cláudio T. Silva and Juliana Freiref GPU rasterization for real-time spatial aggregation over arbitrary polygons . . 352--365 Vraj Shah and Arun Kumar and Xiaojin Zhu Are key--foreign key joins safe to avoid when learning high-capacity classifiers? 366--379 Zheng Liu and Lei Chen Worker recommendation for crowdsourced Q&A services: a triple-factor aware approach . . . . . . . . . . . . . . . . 380--392
Shufeng Gong and Yanfeng Zhang and Ge Yu Clustering stream data by exploring the evolution of density mountain . . . . . 393--405 Tianzheng Wang and Ryan Johnson and Ippokratis Pandis Query fresh: log shipping on steroids 406--419 Siddhartha Sahu and Amine Mhedhbi and Semih Salihoglu and Jimmy Lin and M. Tamer Özsu The ubiquity of large graphs and surprising challenges of graph processing . . . . . . . . . . . . . . . 420--431 Karthik Ramachandra and Kwanghyun Park and K. Venkatesh Emani and Alan Halverson and César Galindo-Legaria and Conor Cunningham Froid: optimization of imperative programs in a relational database . . . 432--444 Ye Li and Leong Hou U. and Man Lung Yiu and Ngai Meng Kou An experimental study on hub labeling based shortest path algorithms . . . . . 445--457 Alexander Merritt and Ada Gavrilovska and Yuan Chen and Dejan Milojicic Concurrent log-structured memory for many-core key--value stores . . . . . . 458--471 Matteo Ceccarello and Carlo Fantozzi and Andrea Pietracaprina and Geppino Pucci and Fabio Vandin Clustering uncertain graphs . . . . . . 472--484 Ibrahim Abdelaziz and Essam Mansour and Mourad Ouzzani and Ashraf Aboulnaga and Panos Kalnis Lusail: a system for querying linked data at scale . . . . . . . . . . . . . 485--498 Hazar Harmouch and Felix Naumann Cardinality estimation: an experimental survey . . . . . . . . . . . . . . . . . 499--512 Jong-Hyeok Park and Gihwan Oh and Sang-Won Lee SQL statement logging for making SQLite truly lite . . . . . . . . . . . . . . . 513--525
Noah Johnson and Joseph P. Near and Dawn Song Towards practical differential privacy for SQL queries . . . . . . . . . . . . 526--539 Alexander Shraer and Alexandre Aybes and Bryan Davis and Christos Chrysafis and Dave Browning and Eric Krugler and Eric Stone and Harrison Chandler and Jacob Farkas and John Quinn and Jonathan Ruben and Michael Ford and Mike McMahon and Nathan Williams and Nicolas Favre-Felix and Nihar Sharma and Ori Herrnstadt and Paul Seligman and Raghav Pisolkar and Scott Dugas and Scott Gray and Sytze Harkema and Valentin Kravtsov and Vanessa Hong and Wan Ling Yih and Yizuo Tian Cloudkit: structured storage for mobile applications . . . . . . . . . . . . . . 540--552 Joy Arulraj and Justin Levandoski and Umar Farooq Minhas and Per-Ake Larson Bztree: a high-performance latch-free range index for non-volatile memory . . 553--565 Yuzhen Huang and Tatiana Jin and Yidi Wu and Zhenkun Cai and Xiao Yan and Fan Yang and Jinfeng Li and Yuying Guo and James Cheng FlexPS: flexible parallelism control in parameter server architecture . . . . . 566--579 Navid Yaghmazadeh and Xinyu Wang and Isil Dillig Automated migration of hierarchical data to relational tables using programming-by-example . . . . . . . . . 580--593 Siqiang Luo and Ben Kao and Guoliang Li and Jiafeng Hu and Reynold Cheng and Yudian Zheng TOAIN: a throughput optimizing adaptive index for answering dynamic $k$ NN queries on road networks . . . . . . . . 594--606 Tian Li and Jie Zhong and Ji Liu and Wentao Wu and Ce Zhang Ease.ml: towards multi-tenant resource sharing for machine learning workloads 607--620 Jianzhong Qi and Yufei Tao and Yanchuan Chang and Rui Zhang Theoretically optimal and empirically efficient R-trees with strong parallelizability . . . . . . . . . . . 621--634 Xueling Lin and Lei Chen Domain-aware multi-truth discovery from conflicting sources . . . . . . . . . . 635--647 Boyu Tian and Jiamin Huang and Barzan Mozafari and Grant Schoenebeck Contention-aware lock scheduling for transactional databases . . . . . . . . 648--662
Jignesh M. Patel and Harshad Deshmukh and Jianqiao Zhu and Navneet Potti and Zuyu Zhang and Marc Spehlmann and Hakan Memisoglu and Saket Saurabh Quickstep: a data platform based on the scaling-up approach . . . . . . . . . . 663--676 Haridimos Kondylakis and Niv Dayan and Kostas Zoumpatianos and Themis Palpanas Coconut: a scalable bottom-up approach for building data series indexes . . . . 677--690 Khaled Ammar and Frank McSherry and Semih Salihoglu and Manas Joglekar Distributed evaluation of subgraph queries using worst-case optimal low-memory dataflows . . . . . . . . . . 691--704 Teng Li and Zhiyuan Xu and Jian Tang and Yanzhi Wang Model-free control for distributed stream data processing using deep reinforcement learning . . . . . . . . . 705--718 Fotis Psallidas and Eugene Wu Smoke: fine-grained lineage at interactive speed . . . . . . . . . . . 719--732
Muhammad Idris and Martín Ugarte and Stijn Vansummeren and Hannes Voigt and Wolfgang Lehner Conjunctive queries with inequalities under updates . . . . . . . . . . . . . 733--745 Zhicheng Yint and Jin Sun and Ming Li and Jaliya Ekanayake and Haibo Lin and Marc Friedman and José A. Blakeley and Clemens Szyperski and Nikhil R. Devanur Bubble execution: resource-aware reliable analytics at cloud scale . . . 746--758 Sebastian Kruse and Felix Naumann Efficient discovery of approximate dependencies . . . . . . . . . . . . . . 759--772 Yue Wang and Alexandra Meliou and Gerome Miklau RC-index: diversifying answers to range queries . . . . . . . . . . . . . . . . 773--786 Xin Ding and Lu Chen and Yunjun Gao and Christian S. Jensen and Hujun Bao UlTraMan: a unified platform for big trajectory data management and analytics 787--799 Alekh Jindal and Konstantinos Karanasos and Sriram Rao and Hiren Patel Selecting subexpressions to materialize at datacenter scale . . . . . . . . . . 800--812 Fatemeh Nargesian and Erkang Zhu and Ken Q. Pu and Renée J. Miller Table union search on open data . . . . 813--825 Jianfei Chen and Jun Zhu and Jie Lu and Shixia Liu Scalable training of hierarchical topic models . . . . . . . . . . . . . . . . . 826--839
Mustafa Coskun and Ananth Grama and Mehmet Koyutürk Indexed fast network proximity querying 840--852 Libin Zheng and Lei Chen and Jieping Ye Order dispatch in price-aware ridesharing . . . . . . . . . . . . . . 853--865 Kyriakos Mouratidis and Bo Tang Exact processing of uncertain top-$k$ queries in multi-criteria settings . . . 866--879 Laure Berti-Équille and Hazar Harmouch and Felix Naumann and Noël Novelli and Saravanan Thirumuruganathan Discovery of genuine functional dependencies from relational data with missing values . . . . . . . . . . . . . 880--892 Qingchao Cai and Zhongle Xie and Meihui Zhang and Gang Chen and H. V. Jagadish and Beng Chin Ooi Effective temporal dependence discovery in time series data . . . . . . . . . . 893--905 Akhil Arora and Sakshi Sinha and Piyush Kumar and Arnab Bhattacharya HD-index: pushing the scalability-accuracy boundary for approximate kNN search in high-dimensional spaces . . . . . . . . 906--919 Yousuf Ahmad and Omar Khattab and Arsal Malik and Ahmad Musleh and Mohammad Hammoud and Mucahid Kutlu and Mostafa Shehata and Tamer Elsayed LA3: a scalable link- and locality-aware linear algebra-based graph analytics system . . . . . . . . . . . . . . . . . 920--933
Dongxiang Zhang and Mengting Ding and Dingyu Yang and Yi Liu and Ju Fan and Heng Tao Shen Trajectory simplification: an experimental study and quality analysis 934--946 Dolan Antenucci and Michael Cafarella Constraint-based explanation and repair of filter-based transformations . . . . 947--960 Xiaolan Wang and Aaron Feng and Behzad Golshan and Alon Halevy and George Mihaila and Hidekazu Oiwa and Wang-Chiew Tan Scalable semantic querying of text . . . 961--974 Luigi Bellomarini and Emanuel Sallinger and Georg Gottlob The Vadalog system: datalog-based reasoning for knowledge graphs . . . . . 975--987 Sourav Medya and Sayan Ranu and Jithin Vachery and Ambuj Singh Noticeable network delay minimization via node upgrades . . . . . . . . . . . 988--1001 Shoumik Palkar and James Thomas and Deepak Narayanan and Pratiksha Thaker and Rahul Palamuttam and Parimajan Negi and Anil Shanbhag and Malte Schwarzkopf and Holger Pirk and Saman Amarasinghe and Samuel Madden and Matei Zaharia Evaluating end-to-end optimization for data analytics applications in Weld . . 1002--1015 Magnus Müller and Guido Moerkotte and Oliver Kolb Improved selectivity estimation by combining knowledge from sampling and synopses . . . . . . . . . . . . . . . . 1016--1028 Kai Han and Keke Huang and Xiaokui Xiao and Jing Tang and Aixin Sun and Xueyan Tang Efficient algorithms for adaptive influence maximization . . . . . . . . . 1029--1040 Alex D. Breslow and Nuwan S. Jayasena Morton filters: faster, space-efficient cuckoo filters via biasing, compression, and decoupled logical sparsity . . . . . 1041--1055 Fei Bi and Lijun Chang and Xuemin Lin and Wenjie Zhang An optimal and progressive approach to online search of top-$k$ influential communities . . . . . . . . . . . . . . 1056--1068
Andreas Meister and Guido Moerkotte and Gunter Saake Errata for ``Analysis of two existing and one new dynamic programming algorithm for the generation of optimal bushy join trees without cross products'' . . . . . . . . . . . . . . . 1069--1070 Noseong Park and Mahmoud Mohammadi and Kshitij Gorde and Sushil Jajodia and Hongkyu Park and Youngmin Kim Data synthesis based on generative adversarial networks . . . . . . . . . . 1071--1083 Colin Lockard and Xin Luna Dong and Arash Einolghozati and Prashant Shiralkar CERES: distantly supervised relation extraction from the semi-structured web 1084--1096 Azade Nazi and Bolin Ding and Vivek Narasayya and Surajit Chaudhuri Efficient estimation of inclusion coefficient using hyperloglog sketches 1097--1109 Fabian Fier and Nikolaus Augsten and Panagiotis Bouros and Ulf Leser and Johann-Christoph Freytag Set similarity joins on MapReduce: an experimental survey . . . . . . . . . . 1110--1122 Bailu Ding and Sudipto Das and Wentao Wu and Surajit Chaudhuri and Vivek Narasayya Plan Stitch: harnessing the best of many plans . . . . . . . . . . . . . . . . . 1123--1136 Sheng Wang and Tien Tuan Anh Dinh and Qian Lin and Zhongle Xie and Meihui Zhang and Qingchao Cai and Gang Chen and Beng Chin Ooi and Pingcheng Ruan Forkbase: an efficient storage engine for blockchain and forkable applications 1137--1150 Khaled Ammar and M. Tamer Özsu Experimental analysis of distributed graph systems . . . . . . . . . . . . . 1151--1164 Yeye He and Xu Chu and Kris Ganjam and Yudian Zheng and Vivek Narasayya and Surajit Chaudhuri Transform-data-by-example (TDE): an extensible search engine for data transformations . . . . . . . . . . . . 1165--1177 Dan O'Keeffe and Theodoros Salonidis and Peter Pietzuch Frontier: resilient edge processing for the Internet of Things . . . . . . . . . 1178--1191 Brandon Haynes and Amrita Mazumdar and Armin Alaghi and Magdalena Balazinska and Luis Ceze and Alvin Cheung LightDB: a DBMS for virtual reality video . . . . . . . . . . . . . . . . . 1192--1205 Ryan McKenna and Gerome Miklau and Michael Hay and Ashwin Machanavajjhala Optimizing error of high-dimensional statistical queries under differential privacy . . . . . . . . . . . . . . . . 1206--1219 Yu Liu and Hantian Zhang and Luyuan Zeng and Wentao Wu and Ce Zhang MLbench: benchmarking machine learning services against human experts . . . . . 1220--1232 Lu Chen and Chengfei Liu and Rui Zhou and Jianxin Li and Xiaochun Yang and Bin Wang Maximum co-located community search in large scale social networks . . . . . . 1233--1246 Ramon Antonio Rodriges Zalipynis ChronosDB: distributed, file based, geospatial array DBMS . . . . . . . . . 1247--1261 Stephen Macke and Yiming Zhang and Silu Huang and Aditya Parameswaran Adaptive sampling for rapidly matching histograms . . . . . . . . . . . . . . . 1262--1275 Abolfazl Asudeh and Azade Nazi and Jees Augustine and Saravanan Thirumuruganathan and Nan Zhang and Gautam Das and Divesh Srivastava Leveraging similarity joins for signal reconstruction . . . . . . . . . . . . . 1276--1288 Xiangyao Yu and Yu Xia and Andrew Pavlo and Daniel Sanchez and Larry Rudolph and Srinivas Devadas Sundial: harmonizing concurrency control and caching in a distributed OLTP database management system . . . . . . . 1289--1302 Luo Mai and Kai Zeng and Rahul Potharaju and Le Xu and Steve Suh and Shivaram Venkataraman and Paolo Costa and Terry Kim and Saravanan Muthukrishnan and Vamsi Kuppa and Sudheer Dhulipalla and Sriram Rao Chi: a scalable and programmable control plane for distributed stream processing systems . . . . . . . . . . . . . . . . 1303--1316
Anthony Thomas and Arun Kumar A comparative evaluation of systems for scalable linear algebra-based analytics 2168--2182 Srinivas Karthik and Jayant R. Haritsa and Sreyash Kenkre and Vinayaka Pandit A concave path to low-overhead robust query processing . . . . . . . . . . . . 2183--2195 Yuhao Wen and Xiaodan Zhu and Sudeepa Roy and Jun Yang Interactive summarization and exploration of top aggregate query answers . . . . . . . . . . . . . . . . 2196--2208 Timo Kersten and Viktor Leis and Alfons Kemper and Thomas Neumann and Andrew Pavlo and Peter Boncz Everything you always wanted to know about compiled and vectorized queries but were afraid to ask . . . . . . . . . 2209--2222 Junyang Gao and Pankaj K. Agarwal and Jun Yang Durable top-$k$ queries on temporal data 2223--2235 Michele Linardi and Themis Palpanas Scalable, variable-length similarity search in data series: the ULISSE approach . . . . . . . . . . . . . . . . 2236--2248 Caetano Sauer and Goetz Graefe and Theo Härder FineLine: log-structured transactional storage and recovery . . . . . . . . . . 2249--2262 Protiva Rahman and Courtney Hebert and Arnab Nandi ICARUS: minimizing human effort in iterative data completion . . . . . . . 2263--2276
Sunghwan Kim and Taesung Lee and Seung-won Hwang and Sameh Elnikety List intersection for web search: algorithms, cost models, and optimizations . . . . . . . . . . . . . 1--13 Michael Whittaker and Joseph M. Hellerstein Interactive checks for coordination avoidance . . . . . . . . . . . . . . . 14--27 Jianbin Qin and Chuan Xiao Pigeonring: a principle for faster thresholded similarity search . . . . . 28--42 Ahmet Erdem Sariyüce and C. Seshadhri and Ali Pinar Local algorithms for hierarchical dense subgraph discovery . . . . . . . . . . . 43--56 Jingru Yang and Ju Fan and Zhewei Wei and Guoliang Li and Tongyu Liu and Xiaoyong Du Cost-effective data annotation using game-based crowdsourcing . . . . . . . . 57--70 Enhui Huang and Liping Peng and Luciano Di Palma and Ahmed Abdelkafi and Anna Liu and Yanlei Diao Optimization for active learning-based interactive database exploration . . . . 71--84
Tobias Bleifuß and Leon Bornemann and Theodore Johnson and Dmitri V. Kalashnikov and Felix Naumann and Divesh Srivastava Exploring change: a new dimension of data analytics . . . . . . . . . . . . . 85--98 Bishwamittra Ghosh and Mohammed Eunus Ali and Farhana M. Choudhury and Sajid Hasan Apon and Timos Sellis and Jianxin Li The flexible socio spatial group queries 99--111 Karima Echihabi and Kostas Zoumpatianos and Themis Palpanas and Houda Benbrahim The Lernaean Hydra of data series similarity search: an experimental evaluation of the state of the art . . . 112--127 Wei Wang and Jinyang Gao and Meihui Zhang and Sheng Wang and Gang Chen and Teck Khim Ng and Beng Chin Ooi and Jie Shao and Moaz Reyad Rafiki: machine learning as an analytics service system . . . . . . . . . . . . . 128--140 Pavle Suboti\'c and Herbert Jordan and Lijun Chang and Alan Fekete and Bernhard Scholz Automatic index selection for large-scale datalog computation . . . . 141--153 Shuang Song and Xu Liu and Qinzhe Wu and Andreas Gerstlauer and Tao Li and Lizy K. John Start late, finish early: a distributed graph processing system with redundancy reduction . . . . . . . . . . . . . . . 154--168 Bailu Ding and Lucja Kot and Johannes Gehrke Improving optimistic concurrency control through transaction batching and operation reordering . . . . . . . . . . 169--182
Ting Xie and Varun Chandola and Oliver Kennedy Query log compression for workload analytics . . . . . . . . . . . . . . . 183--196 Mohammed Eunus Ali and Shadman Saqib Eusuf and Kaysar Abdullah and Farhana M. Choudhury and J. Shane Culpepper and Timos Sellis The maximum trajectory coverage query in spatial databases . . . . . . . . . . . 197--209 Chenggang Wu and Alekh Jindal and Saeed Amizadeh and Hiren Patel and Wangchao Le and Shi Qiao and Sriram Rao Towards a learning optimizer for shared clouds . . . . . . . . . . . . . . . . . 210--222 Paroma Varma and Christopher Ré Snuba: automating weak supervision to label training data . . . . . . . . . . 223--236 Abolfazl Asudeh and H. V. Jagadish and Gerome Miklau and Julia Stoyanovich On obtaining stable rankings . . . . . . 237--250 Shuping Ji and Hans-Arno Jacobsen PS-tree-based efficient Boolean expression matching for high-dimensional and dense workloads . . . . . . . . . . 251--264 Yizhou Yan and Lei Cao and Samuel Madden and Elke A. Rundensteiner SWIFT: mining representative patterns from large event streams . . . . . . . . 265--277 Paul Suganthan G. C. and Adel Ardalan and AnHai Doan and Aditya Akella Smurf: self-service string matching using random forests . . . . . . . . . . 278--291 Feilong Liu and Ario Salmasi and Spyros Blanas and Anastasios Sidiropoulos Chasing similarity: distribution-aware aggregation scheduling . . . . . . . . . 292--306 Johes Bater and Xi He and William Ehrich and Ashwin Machanavajjhala and Jennie Rogers Shrinkwrap: efficient SQL query processing in differentially private data federations . . . . . . . . . . . . 307--320
Gurbinder Gill and Roshan Dathathri and Loc Hoang and Keshav Pingali A study of partitioning policies for graph analytics on large-scale distributed platforms . . . . . . . . . 321--334 K. Ashwin Kumar and Petros Efstathopoulos Utility-driven graph summarization . . . 335--347 Kaan Kara and Ken Eguro and Ce Zhang and Gustavo Alonso ColumnML: column-store machine learning with on-the-fly data transformation . . 348--361 Yanying Li and Haipei Sun and Boxiang Dong and Hui (Wendy) Wang Cost-efficient data acquisition on online data marketplaces for correlation analysis . . . . . . . . . . . . . . . . 362--375 Mohamad Dolatshah and Mathew Teoh and Jiannan Wang and Jian Pei Cleaning crowdsourced labels using oracles for statistical classification 376--389 Matteo Lissandrini and Martin Brugnara and Yannis Velegrakis Beyond macrobenchmarks: microbenchmark-based graph database evaluation . . . . . . . . . . . . . . . 390--403 Valter Balegas and Sérgio Duarte and Carla Ferreira and Rodrigo Rodrigues and Nuno Preguiça IPA: invariant-preserving applications for weakly consistent replicated databases . . . . . . . . . . . . . . . 404--418 Firas Abuzaid and Peter Kraft and Sahaana Suri and Edward Gan and Eric Xu and Atul Shenoy and Asvin Ananthanarayan and John Sheu and Erik Meijer and Xi Wu and Jeff Naughton and Peter Bailis and Matei Zaharia DIFF: a relational interface for large-scale data explanation . . . . . . 419--432 Ran Ben Basat and Roy Friedman and Rana Shahout Stream frequency over interval queries 433--445 Doris Xin and Stephen Macke and Litian Ma and Jialin Liu and Shuchen Song and Aditya Parameswaran HELIX: holistic optimization for accelerating iterative machine learning 446--460
Cong Fu and Chao Xiang and Changxu Wang and Deng Cai Fast approximate nearest neighbor search with the navigating spreading-out graph 461--474 Qi Wang and Torsten Suel Document reordering for faster intersection . . . . . . . . . . . . . . 475--487 Xiaofei Zhang and M. Tamer Özsu Correlation constraint shortest path over large multi-relation graphs . . . . 488--501 Harald Lang and Thomas Neumann and Alfons Kemper and Peter Boncz Performance-optimal filtering: Bloom overtakes Cuckoo at high throughput . . 502--515 Steffen Zeuch and Bonaventura Del Monte and Jeyhun Karimov and Clemens Lutz and Manuel Renz and Jonas Traub and Sebastian Breß and Tilmann Rabl and Volker Markl Analyzing efficient stream processing on modern hardware . . . . . . . . . . . . 516--530 Chen Luo and Michael J. Carey Efficient data ingestion and query processing for LSM-based storage systems 531--543 Periklis Chrysogelos and Manos Karpathiotakis and Raja Appuswamy and Anastasia Ailamaki HetExchange: encapsulating heterogeneous CPU--GPU parallelism in JIT compiled engines . . . . . . . . . . . . . . . . 544--556 Paolo Atzeni and Luigi Bellomarini and Paolo Papotti and Riccardo Torlone Meta-mappings for schema mapping reuse 557--569 Lijie Xu and Tian Guo and Wensheng Dou and Wei Wang and Jun Wei An experimental evaluation of garbage collectors on big data applications . . 570--583 Jinwei Guo and Peng Cai and Jiahao Wang and Weining Qian and Aoying Zhou Adaptive optimistic concurrency control for heterogeneous workloads . . . . . . 584--596 Yu-Shan Lin and Shao-Kan Pi and Meng-Kai Liao and Ching Tsai and Aaron Elmore and Shan-Hung Wu MgCrab: transaction crabbing for live migration in deterministic database systems . . . . . . . . . . . . . . . . 597--610 Sujaya Maiyya and Faisal Nawab and Divyakant Agrawal and Amr El Abbadi Unifying consensus and atomic commitment for effective cloud data management . . 611--623
Chenggang Wu and Vikram Sreekanti and Joseph M. Hellerstein Autoscaling tiered cloud storage in Anna 624--638 Anton Dignös and Boris Glavic and Xing Niu and Michael Böhlen and Johann Gamper Snapshot semantics for temporal multiset relations . . . . . . . . . . . . . . . 639--652 Selasi Kwashie and Lin Liu and Jixue Liu and Markus Stumptner and Jiuyong Li and Lujing Yang Certus: an effective entity resolution approach with graph differential dependencies (GDDs) . . . . . . . . . . 653--666 Kai Han and Fei Gui and Xiaokui Xiao and Jing Tang and Yuntian He and Zongmai Cao and He Huang Efficient and effective algorithms for clustering uncertain graphs . . . . . . 667--680 Jia Zou and Arun Iyengar and Chris Jermaine Pangea: monolithic distributed storage for data analytics . . . . . . . . . . . 681--694 Zhiwei Fan and Jianqiao Zhu and Zuyu Zhang and Aws Albarghouthi and Paraschos Koutris and Jignesh M. Patel Scaling-up in-memory datalog processing: observations and techniques . . . . . . 695--708 Aaron Archer and Kevin Aydin and Mohammad Hossein Bateni and Vahab Mirrokni and Aaron Schild and Ray Yang and Richard Zhuang Cache-aware load balancing of data center applications . . . . . . . . . . 709--723
Michael Borkowski and Christoph Hochreiner and Stefan Schulte Minimizing cost by reducing scaling operations in distributed stream processing . . . . . . . . . . . . . . . 724--737 Yinjun Wu and Abdussalam Alawini and Daniel Deutch and Tova Milo and Susan Davidson ProvCite: provenance-based data citation 738--751 Wenfei Fan and Ping Lu and Chao Tian and Jingren Zhou Deducing certain fixes to graphs . . . . 752--765 Matteo Ceccarello and Andrea Pietracaprina and Geppino Pucci Solving $k$-center clustering (with outliers) in MapReduce and streaming, almost as accurately as sequentially . . 766--778 Xiaolan Wang and Alexandra Meliou Explain$3$D: explaining disagreements in disjoint datasets . . . . . . . . . . . 779--792 Youjip Won and Sundoo Kim and Juseong Yun and Dam Quang Tuan and Jiwon Seo DASH: database shadowing for mobile DBMS 793--806 Zeke Wang and Kaan Kara and Hantian Zhang and Gustavo Alonso and Onur Mutlu and Ce Zhang Accelerating generalized linear models with MLWeaving: a one-size-fits-all system for any-precision learning . . . 807--821 Dimitrije Jankov and Shangyu Luo and Binhang Yuan and Zhuhua Cai and Jia Zou and Chris Jermaine and Zekai J. Gao Declarative recursive computation on an RDBMS: or, why you should use a database for distributed machine learning . . . . 822--835
Shahram Ghandeharizadeh and Hieu Nguyen Design, implementation, and evaluation of write-back policy with cache augmented data stores . . . . . . . . . 836--849 Thanh Tam Nguyen and Matthias Weidlich and Hongzhi Yin and Bolong Zheng and Quoc Viet Hung Nguyen and Bela Stantic User guidance for efficient fact checking . . . . . . . . . . . . . . . . 850--863 Xiangyu Ke and Arijit Khan and Leroy Lim Hong Quan An in-depth comparison of $s$--$t$ reliability algorithms over uncertain graphs . . . . . . . . . . . . . . . . . 864--876 Wenfei Fan and Chunming Hu and Muyang Liu and Ping Lu and Qiang Yin and Jingren Zhou Dynamic scaling for parallel graph computations . . . . . . . . . . . . . . 877--890 Dongsheng Li and Yiming Zhang and Jinyan Wang and Kian-Lee Tan TopoX: topology refactorization for efficient graph partitioning and processing . . . . . . . . . . . . . . . 891--905 Dmitrii Avdiukhin and Sergey Pupyrev and Grigory Yaroslavtsev Multi-dimensional balanced graph partitioning via projected gradient descent . . . . . . . . . . . . . . . . 906--919 Lei Cao and Yizhou Yan and Samuel Madden and Elke A. Rundensteiner and Mathan Gopalsamy Efficient discovery of sequence outlier patterns . . . . . . . . . . . . . . . . 920--932 Dmytro Bogatov and George Kollios and Leonid Reyzin A comparative evaluation of order-revealing encryption schemes and secure range-query protocols . . . . . . 933--947
Faisal Orakzai and Toon Calders and Torben Bach Pedersen $ k / 2$-hop: fast mining of convoy patterns with effective pruning . . . . 948--960 Ji Sun and Zeyuan Shang and Guoliang Li and Dong Deng and Zhifeng Bao Balance-aware distributed string similarity-based query processing system 961--974 Pingcheng Ruan and Gang Chen and Tien Tuan Anh Dinh and Qian Lin and Beng Chin Ooi and Meihui Zhang Fine-grained, secure and efficient data provenance on blockchain systems . . . . 975--988 Dalsu Choi and Chang-Sup Park and Yon Dohn Chung Progressive top-$k$ subarray query processing in array databases . . . . . 989--1001 Moritz Hoffmann and Andrea Lattuada and Frank McSherry Megaphone: latency-conscious state migration for distributed streaming dataflows . . . . . . . . . . . . . . . 1002--1015 Nguyen Thanh Tam and Matthias Weidlich and Bolong Zheng and Hongzhi Yin and Nguyen Quoc Viet Hung and Bela Stantic From anomaly detection to rumour detection using data streams of social platforms . . . . . . . . . . . . . . . 1016--1029 Peeyush Gupta and Yin Li and Sharad Mehrotra and Nisha Panwar and Shantanu Sharma and Sumaya Almanee Obscure: information-theoretic oblivious and verifiable aggregation queries . . . 1030--1043 Anshuman Dutt and Chi Wang and Azade Nazi and Srikanth Kandula and Vivek Narasayya and Surajit Chaudhuri Selectivity estimation for range predicates using lightweight models . . 1044--1057
Ye Yuan and Xiang Lian and Guoren Wang and Yuliang Ma and Yishu Wang Constrained shortest path query in a large time-dependent graph . . . . . . . 1058--1070 Lingyang Chu and Zhefeng Wang and Jian Pei and Yanyan Zhang and Yu Yang and Enhong Chen Finding theme communities from database networks . . . . . . . . . . . . . . . . 1071--1084 James J. Pan and Guoliang Li and Juntao Hu Ridesharing: simulator, benchmark, and evaluation . . . . . . . . . . . . . . . 1085--1098 Longbin Lai and Zhu Qing and Zhengyi Yang and Xin Jin and Zhengmin Lai and Ran Wang and Kongzhang Hao and Xuemin Lin and Lu Qin and Wenjie Zhang and Ying Zhang and Zhengping Qian and Jingren Zhou Distributed subgraph matching on timely dataflow . . . . . . . . . . . . . . . . 1099--1112 Shi Qiao and Adrian Nicoara and Jin Sun and Marc Friedman and Hiren Patel and Jaliya Ekanayake Hyper dimension shuffle: efficient data repartition at petabyte scale in SCOPE 1113--1125 Graham Cormode and Tejas Kulkarni and Divesh Srivastava Answering range queries under local differential privacy . . . . . . . . . . 1126--1138 Kai Wang and Xuemin Lin and Lu Qin and Wenjie Zhang and Ying Zhang Vertex priority based butterfly counting for large-scale bipartite networks . . . 1139--1152 Yang Cao and Wenfei Fan and Tengfei Yuan Block as a value for SQL over NoSQL . . 1153--1166 Kanat Tangwongsan and Martin Hirzel and Scott Schneider Optimal and general out-of-order sliding-window aggregation . . . . . . . 1167--1180 Bo Tang and Kyriakos Mouratidis and Man Lung Yiu and Zhenyu Chen Creating top ranking options in the continuous option and preference space 1181--1194 Hanchao Ma and Morteza Alipourlangouri and Yinghui Wu and Fei Chiang and Jiaxing Pi Ontology-based entity matching in attributed graphs . . . . . . . . . . . 1195--1207 Lu Chen and Yunjun Gao and Ziquan Fang and Xiaoye Miao and Christian S. Jensen and Chenjuan Guo Real-time distributed co-movement pattern detection on streaming trajectories . . . . . . . . . . . . . . 1208--1220 Jian Tan and Tieying Zhang and Feifei Li and Jie Chen and Qixing Zheng and Ping Zhang and Honglin Qiao and Yue Shi and Wei Cao and Rui Zhang iBTune: individualized buffer tuning for large-scale cloud databases . . . . . . 1221--1234
Michael Whittaker and Nick Edmonds and Sandeep Tata and James B. Wendt and Marc Najork Online template induction for machine-generated emails . . . . . . . . 1235--1248 Yong Wang and Guoliang Li and Nan Tang Querying shortest paths on time dependent road networks . . . . . . . . 1249--1261 Anna Fariha and Alexandra Meliou Example-driven query intent discovery: abductive reasoning using semantic similarity . . . . . . . . . . . . . . . 1262--1275 Qi Zhou and Joy Arulraj and Shamkant Navathe and William Harris and Dong Xu Automated verification of query equivalence using satisfiability modulo theories . . . . . . . . . . . . . . . . 1276--1288 Pengfei Xu and Jiaheng Lu Towards a unified framework for string similarity joins . . . . . . . . . . . . 1289--1302 Susik Yoon and Jae-Gil Lee and Byung Suk Lee NETS: extremely fast outlier detection from a data stream via set-based processing . . . . . . . . . . . . . . . 1303--1315 Yi Lu and Xiangyao Yu and Samuel Madden STAR: scaling transactions through asymmetric replication . . . . . . . . . 1316--1329 Yuliang Li and Aaron Feng and Jinfeng Li and Saran Mumick and Alon Halevy and Vivian Li and Wang-Chiew Tan Subjective databases . . . . . . . . . . 1330--1343 Xuguang Ren and Junhu Wang and Wook-Shin Han and Jeffrey Xu Yu Fast and robust distributed subgraph enumeration . . . . . . . . . . . . . . 1344--1356 Fangeheng Fu and Jiawei Jiang and Yingxia Shao and Bin Cui An experimental evaluation of large scale GBDT systems . . . . . . . . . . . 1357--1370 Ios Kotsogiannis and Yuchao Tao and Xi He and Maryam Fanaeepour and Ashwin Machanavajjhala and Michael Hay and Gerome Miklau PrivateSQL: a differentially private SQL query engine . . . . . . . . . . . . . . 1371--1384 Mohammad Javad Amiri and Divyakant Agrawal and Amr El Abbadi CAPER: a cross-application permissioned blockchain . . . . . . . . . . . . . . . 1385--1398 Alexandros Koliousis and Pijika Watcharapichat and Matthias Weidlich and Luo Mai and Paolo Costa and Peter Pietzuch Crossbow: scaling deep learning with small batch sizes on multi-GPU servers 1399--1412 Kaiyu Feng and Gao Cong and Christian S. Jensen and Tao Guo Finding attribute-aware similar regions for data analysis . . . . . . . . . . . 1414--1426 Dixin Tang and Zechao Shang and Aaron J. Elmore and Sanjay Krishnan and Michael J. Franklin Intermittent query processing . . . . . 1427--1441 Mihai Budiu and Parikshit Gopalan and Lalith Suresh and Udi Wieder and Han Kruiger and Marcos K. Aguilera Hillview: a trillion-cell spreadsheet for big data . . . . . . . . . . . . . . 1442--1457 Ziheng Wei and Sebastian Link Embedded functional dependencies and data-completeness tailored database design . . . . . . . . . . . . . . . . . 1458--1470 Hua Fan and Wojciech Golab Ocean Vista: gossip-based visibility control for speedy geo-distributed transactions . . . . . . . . . . . . . . 1471--1484 Xikui Wang and Michael J. Carey An IDEA: an ingestion framework for data enrichment in asterixDB . . . . . . . . 1485--1498 Alexey Karyakin and Kenneth Salem DimmStore: memory power optimization for database systems . . . . . . . . . . . . 1499--1512 Cong Yan and Alvin Cheung Generating application-specific data layouts for in-memory databases . . . . 1513--1525 Rihan Hai and Christoph Quix Rewriting of plain SO tgds into nested tgds . . . . . . . . . . . . . . . . . . 1526--1538 Senthil Nathan and Chander Govindarajan and Adarsh Saraf and Manish Sethi and Praveen Jayachandran Blockchain meets database: design and implementation of a blockchain relational database . . . . . . . . . . 1539--1552 Andreas Kunft and Asterios Katsifodimos and Sebastian Schelter and Sebastian Breß and Tilmann Rabl and Volker Markl An intermediate representation for optimizing machine learning pipelines 1553--1567 Yuanwei Fang and Chen Zou and Andrew A. Chien Accelerating raw data analysis with the ACCORDA software and hardware architecture . . . . . . . . . . . . . . 1568--1582 A. B. Siddique and Ahmed Eldawy and Vagelis Hristidis Comparing synopsis techniques for approximate spatial data analysis . . . 1583--1596 Muhammad El-Hindi and Carsten Binnig and Arvind Arasu and Donald Kossmann and Ravi Ramamurthy BlockchainDB: a shared database on blockchains . . . . . . . . . . . . . . 1597--1609 Ruoxi Jia and David Dao and Boxin Wang and Frances Ann Hubis and Nezihe Merve Gurel and Bo Li and Ce Zhang and Costas Spanos and Dawn Song Efficient task-specific data valuation for nearest neighbor algorithms . . . . 1610--1623 Hemant Saxena and Lukasz Golab and Ihab F. Ilyas Distributed implementations of dependency discovery algorithms . . . . 1624--1636 Erfan Zamanian and Xiangyao Yu and Michael Stonebraker and Tim Kraska Rethinking database high availability with RDMA networks . . . . . . . . . . . 1637--1650 Marco Bressan and Stefano Leucci and Alessandro Panconesi Motivo: fast motif counting via succinct color coding and adaptive sampling . . . 1651--1663 Rishabh Poddar and Tobias Boelter and Raluca Ada Popa Arx: an encrypted database using semantically secure encryption . . . . . 1664--1678 Junyang Gao and Xian Li and Yifan Ethan Xu and Bunyamin Sisman and Xin Luna Dong and Jun Yang Efficient knowledge graph accuracy evaluation . . . . . . . . . . . . . . . 1679--1691 Amine Mhedhbi and Semih Salihoglu Optimizing subgraph queries by combining binary and worst-case optimal joins . . 1692--1704 Ryan Marcus and Parimarjan Negi and Hongzi Mao and Chi Zhang and Mohammad Alizadeh and Tim Kraska and Olga Papaemmanouil and Nesime Tatbul Neo: a learned query optimizer . . . . . 1705--1718 Yixiang Fang and Kaiqiang Yu and Reynold Cheng and Laks V. S. Lakshmanan and Xuemin Lin Efficient algorithms for densest subgraph discovery . . . . . . . . . . . 1719--1732 Ryan Marcus and Olga Papaemmanouil Plan-structured deep neural network models for query performance prediction 1733--1746 Kun Ren and Dennis Li and Daniel J. Abadi SLOG: serializable, low-latency, geo-replicated transactions . . . . . . 1747--1761 John Paparrizos and Michael J. Franklin GRAIL: efficient time-series representation learning . . . . . . . . 1762--1777
Guilherme Damasio and Spencer Bryson and Vincent Corvinelli and Parke Godfrey and Piotr Mierzejewski and Jaroslaw Szlichta and Calisto Zuzarte GALO: guided automated learning for re-optimization . . . . . . . . . . . . 1778--1781 Yuanyuan Tian and Wen Sun and Sui Jun Tong and En Liang Xu and Mir Hamid Pirahesh and Wei Zhao Synergistic graph and SQL analytics inside IBM Db2 . . . . . . . . . . . . . 1782--1785 Xiaoou Ding and Hongzhi Wang and Jiaxuan Su and Zijue Li and Jianzhong Li and Hong Gao Cleanits: a data cleaning system for industrial time series . . . . . . . . . 1786--1789 Yipeng Zhang and Zhifeng Bao and Songsong Mo and Yuchen Li and Yanghao Zhou ITAA: an intelligent trajectory-driven outdoor advertising deployment assistant 1790--1793 Kun Qian and Lucian Popa and Prithviraj Sen SystemER: a human-in-the-loop system for explainable entity resolution . . . . . 1794--1797 Viet-Phi Huynh and Paolo Papotti Buckle: evaluating fact checking algorithms built on knowledge bases . . 1798--1801 Peng Gao and Xusheng Xiao and Zhichun Li and Kangkook Jee and Fengyuan Xu and Sanjeev R. Kulkarni and Prateek Mittal A query system for efficiently investigating complex attack behaviors for enterprise security . . . . . . . . 1802--1805 Zhengjie Miao and Qitian Zeng and Chenjie Li and Boris Glavic and Oliver Kennedy and Sudeepa Roy CAPE: explaining outliers by counterbalancing . . . . . . . . . . . . 1806--1809 Karthik Ramachandra and Kwanghyun Park BlackMagic: automatic inlining of scalar UDFs into SQL queries with Froid . . . . 1810--1813 Lukas Berg and Tobias Ziegler and Carsten Binnig and Uwe Röhm ProgressiveDB: progressive data analytics as a middleware . . . . . . . 1814--1817 Kaan Kara and Zeke Wang and Ce Zhang and Gustavo Alonso doppioDB 2.0: hardware techniques for improved integration of machine learning into databases . . . . . . . . . . . . . 1818--1821 Cicero A. L. Pahins and Behrooz Omidvar-Tehrani and Sihem Amer-Yahia and Valérie Siroux and Jean-Louis Pepin and Jean-Christian Borel and João L. D. Comba COVIZ: a system for visual formation and exploration of patient cohorts . . . . . 1822--1825 Martin Franke and Ziad Sehili and Erhard Rahm PRIMAT: a toolbox for fast privacy-preserving matching . . . . . . 1826--1829 Ryan Marcus and Chi Zhang and Shuai Yu and Geoffrey Kao and Olga Papaemmanouil NashDB: fragmentation, replication, and provisioning using economic methods . . 1830--1833 Ibrahim Sabek and Mashaal Musleh and Mohamed F. Mokbel Flash in action: scalable spatial data analysis using Markov logic networks . . 1834--1837 Lucas Kuhring and Zsolt István I can't believe it's not (only) software!: bionic distributed storage for Parquet files . . . . . . . . . . . 1838--1841 Hyewon Choi and Erkang Zhu and Arsala Bangash and Renée J. Miller VISE: vehicle image search engine with traffic camera . . . . . . . . . . . . . 1842--1845 Stephan Goldberg and Tova Milo and Slava Novgorodov and Kathy Razmadze WiClean: a system for fixing Wikipedia interlinks using revision history patterns . . . . . . . . . . . . . . . . 1846--1849 Abhishek Roy and Alekh Jindal and Hiren Patel and Ashit Gosalia and Subru Krishnan and Carlo Curino SparkCruise: handsfree computation reuse in Spark . . . . . . . . . . . . . . . . 1850--1853 Sandeep Singh Sandha and Wellington Cabrera and Mohammed Al-Kateb and Sanjay Nair and Mani Srivastava In-database distributed machine learning: demonstration using Teradata SQL engine . . . . . . . . . . . . . . . 1854--1857 Zhao Li and Xia Chen and Xuming Pan and Pengcheng Zou and Yuchen Li and Guoxian Yu SHOAL: large-scale hierarchical taxonomy via graph-based query coalition in e-commerce . . . . . . . . . . . . . . . 1858--1861 Min Xu and Tianhao Wang and Bolin Ding and Jingren Zhou and Cheng Hong and Zhicong Huang DPSAaS: multi-dimensional data sharing and analytics as services under local differential privacy . . . . . . . . . . 1862--1865 Yang Cao and Yonghui Xiao and Li Xiong and Liquan Bai and Masatoshi Yoshikawa PriSTE: protecting spatiotemporal event privacy in continuous location-based services . . . . . . . . . . . . . . . . 1866--1869 Daniel Deutch and Evgeny Marants and Yuval Moskovitch Datalignment: ontology schema alignment through datalog containment . . . . . . 1870--1873 Congcong Ge and Yunjun Gao and Xiaoye Miao and Lu Chen and Christian S. Jensen and Ziyuan Zhu IHCS: an integrated hybrid cleaning system . . . . . . . . . . . . . . . . . 1874--1877 Constantinos Costa and Xiaoyu Ge and Panos K. Chrysanthis CAPRIO: graph-based integration of indoor and outdoor data for path discovery . . . . . . . . . . . . . . . 1878--1881 Yingjun Wu and Jia Yu and Yuanyuan Tian and Richard Sidle and Ronald Barber HERMIT in action: succinct secondary indexing mechanism via correlation exploration . . . . . . . . . . . . . . 1882--1885 Julien Loudet and Iulian Sandu-Popa and Luc Bouganim DISPERS: securing highly distributed queries on personal data management systems . . . . . . . . . . . . . . . . 1886--1889 Adil Akhter and Marios Fragkoulis and Asterios Katsifodimos Stateful functions as a service in action . . . . . . . . . . . . . . . . . 1890--1893 Allen Ordookhanians and Xin Li and Supun Nakandala and Arun Kumar Demonstration of Krypton: optimized CNN inference for occlusion-based deep CNN explanations . . . . . . . . . . . . . . 1894--1897 Zhengjie Miao and Andrew Lee and Sudeepa Roy LensXPlain: visualizing and explaining contributing subsets for aggregate query answers . . . . . . . . . . . . . . . . 1898--1901 Yi Zhang and Zachary G. Ives Juneau: data lake management for Jupyter 1902--1905 Sona Hasani and Faezeh Ghaderi and Shohedul Hasan and Saravanan Thirumuruganathan and Abolfazl Asudeh and Nick Koudas and Gautam Das ApproxML: efficient approximate ad-hoc ML models through materialization and reuse . . . . . . . . . . . . . . . . . 1906--1909 Grégory Essertel and Ruby Y. Tahboub and Fei Wang and James Decker and Tiark Rompf Flare & Lantern: efficiently swapping horses midstream . . . . . . . . . . . . 1910--1913 Ruben Martins and Jia Chen and Yanju Chen and Yu Feng and Isil Dillig Trinity: an extensible synthesis framework for data science . . . . . . . 1914--1917 Zhiqi Huang and Ryan McKenna and George Bissias and Gerome Miklau and Michael Hay and Ashwin Machanavajjhala PSynDB: accurate and accessible private data generation . . . . . . . . . . . . 1918--1921 Badrish Chandramouli and Dong Xie and Yinan Li and Donald Kossmann FishStore: fast ingestion and indexing of raw data . . . . . . . . . . . . . . 1922--1925 Yanlei Diao and Pawe\l Guzewicz and Ioana Manolescu and Mirjana Mazuran Spade: a modular framework for analytical exploration of RDF graphs . . 1926--1929 Joseph Vinish D'silva and Florestan De Moor and Bettina Kemme Making an RDBMS data scientist friendly: advanced in-database interactive analytics with visualization support . . 1930--1933 Khaled Zaouk and Fei Song and Chenghao Lyu and Arnab Sinha and Yanlei Diao and Prashant Shenoy UDAO: a next-generation unified data analytics optimizer . . . . . . . . . . 1934--1937 Saehan Jo and Immanuel Trummer and Weicheng Yu and Xuezhi Wang and Cong Yu and Daniel Liu and Niyati Mehta AggChecker: a fact-checking system for text summaries of relational data sets 1938--1941 Hanzhang Wang and Phuong Nguyen and Jun Li and Selcuk Kopru and Gene Zhang and Sanjeev Katariya and Sami Ben-Romdhane GRANO: interactive graph-based root cause analysis for cloud-native distributed data platform . . . . . . . 1942--1945 Davide Frey and Marc X. Makkes and Pierre-Louis Roman and François Ta\"\iani and Spyros Voulgaris Dietcoin: hardening Bitcoin transaction verification process for mobile devices 1946--1949 Samriddhi Singla and Ahmed Eldawy and Rami Alghamdi and Mohamed F. Mokbel Raptor: large scale analysis of big raster and vector data . . . . . . . . . 1950--1953 El Kindi Rezig and Lei Cao and Michael Stonebraker and Giovanni Simonini and Wenbo Tao and Samuel Madden and Mourad Ouzzani and Nan Tang and Ahmed K. Elmagarmid Data Civilizer 2.0: a holistic framework for data preparation and analytics . . . 1954--1957 Leonhard F. Spiegelberg and Tim Kraska Tuplex: robust, efficient analytics when Python rules . . . . . . . . . . . . . . 1958--1961 Cedric Renggli and Frances Ann Hubis and Bojan Karlas and Kevin Schawinski and Wentao Wu and Ce Zhang Ease.ml/ci and Ease.ml/meter in action: towards data management for statistical generalization . . . . . . . . . . . . . 1962--1965 Xueran Han and Jun Chen and Jiaheng Lu and Yueguo Chen and Xiaoyong Du PivotE: revealing and visualizing the underlying entity structures for exploration . . . . . . . . . . . . . . 1966--1969 Jiaheng Lu and Yuxing Chen and Herodotos Herodotou and Shivnath Babu Speedup your analytics: automatic parameter tuning for databases and big data systems . . . . . . . . . . . . . . 1970--1973 Yu Meng and Jiaxin Huang and Jingbo Shang and Jiawei Han TextCube: automated construction and multidimensional exploration . . . . . . 1974--1977 Sihem Amer-Yahia and Senjuti Basu Roy The ever evolving online labor market: overview, challenges and opportunities 1978--1981 Ibrahim Sabek and Mohamed F. Mokbel Machine learning meets big spatial data 1982--1985 Fatemeh Nargesian and Erkang Zhu and Renée J. Miller and Ken Q. Pu and Patricia C. Arocena Data lake management: challenges and opportunities . . . . . . . . . . . . . 1986--1989 Laks V. S. Lakshmanan and Michael Simpson and Saravanan Thirumuruganathan Combating fake news: a data management and mining perspective . . . . . . . . . 1990--1993 Nicolas Anciaux and Luc Bouganim and Philippe Pucheral and Iulian Sandu Popa and Guillaume Scerri Personal database security and trusted execution environments: a tutorial at the crossroads . . . . . . . . . . . . . 1994--1997 Stephan Kessler and Jens Hoff and Johann-Christoph Freytag SAP HANA goes private: from privacy research to privacy aware enterprise analytics . . . . . . . . . . . . . . . 1998--2009 Guilherme Damasio and Vincent Corvinelli and Parke Godfrey and Piotr Mierzejewski and Alex Mihaylov and Jaroslaw Szlichta and Calisto Zuzarte Guided automated learning for query workload re-optimization . . . . . . . . 2010--2021 Biswapesh Chattopadhyay and Priyam Dutta and Weiran Liu and Ott Tinn and Andrew Mccormick and Aniket Mokashi and Paul Harvey and Hector Gonzalez and David Lomax and Sagar Mittal and Roee Ebenstein and Nikita Mikhaylin and Hung-ching Lee and Xiaoyan Zhao and Tony Xu and Luis Perez and Farhad Shahmohammadi and Tran Bui and Neil McKay and Selcuk Aya and Vera Lychagina and Brett Elliott Procella: unifying serving and analytical data at YouTube . . . . . . . 2022--2034 Wei Lu and Zhanhao Zhao and Xiaoyu Wang and Haixiang Li and Zhenmiao Zhang and Zhiyu Shui and Sheng Ye and Anqun Pan and Xiaoyong Du A lightweight and efficient temporal database management system in TDSQL . . 2035--2046 Reza Sherkat and Colin Florendo and Mihnea Andrei and Rolando Blanco and Adrian Dragusanu and Amit Pathak and Pushkar Khadilkar and Neeraj Kulkarni and Christian Lemke and Sebastian Seifert and Sarika Iyer and Sasikanth Gottapu and Robert Schulze and Chaitanya Gottipati and Nirvik Basak and Yanhong Wang and Vivek Kandiyanallur and Santosh Pendap and Dheren Gala and Rajesh Almeida and Prasanta Ghosh Native store extension for SAP HANA . . 2047--2058 Chaoqun Zhan and Maomeng Su and Chuangxian Wei and Xiaoqiang Peng and Liang Lin and Sheng Wang and Zhe Chen and Feifei Li and Yue Pan and Fang Zheng and Chengliang Chai AnalyticDB: real-time OLAP database system at Alibaba cloud . . . . . . . . 2059--2070 William Schultz and Tess Avitabile and Alyson Cabral Tunable consistency in MongoDB . . . . . 2071--2081 Shaosheng Cao and XinXing Yang and Cen Chen and Jun Zhou and Xiaolong Li and Yuan Qi TitAnt: online real-time transaction fraud detection in Ant Financial . . . . 2082--2093 Rong Zhu and Kun Zhao and Hongxia Yang and Wei Lin and Chang Zhou and Baole Ai and Yong Li and Jingren Zhou AliGraph: a comprehensive graph neural network platform . . . . . . . . . . . . 2094--2105 Zhimin Chen and Yue Wang and Vivek Narasayya and Surajit Chaudhuri Customizable and scalable fuzzy join for big data . . . . . . . . . . . . . . . . 2106--2117 Guoliang Li and Xuanhe Zhou and Shifu Li and Bo Gao QTune: a query-aware database tuning system with deep reinforcement learning 2118--2130 Srikanth Kandula and Kukjin Lee and Surajit Chaudhuri and Marc Friedman Experiences with approximating queries in Microsoft's production big-data clusters . . . . . . . . . . . . . . . . 2131--2142 Panagiotis Antonopoulos and Peter Byrne and Wayne Chen and Cristian Diaconu and Raghavendra Thallam Kodandaramaih and Hanuma Kodavalla and Prashanth Purnananda and Adrian-Leonard Radu and Chaitanya Sreenivas Ravella and Girish Mittur Venkataramanappa Constant time recovery in Azure SQL database . . . . . . . . . . . . . . . . 2143--2154 Yuzhen Huang and Yingjie Shi and Zheng Zhong and Yihui Feng and James Cheng and Jiwei Li and Haochuan Fan and Chao Li and Tao Guan and Jingren Zhou Yugong: geo-distributed data and job placement at scale . . . . . . . . . . . 2155--2169 Junjay Tan and Thanaa Ghanem and Matthew Perron and Xiangyao Yu and Michael Stonebraker and David DeWitt and Marco Serafini and Ashraf Aboulnaga and Tim Kraska Choosing a cloud DBMS: architectures and tradeoffs . . . . . . . . . . . . . . . 2170--2182 Jingtian Zhang and Sai Wu and Zeyuan Tan and Gang Chen and Zhushi Cheng and Wei Cao and Yusong Gao and Xiaojie Feng S3: a scalable in-memory skip-list index for key--value store . . . . . . . . . . 2183--2194 Charles Masson and Jee E. Rim and Homin K. Lee DDSketch: a fast and fully-mergeable quantile sketch with relative-error guarantees . . . . . . . . . . . . . . . 2195--2205 Qiang Long and Wei Wang and Jinfu Deng and Song Liu and Wenhao Huang and Fangying Chen and Sifan Liu A distributed system for large-scale $n$-gram language models at Tencent . . 2206--2217 Kayhan Dursun and Carsten Binnig and Ugur Cetintemel and Garret Swart and Weiwei Gong A morsel-driven query execution engine for heterogeneous multi-cores . . . . . 2218--2229 Lei Cao and Wenbo Tao and Sungtae An and Jing Jin and Yizhou Yan and Xiaoyu Liu and Wendong Ge and Adam Sah and Leilani Battle and Jimeng Sun and Remco Chang and Brandon Westover and Samuel Madden and Michael Stonebraker Smile: a system to support machine learning on EEG data at scale . . . . . 2230--2241 Alastair Green and Paolo Guagliardo and Leonid Libkin and Tobias Lindaaker and Victor Marsault and Stefan Plantikow and Martin Schuster and Petra Selmer and Hannes Voigt Updating graph databases with Cypher . . 2242--2254 Asya Kamsky Adapting TPC-C benchmark to measure performance of multi-document transactions in MongoDB . . . . . . . . 2254--2262 Feifei Li Cloud-native database systems at Alibaba: opportunities and challenges 2263--2272 Alexander Boehm In-memory for the masses: enabling cost-efficient deployments of in-memory data management platforms for business applications . . . . . . . . . . . . . . 2273--2275 Murtadha Al Hubail and Ali Alsuliman and Michael Blow and Michael Carey and Dmitry Lychagin and Ian Maxon and Till Westmann Couchbase analytics: NoETL for scalable NoSQL data analysis . . . . . . . . . . 2275--2286 Adrian Colyer Performance in the spotlight . . . . . . 2287--2289 Azza Abouzied and Daniel J. Abadi and Kamil Bajda-Pawlikowski and Avi Silberschatz Integration of large-scale data processing systems and traditional parallel database technology . . . . . . 2290--2299 Brian F. Cooper and P. P. S. Narayan and Raghu Ramakrishnan and Utkarsh Srivastava and Adam Silberstein and Philip Bohannon and Hans-Arno Jacobsen and Nick Puz and Daniel Weaver and Ramana Yerneni PNUTS to Sherpa: lessons from Yahoo!'s cloud database . . . . . . . . . . . . . 2300--2307 Wang-Chiew Tan What I probably did right and what I think I could have done better . . . . . 2308--2308 Aditya Parameswaran Enabling data science for the majority 2309--2322 Theodoras Rekatsinas and Sudeepa Roy and Manasi Vartak and Ce Zhang and Neoklis Polyzotis Opportunities for data management research in the era of horizontal AI/ML 2323--2323
Claude Barthels and Ingo Müller and Konstantin Taranov and Gustavo Alonso and Torsten Hoefler Strong consistency is not hard to get: two-phase locking and two-phase commit on thousands of cores . . . . . . . . . 2325--2338 Ziheng Wei and Uwe Leck and Sebastian Link Discovery and ranking of embedded uniqueness constraints . . . . . . . . . 2339--2352 Lingyang Chu and Yanyan Zhang and Yu Yang and Lanjun Wang and Jian Pei Online density bursting subgraph detection from temporal graphs . . . . . 2353--2365 Pedro Holanda and Mark Raasveldt and Stefan Manegold and Hannes Mühleisen Progressive indexes: indexing for interactive data analysis . . . . . . . 2366--2378 Masatoshi Hanai and Toyotaro Suzumura and Wen Jun Tan and Elvis Liu and Georgios Theodoropoulos and Wentong Cai Distributed edge partitioning for trillion-edge graphs . . . . . . . . . . 2379--2392 Manos Athanassoulis and Kenneth S. Bògh and Stratos Idreos Optimal column layout for hybrid workloads . . . . . . . . . . . . . . . 2393--2407 Stavros Sintos and Pankaj K. Agarwal and Jun Yang Selecting data to clean for fact checking: minimizing uncertainty vs. maximizing surprise . . . . . . . . . . 2408--2421
Shuchi Chawla and Shaleen Deep and Paraschos Koutrisw and Yifeng Teng Revenue maximization for query pricing 1--14 Jieming Shi and Renchi Yang and Tianyuan Jin and Xiaokui Xiao and Yin Yang Realtime top-$k$ Personalized PageRank over large graphs on GPUs . . . . . . . 15--28 Sheng Wang and Zhifeng Bao and J. Shane Culpepper and Timos Sellis and Xiaolin Qin Fast large-scale trajectory clustering 29--42 Herodotos Herodotou and Elena Kakoulli Automating distributed tiered storage management in cluster computing . . . . 43--56 Jinho Jung and Hong Hu and Joy Arulraj and Taesoo Kim and Woonhak Kang APOLLO: automatic detection and diagnosis of performance regressions in database systems . . . . . . . . . . . . 57--70 Muhsen Owaida and Gustavo Alonso and Laura Fogliarini and Anthony Hock-Koon and Pierre-Etienne Melet Lowering the latency of data processing pipelines through FPGA based hardware acceleration . . . . . . . . . . . . . . 71--85
Shaofeng Cai and Gang Chen and Beng Chin Ooi and Jinyang Gao Model slicing for supporting complex analytics with elastic inference cost and resource constraints . . . . . . . . 86--99 Maurice Herlihy and Barbara Liskov and Liuba Shrira Cross-chain deals and adversarial commerce . . . . . . . . . . . . . . . . 100--113 Vasileios Zois and Vassilis J. Tsotras and Walid A. Najjar Efficient main-memory top-$k$ selection for multicore architectures . . . . . . 114--127 Jan Böttcher and Viktor Leis and Thomas Neumann and Alfons Kemper Scalable garbage collection for in-memory MVCC systems . . . . . . . . . 128--141 Bohua Yang and Dong Wen and Lu Qin and Ying Zhang and Xubo Wang and Xuemin Lin Fully dynamic depth-first search in directed graphs . . . . . . . . . . . . 142--154 Chenhao Ma and Reynold Cheng and Laks V. S. Lakshmanan and Tobias Grubenmann and Yixiang Fang and Xiaodong Li LINC: a motif counting algorithm for uncertain graphs . . . . . . . . . . . . 155--168 Saba Eskandarian and Matei Zaharia ObliDB: oblivious query processing for secure databases . . . . . . . . . . . . 169--183 Chang Ge and Ihab F. Ilyas and Florian Kerschbaum Secure multi-party functional dependency discovery . . . . . . . . . . . . . . . 184--196 Minmei Wang and Mingxun Zhou and Shouqian Shi and Chen Qian Vacuum filters: more space-efficient and faster replacement for Bloom and cuckoo filters . . . . . . . . . . . . . . . . 197--210 Yihan Sun and Guy E. Blelloch and Wan Shen Lim and Andrew Pavlo On supporting efficient snapshot isolation for hybrid workloads with multi-versioned indexes . . . . . . . . 211--225
Zhuhe Fang and Beilei Zheng and Chuliang Weng Interleaved multi-vectorizing . . . . . 226--238 Suraj Shetiya and Abolfazl Asudeh and Sadia Ahmed and Gautam Das A unified optimization algorithm for solving ``regret-minimizing representative'' problems . . . . . . . 239--251 Srikanth Kandula and Laurel Orr and Surajit Chaudhuri Pushing data-induced predicates through joins in big-data clusters . . . . . . . 252--265 Eduardo H. M. Pena and Eduardo C. de Almeida and Felix Naumann Discovery of approximate (and exact) denial constraints . . . . . . . . . . . 266--278 Zongheng Yang and Eric Liang and Amog Kamsetty and Chenggang Wu and Yan Duan and Xi Chen and Pieter Abbeel and Joseph M. Hellerstein and Sanjay Krishnan and Ion Stoica Deep unsupervised cardinality estimation 279--292 Zeyu Ding and Yuxin Wang and Danfeng Zhang and Daniel Kifer Free gap information from the differentially private sparse vector and noisy max mechanisms . . . . . . . . . . 293--306 Ji Sun and Guoliang Li An end-to-end learning-based cost estimator . . . . . . . . . . . . . . . 307--319 Yuxiang Zeng and Yongxin Tong and Lei Chen Last-mile delivery made practical: an efficient route planning framework with theoretical guarantees . . . . . . . . . 320--333 Tiago R. Kepe and Eduardo C. de Almeida and Marco A. Z. Alves Database processing-in-memory: an experimental study . . . . . . . . . . . 334--347 Jyoti Leeka and Kaushik Rajan Incorporating super-operators in big-data query optimizers . . . . . . . 348--361 Conggai Li and Fan Zhang and Ying Zhang and Lu Qin and Wenjie Zhang and Xuemin Lin Efficient progressive minimum $k$-core search . . . . . . . . . . . . . . . . . 362--375 Hang Zhu and Zhihao Bai and Jialin Li and Ellis Michael and Dan R. K. Ports and Ion Stoica and Xin Jin Harmonia: near-linear scalability for replicated storage with in-network conflict detection . . . . . . . . . . . 376--389 Brett Walenz and Stavros Sintos and Sudeepa Roy and Jun Yang Learning to sample: counting with complex queries . . . . . . . . . . . . 390--402 Karima Echihabi and Kostas Zoumpatianos and Themis Palpanas and Houda Benbrahim Return of the Lernaean Hydra: experimental evaluation of data series approximate similarity search . . . . . 403--420
Xinjing Zhou and Lidan Shou and Ke Chen and Wei Hu and Gang Chen DPTree: differential indexing for persistent memory . . . . . . . . . . . 421--434 Jeyhun Karimov and Tilmann Rabl and Volker Markl AJoin: ad-hoc stream joins at scale . . 435--448 Chen Luo and Michael J. Carey On performance stability in LSM-based storage systems . . . . . . . . . . . . 449--462 You Peng and Ying Zhang and Xuemin Lin and Wenjie Zhang and Lu Qin and Jingren Zhou Towards bridging theory and practice: hop-constrained $s$--$t$ simple path enumeration . . . . . . . . . . . . . . 463--476 Yuhao Zhang and Arun Kumar Panorama: a data system for unbounded vocabulary querying over video . . . . . 477--491 Kartik Lakhotia and Rajgopal Kannan and Qing Dong and Viktor Prasanna Planting trees for scalable and efficient canonical hub labeling . . . . 492--505 Preethi Lahoti and Krishna P. Gummadi and Gerhard Weikum Operationalizing individual fairness with pairwise fair representations . . . 506--518 Aarati Kakaraparthy and Jignesh M. Patel and Kwanghyun Park and Brian P. Kroth Optimizing databases by learning hidden parameters of solid state drives . . . . 519--532 Daniel Kang and Peter Bailis and Matei Zaharia BlazeIt: optimizing declarative aggregation and limit queries for neural network-based video analytics . . . . . 533--546 Dawei Huang and Dong Young Yoon and Seth Pettie and Barzan Mozafari Joins on samples: a theoretical guide for practitioners . . . . . . . . . . . 547--560 Georgios Karagiannis and Immanuel Trummer and Saehan Jo and Shubham Khandelwal and Xuezhi Wang and Cong Yu Mining an ``anti-knowledge base'' from Wikipedia updates with applications to fact checking and beyond . . . . . . . . 561--573 Lucas Lersch and Xiangpeng Hao and Ismail Oukid and Tianzheng Wang and Thomas Willhalm Evaluating persistent memory range indexes . . . . . . . . . . . . . . . . 574--587
Jonathan Goldstein and Ahmed Abdelhamid and Mike Barnett and Sebastian Burckhardt and Badrish Chandramouli and Darren Gehring and Niel Lebeck and Christopher Meiklejohn and Umar Farooq Minhas and Ryan Newton and Rahee Ghosh Peshawaria and Tal Zaccai and Irene Zhang A.M.B.R.O.S.I.A: providing performant virtual resiliency for distributed applications . . . . . . . . . . . . . . 588--601 Dian Ouyang and Long Yuan and Lu Qin and Lijun Chang and Ying Zhang and Xuemin Lin Efficient shortest path index maintenance on dynamic road networks with theoretical guarantees . . . . . . 602--615 Elias Stehle and Hans-Arno Jacobsen ParPaRaw: massively parallel parsing of delimiter-separated raw data . . . . . . 616--628 Yihe Huang and William Qian and Eddie Kohler and Barbara Liskov and Liuba Shrira Opportunities for optimism in contended main-memory multicore transactions . . . 629--642 Bolong Zheng and Xi Zhao and Lianggui Weng and Nguyen Quoc Viet Hung and Hang Liu and Christian S. Jensen PM-LSH: a fast and accurate LSH framework for high-dimensional approximate NN search . . . . . . . . . 643--655 Yahui Sun and Jun Luo and Theodoros Lappas and Xiaokui Xiao and Bin Cui Hunting multiple bumps in graphs . . . . 656--669 Renchi Yang and Jieming Shi and Xiaokui Xiao and Yin Yang and Sourav S. Bhowmick Homogeneous network embedding for massive graphs via reweighted personalized PageRank . . . . . . . . . 670--683 Abdulhakim Qahtan and Nan Tang and Mourad Ouzzani and Yang Cao and Michael Stonebraker Pattern functional dependencies for data cleaning . . . . . . . . . . . . . . . . 684--697 Joyce Jiyoung Whang and Rundong Du and Sangwon Jung and Geon Lee and Barry Drake and Qingqing Liu and Seonggoo Kang and Haesun Park MEGA: multi-view semi-supervised clustering of hypergraphs . . . . . . . 698--711 Ioannis Koumarelas and Thorsten Papenbrock and Felix Naumann MDedup: duplicate detection with matching dependencies . . . . . . . . . 712--725 Van-Dang Tran and Hiroyuki Kato and Zhenjiang Hu Programmable view update strategies on relations . . . . . . . . . . . . . . . 726--739 Avinash Kumar and Zuozhi Wang and Shengquan Ni and Chen Li Amber: a debuggable dataflow system based on the actor model . . . . . . . . 740--753 Filippo Schiavio and Daniele Bonetta and Walter Binder Dynamic speculative optimizations for SQL compilation in Apache Spark . . . . 754--767 Mourad Khayati and Alberto Lerner and Zakhar Tymchenko and Philippe Cudré-Mauroux Mind the gap: an experimental evaluation of imputation of missing values techniques in time series . . . . . . . 768--782
Mohammad Hasanzadeh Mofrad and Rami Melhem and Yousuf Ahmad and Mohammad Hammoud Graphite: a NUMA-aware HPC system for graph analytics based on a new MPI * X parallelism model . . . . . . . . . . . 783--797 Naama Boer and Daniel Deutch and Nave Frost and Tova Milo Personal insights for altering decisions of tree-based ensembles over time . . . 798--811 You Peng and Ying Zhang and Xuemin Lin and Lu Qin and Wenjie Zhang Answering billion-scale label-constrained reachability queries within microsecond . . . . . . . . . . . 812--825 Ruihong Huang and Shaoxu Song and Yunsu Lee and Jungho Park and Soo-Hyung Kim and Sungmin Yi Effective and efficient retrieval of structured entities . . . . . . . . . . 826--839 Utku Sirin and Anastasia Ailamaki Micro-architectural analysis of OLAP: limitations and opportunities . . . . . 840--853 Yixiang Fang and Yixing Yang and Wenjie Zhang and Xuemin Lin and Xin Cao Effective and efficient community search over large heterogeneous information networks . . . . . . . . . . . . . . . . 854--867 Suyash Gupta and Sajjad Rahnama and Jelle Hellings and Mohammad Sadoghi ResilientDB: global scale resilient blockchain fabric . . . . . . . . . . . 868--883 Henning Funke and Jens Teubner Data-parallel query processing on non-uniform data . . . . . . . . . . . . 884--897 Zonghao Feng and Qiong Luo Evaluating memory-hard proof-of-work algorithms on three processors . . . . . 898--911 Seokki Lee and Bertram Ludäscher and Boris Glavic Approximate summaries for why and why-not provenance . . . . . . . . . . . 912--924 Hao Jiang and Chunwei Liu and Qi Jin and John Paparrizos and Aaron J. Elmore PIDS: attribute decomposition for improved compression and query performance in columnar storage . . . . 925--938 Abolfazl Asudeh and H. V. Jagadish and You (Will) Wu and Cong Yu On detecting cherry-picked trendlines 939--952
Masayo Ota and Heiko Müller and Juliana Freire and Divesh Srivastava Data-driven domain discovery for structured datasets . . . . . . . . . . 953--967 Jieming Shi and Tianyuan Jin and Renchi Yang and Xiaokui Xiao and Yin Yang Realtime index-free single source SimRank processing on web-scale graphs 966--980 Jiachuan Wang and Peng Cheng and Libin Zheng and Chao Feng and Lei Chen and Xuemin Lin and Zheng Wang Demand-aware route planning for shared mobility services . . . . . . . . . . . 979--991 Benjamin Hilprecht and Andreas Schmidt and Moritz Kulessa and Alejandro Molina and Kristian Kersting and Carsten Binnig DeepDB: learn from data, not from queries! . . . . . . . . . . . . . . . . 992--1005 Yuepeng Wang and Rushi Shah and Abby Criswell and Rong Pan and Isil Dillig Data migration using datalog program synthesis . . . . . . . . . . . . . . . 1006--1019 Xiaowei Zhu and Guanyu Feng and Marco Serafini and Xiaosong Ma and Jiping Yu and Lei Xie and Ashraf Aboulnaga and Wenguang Chen LiveGraph: a transactional graph storage system with purely sequential adjacency list scans . . . . . . . . . . . . . . . 1020--1034 Xueling Lin and Haoyang Li and Hao Xin and Zijian Li and Lei Chen KBPearl: a knowledge base population system supported by joint entity and relation linking . . . . . . . . . . . . 1035--1049 Tianyi Li and Ruikai Huang and Lu Chen and Christian S. Jensen and Torben Bach Pedersen Compression of uncertain trajectories in road networks . . . . . . . . . . . . . 1050--1063 Supreeth Shastri and Vinay Banakar and Melissa Wasserman and Arun Kumar and Vijay Chidambaram Understanding and benchmarking the impact of GDPR on database systems . . . 1064--1077 Jihang Liu and Shimin Chen and Lujun Wang LB+Trees: optimizing persistent index performance on $3$DXPoint memory . . . . 1078--1090 Lucas Lersch and Ivan Schreter and Ismail Oukid and Wolfgang Lehner Enabling low tail latency on multicore key-value stores . . . . . . . . . . . . 1091--1104 Chunbin Lin and Etienne Boursier and Yannis Papakonstantinou Plato: approximate analytics over compressed time series with tight deterministic error guarantees . . . . . 1105--1118 Prasun Gera and Hyojong Kim and Piyush Sao and Hyesoon Kim and David Bader Traversing large graphs on GPUs with unified memory . . . . . . . . . . . . . 1119--1133 Haoyue Ping and Julia Stoyanovich and Benny Kimelfeld Supporting hard queries over probabilistic preferences . . . . . . . 1134--1146
Baotong Lu and Xiangpeng Hao and Tianzheng Wang and Eric Lo Dash: scalable hashing on persistent memory . . . . . . . . . . . . . . . . . 1147--1161 Paolo Ferragina and Giorgio Vinciguerra The PGM-index: a fully-dynamic compressed learned index with provable worst-case bounds . . . . . . . . . . . 1162--1175 Minghua Ma and Zheng Yin and Shenglin Zhang and Sheng Wang and Christopher Zheng and Xinhao Jiang and Hanwen Hu and Cheng Luo and Yilin Li and Nengjun Qiu and Feifei Li and Changcheng Chen and Dan Pei Diagnosing root causes of intermittent slow queries in cloud databases . . . . 1176--1189 Xuhao Chen and Roshan Dathathri and Gurbinder Gill and Keshav Pingali Pangolin: an efficient and flexible graph mining system on CPU and GPU . . . 1190--1205 Markus Dreseler and Martin Boissier and Tilmann Rabl and Matthias Uflacker Quantifying TPC-H choke points and their optimizations . . . . . . . . . . . . . 1206--1220 Yuanbing Li and Xian Wu and Yifei Jin and Jian Li and Guoliang Li Efficient algorithms for crowd-aided categorization . . . . . . . . . . . . . 1221--1233 Shaowei Wang and Yuqiu Qian and Jiachun Du and Wei Yang and Liusheng Huang and Hongli Xu Set-valued data publication with local privacy: tight error bounds and efficient mechanisms . . . . . . . . . . 1234--1247 Leonidas Fegaras and Hasanuzzaman Noor Translation of array-based loops to distributed data-parallel programs . . . 1248--1260 Wenfei Fan and Muyang Liu and Chao Tian and Ruiqi Xu and Jingren Zhou Incrementalization of graph partitioning algorithms . . . . . . . . . . . . . . . 1261--1274 Shao-Heng Ko and Hsu-Chao Lai and Hong-Han Shuai and Wang-Chien Lee and Philip S. Yu and De-Nian Yang Optimizing item and subgroup configurations for social-aware VR shopping . . . . . . . . . . . . . . . . 1275--1289 Savvas Savvides and Darshika Khandelwal and Patrick Eugster Efficient confidentiality-preserving data analytics over symmetrically encrypted datasets . . . . . . . . . . . 1290--1303 Gurbinder Gill and Roshan Dathathri and Loc Hoang and Ramesh Peri and Keshav Pingali Single machine graph analytics on massive datasets using Intel Optane DC Persistent Memory . . . . . . . . . . . 1304--1318
Victor Zakhary and Divyakant Agrawal and Amr El Abbadi Atomic commitment across blockchains . . 1319--1331 Ajit Mathew and Changwoo Min HydraList: a scalable in-memory index using asynchronous updates and partial replication . . . . . . . . . . . . . . 1332--1345 A. Jesse Jiryu Davis and Max Hirschhorn and Judah Schvimer Extreme modelling in practice . . . . . 1346--1358 Bingqing Lyu and Lu Qin and Xuemin Lin and Ying Zhang and Zhengping Qian and Jingren Zhou Maximum biclique search at billion scale 1359--1372 Nadiia Chepurko and Ryan Marcus and Emanuel Zgraggen and Raul Castro Fernandez and Tim Kraska and David Karger ARDA: automatic relational data augmentation for machine learning . . . 1373--1387 Wail Y. Alkowaileet and Sattam Alsubaiee and Michael J. Carey An LSM-based tuple compaction framework for Apache AsterixDB . . . . . . . . . . 1388--1400 Roee Shraga and Avigdor Gal and Haggai Roitman ADnEV: cross-domain schema matching using deep similarity matrix adjustment and evaluation . . . . . . . . . . . . . 1401--1415 Xuanhe Zhou and Ji Sun and Guoliang Li and Jianhua Feng Query performance prediction for concurrent queries using graph embedding 1416--1428 Ahmed Alquraan and Alex Kogan and Virendra J. Marathe and Samer Al-Kiswany Scalable, near-zero loss disaster recovery for distributed data stores . . 1429--1442 Kejing Lu and Hongya Wang and Wei Wang and Mineichi Kudo VHP: approximate nearest neighbor search via virtual hypersphere partitioning . . 1443--1455 Hyunjoon Kim and Seunghwan Min and Kunsoo Park and Xuemin Lin and Seok-Hee Hong and Wook-Shin Han IDAR: fast supergraph search using DAG integration . . . . . . . . . . . . . . 1456--1468 Mariia Seleznova and Behrooz Omidvar-Tehrani and Sihem Amer-Yahia and Eric Simon Guided exploration of user groups . . . 1469--1482 Long Gong and Huayi Wang and Mitsunori Ogihara and Jun Xu iDEC: indexable distance estimating codes for approximate nearest neighbor search . . . . . . . . . . . . . . . . . 1483--1497 Song Bian and Qintian Guo and Sibo Wang and Jeffrey Xu Yu Efficient algorithms for budgeted influence maximization on massive social networks . . . . . . . . . . . . . . . . 1498--1510 Arneish Prateek and Arijit Khan and Akshit Goyal and Sayan Ranu Mining Top-$k$ pairs of correlated subgraphs in a large network . . . . . . 1511--1524 Yehonatan Buchnik and Roy Friedman FireLedger: a high throughput blockchain consensus protocol . . . . . . . . . . . 1525--1539 Kefei Wang and Jian Liu and Feng Chen Put an elephant into a fridge: optimizing cache efficiency for in-memory key--value stores . . . . . . 1540--1554 Simon Aagaard Pedersen and Bin Yang and Christian S. Jensen Anytime stochastic routing with hybrid learning . . . . . . . . . . . . . . . . 1555--1567 Qizhen Zhang and Yifan Cai and Xinyi Chen and Sebastian Angel and Ang Chen and Vincent Liu and Boon Thau Loo Understanding the effect of data center resource disaggregation on production DBMSs . . . . . . . . . . . . . . . . . 1568--1581 Nikolaos Tziavelis and Deepak Ajwani and Wolfgang Gatterbauer and Mirek Riedewald and Xiaofeng Yang Optimal algorithms for ranked enumeration of answers to full conjunctive queries . . . . . . . . . . 1582--1597 Laxman Dhulipala and Charles McGuffey and Hongbo Kang and Yan Gu and Guy E. Blelloch and Phillip B. Gibbons and Julian Shun Sage: parallel semi-asymmetric graph algorithms for NVRAMs . . . . . . . . . 1598--1613
Yuqing Zhu and Jing Tang and Xueyan Tang Pricing influential nodes in online social networks . . . . . . . . . . . . 1614--1627 Bintao Sun and Maximilien Danisch and T-H. Hubert Chan and Mauro Sozio KClist++: a simple algorithm for finding $k$-clique densest subgraphs in large graphs . . . . . . . . . . . . . . . . . 1628--1640 Kevin Wellenzohn and Michael H. Böhlen and Sven Helmer Dynamic interleaving of content and structure for robust indexing of semi-structured hierarchical data . . . 1641--1653 Shubhangi Agarwal and Sourav Dutta and Arnab Bhattacharya ChiSeL: graph similarity search using chi-squared statistics in large probabilistic graphs . . . . . . . . . . 1654--1668 Zijing Tan and Ai Ran and Shuai Ma and Sheng Qin Fast incremental discovery of pointwise order dependencies . . . . . . . . . . . 1669--1681 Ester Livshits and Alireza Heidari and Ihab F. Ilyas and Benny Kimelfeld Approximate denial constraints . . . . . 1682--1695 Robin Rehrmann and Carsten Binnig and Alexander Böhm and Kihong Kim and Wolfgang Lehner Sharing opportunities for OLTP workloads in different isolation levels . . . . . 1696--1708 Stefan Neumann and Pauli Miettinen Biclustering and Boolean matrix factorization in data streams . . . . . 1709--1722 Xun Jian and Yue Wang and Lei Chen Effective and efficient relational community detection and search in large dynamic heterogeneous information networks . . . . . . . . . . . . . . . . 1723--1736 Hyeonji Kim and Byeong-Hoon So and Wook-Shin Han and Hongrae Lee Natural language to SQL: where are we today? . . . . . . . . . . . . . . . . . 1737--1750 Yulin Che and Zhuohang Lai and Shixuan Sun and Yue Wang and Qiong Luo Accelerating truss decomposition on heterogeneous processors . . . . . . . . 1751--1764 Rohan Mukherjee and Swarat Chaudhuri and Chris Jermaine Searching a database of source codes using contextualized code search . . . . 1765--1778 Yan Li and Tingjian Ge and Cindy Chen Data stream event prediction based on timing knowledge and state transitions 1779--1792 Frank McSherry and Andrea Lattuada and Malte Schwarzkopf and Timothy Roscoe Shared arrangements: practical inter-query sharing for streaming dataflows . . . . . . . . . . . . . . . 1793--1806
Peeyush Gupta and Michael J. Carey and Sharad Mehrotra and oberto Yus SmartBench: a benchmark for data management in smart spaces . . . . . . . 1807--1820 Paul Boniol and Themis Palpanas Series2Graph: graph-based subsequence anomaly detection for time series . . . 1821--1834 Dan Zhang and Madelon Hulsebos and Yoshihiko Suhara and Çagatay Demiralp and Jinfeng Li and Wang-Chiew Tan Sato: contextual semantic type detection in tables . . . . . . . . . . . . . . . 1835--1848 Qijian He and Wei Yang and Bingren Chen and Yangyang Geng and Liusheng Huang TransNet: training privacy-preserving neural network over transformed layer 1849--1862 Wenfei Fan and Ruochun Jin and Muyang Liu and Ping Lu and Chao Tian and Jingren Zhou Capturing associations in graphs . . . . 1863--1876 Alexander Renz-Wieland and Rainer Gemulla and Steffen Zeuch and Volker Markl Dynamic parameter allocation in parameter servers . . . . . . . . . . . 1877--1890 Michael Freitag and Maximilian Bandle and Tobias Schmidt and Alfons Kemper and Thomas Neumann Adopting worst-case optimal joins in relational database systems . . . . . . 1891--1904 Ryan McKenna and Raj Kumar Maity and Arya Mazumdar and Gerome Miklau A workload-adaptive mechanism for linear queries under local differential privacy 1905--1918 Yisu Remy Wang and Shana Hutchison and Jonathan Leang and Bill Howe and Dan Suciu SPORES: sum-product optimization via relational equality saturation for large scale linear algebra . . . . . . . . . . 1919--1932 Raul Castro Fernandez and Pranav Subramaniam and Michael J. Franklin Data market platforms: trading data assets to solve data problems . . . . . 1933--1947 Mohammad Mahdavi and Ziawasch Abedjan Baran: effective error correction via a unified context representation and transfer learning . . . . . . . . . . . 1948--1961 Ju Fan and Junyou Chen and Tongyu Liu and Yuwei Shen and Guoliang Li and Xiaoyong Du Relational data synthesis using generative adversarial networks: a design space exploration . . . . . . . . 1962--1975 Lei Yang and Hong Wu and Tieying Zhang and Xuntao Cheng and Feifei Li and Lei Zou and Yujie Wang and Rongyao Chen and Jianying Wang and Gui Huang Leaper: a learned prefetcher for cache invalidation in LSM-tree based storage engines . . . . . . . . . . . . . . . . 1976--1989 Daniel Kang and Edward Gan and Peter Bailis and Tatsunori Hashimoto and Matei Zaharia Approximate selection with guarantees using proxies . . . . . . . . . . . . . 1990--2003 Minji Kang and Soyee Choi and Gihwan Oh and Sang-Won Lee 2R: efficiently isolating cold pages in flash storages . . . . . . . . . . . . . 2004--2017 Bahar Ghadiri Bashardoost and Renée J. Miller and Kelly Lyons and Fatemeh Nargesian Knowledge translation . . . . . . . . . 2018--2032 Devin Petersohn and Stephen Macke and Doris Xin and William Ma and Doris Lee and Xiangxi Mo and Joseph E. Gonzalez and Joseph M. Hellerstein and Anthony D. Joseph and Aditya Parameswaran Towards scalable dataframe systems . . . 2033--2046 Yi Lu and Xiangyao Yu and Lei Cao and Samuel Madden Aria: a fast and practical deterministic OLTP database . . . . . . . . . . . . . 2047--2060 Dongjing Miao and Zhipeng Cai and Jianzhong Li and Xiangyu Gao and Xianmin Liu The computation of optimal subset repairs . . . . . . . . . . . . . . . . 2061--2074 Christina Christodoulakis and Eric B. Munson and Moshe Gabel and Angela Demke Brown and Renée J. Miller Pytheas: pattern-based table discovery in CSV files . . . . . . . . . . . . . . 2075--2089 Yuncheng Wu and Shaofeng Cai and Xiaokui Xiao and Gang Chen and Beng Chin Ooi Privacy preserving vertical federated learning for tree-based models . . . . . 2090--2103 Ahmed Al-Baghdadi and Xiang Lian Topic-based community search over spatial-social networks . . . . . . . . 2104--2117 Manuel Fritz and Michael Behringer and Holger Schwarz LOG-Means: efficiently estimating the number of clusters in large datasets . . 2118--2131 Simeon Krastnikov and Florian Kerschbaum and Douglas Stebila Efficient oblivious database joins . . . 2132--2145 Ousmane Issa and Angela Bonifati and Farouk Toumani Evaluating top-$k$ queries with inconsistency degrees . . . . . . . . . 2146--2158 Supun Nakandala and Yuhao Zhang and Arun Kumar Cerebro: a data system for optimized deep learning model selection . . . . . 2159--2173 Edward Gan and Peter Bailis and Moses Charikar CoopStore: optimizing precomputed summaries for aggregation . . . . . . . 2174--2187 Satoshi Koide and Chuan Xiao and Yoshiharu Ishikawa Fast subtrajectory similarity search in road networks under weighted edit distance constraints . . . . . . . . . . 2188--2201 Yu Liu and Lei Zou and Qian Ge and Zhewei Wei SimTab: accuracy-guaranteed SimRank queries through tighter confidence bounds and multi-armed bandits . . . . . 2202--2214 Anshuman Dutt and Chi Wang and Vivek Narasayya and Surajit Chaudhuri Efficiently approximating selectivity functions using low overhead regression models . . . . . . . . . . . . . . . . . 2215--2228 Yin Lin and Yifan Guan and Abolfazl Asudeh and H. V. Jagadish Identifying insufficient data coverage in databases with multiple relations . . 2229--2242 Lingxiao Li and Muhammad Aamir Cheema and Mohammed Eunus Ali and Hua Lu and David Taniar Continuously monitoring alternative shortest paths on road networks . . . . 2243--2255 Geon Lee and Jihoon Ko and Kijung Shin Hypergraph motifs: concepts, algorithms, and discoveries . . . . . . . . . . . . 2256--2269 Johann Birnick and Thomas Bläsius and Tobias Friedrich and Felix Naumann and Thorsten Papenbrock and Martin Schirneck Hitting set enumeration with partial information for unique column combination discovery . . . . . . . . . 2270--2283 Yue Chen and Zhida Chen and Gao Cong and Ahmed R. Mahmood and Walid G. Aref SSTD: a distributed system on streaming spatio-textual data . . . . . . . . . . 2284--2296 Haneen Mohammed and Ziyun Wei and Eugene Wu and Ravi Netravali Continuous prefetch for interactive data applications . . . . . . . . . . . . . . 2297--2311 Zheng Wang and Cheng Long and Gao Cong and Yiding Liu Efficient and effective similar subtrajectory search with deep reinforcement learning . . . . . . . . . 2312--2325 Zequn Sun and Qingheng Zhang and Wei Hu and Chengming Wang and Muhao Chen and Farahnaz Akrami and Chengkai Li A benchmarking study of embedding-based entity alignment for knowledge graphs 2326--2340 Jianzhong Qi and Guanli Liu and Christian S. Jensen and Lars Kulik Effectively learning spatial indices . . 2341--2354 Qiyu Liu and Libin Zheng and Yanyan Shen and Lei Chen Stable learned bloom filters for data streams . . . . . . . . . . . . . . . . 2355--2367 Zhongjun Jin and Yeye He and Surajit Chauduri Auto-transform: learning-to-transform by patterns . . . . . . . . . . . . . . . . 2368--2381 Jan Kossmann and Stefan Halfpap and Marcel Jankrift and Rainer Schlosser Magic mirror in my hand, which is the best in the land?: an experimental evaluation of index selection algorithms 2382--2395 Patrick Damme and Annett Ungethüm and Johannes Pietrzyk and Alexander Krause and Dirk Habich and Wolfgang Lehner MorphStore: analytical query engine with a holistic compression-enabled processing model . . . . . . . . . . . . 2396--2410 Panos Parchas and Yonatan Naamad and Peter Van Bouwel and Christos Faloutsos and Michalis Petropoulos Fast and effective distribution-key recommendation for Amazon Redshift . . . 2411--2423 Primal Pappachan and Roberto Yus and Sharad Mehrotra and Johann-Christoph Freytag Sieve: a middleware approach to scalable access control for database management systems . . . . . . . . . . . . . . . . 2424--2437 Vikram Sreekanti and Chenggang Wu and Xiayue Charles Lin and Johann Schleier-Smith and Joseph E. Gonzalez and Joseph M. Hellerstein and Alexey Tumanov Cloudburst: stateful functions-as-a-service . . . . . . . . . 2438--2452 Abhijit Suprem and Joy Arulraj and Calton Pu and Joao Ferreira ODIN: automated drift detection and recovery in video analytics . . . . . . 2453--2465 Chengzhi Piao and Weiguo Zheng and Yu Rong and Hong Cheng Maximizing the reduction ability for near-maximum independent set computation 2466--2478 Zhao Chen and Peng Cheng and Lei Chen and Xuemin Lin and Cyrus Shahabi Fair task assignment in spatial crowdsourcing . . . . . . . . . . . . . 2479--2492 Hao Zhang and Jeffrey Xu Yu and Yikai Zhang and Kangfei Zhao and Hong Cheng Distributed subgraph counting: a general approach . . . . . . . . . . . . . . . . 2493--2507 Georgios Karagiannis and Mohammed Saeed and Paolo Papotti and Immanuel Trummer Scrutinizer: a mixed-initiative approach to large-scale, data-driven claim verification . . . . . . . . . . . . . . 2508--2521 Evgeny Krivosheev and Siarhei Bykau and Fabio Casati and Sunil Prabhakar Detecting and preventing confused labels in crowdsourced data . . . . . . . . . . 2522--2535 Rong-Hua Li and Sen Gao and Lu Qin and Guoren Wang and Weihua Yang and Jeffrey Xu Yu Ordering heuristics for $k$-clique listing . . . . . . . . . . . . . . . . 2536--2548 Jinfeng Li and Yuliang Li and Xiaolan Wang and Wang-Chiew Tan Deep or simple models for semantic tagging?: it depends on your data . . . 2549--2562 Muhammad Bilal and Marco Serafini and Marco Canini and Rodrigo Rodrigues Do the best cloud configurations grow on trees?: an experimental evaluation of black box algorithms for optimizing cloud workloads . . . . . . . . . . . . 2563--2575 Alexander Zhou and Yue Wang and Lei Chen Finding large diverse communities on networks: the edge maximum $ k*$-partite clique . . . . . . . . . . . . . . . . . 2576--2589 Albert van der Linde and João Leitão and Nuno Preguiça Practical client-side replication: weak consistency semantics for insecure settings . . . . . . . . . . . . . . . . 2590--2605 Kexin Rong and Yao Lu and Peter Bailis and Srikanth Kandula and Philip Levis Approximate partition selection for big-data workloads using summary statistics . . . . . . . . . . . . . . . 2606--2619 Christian Winter and Tobias Schmidt and Thomas Neumann and Alfons Kemper Meet me halfway: split maintenance of continuous views . . . . . . . . . . . . 2620--2633 Youmin Chen and Youyou Lu and Kedong Fang and Qing Wang and Jiwu Shu uTree: a persistent B+-tree with low tail latency . . . . . . . . . . . . . . 2634--2648 Peter Boncz and Thomas Neumann and Viktor Leis FSST: fast random access string compression . . . . . . . . . . . . . . 2649--2661 Lukas Vogel and Viktor Leis and Alexander van Renen and Thomas Neumann and Satoshi Imamura and Alfons Kemper Mosaic: a budget-conscious storage engine for relational database systems 2662--2675 Jovan Blanusa and Radu Stoica and Paolo Ienne and Kubilay Atasu Manycore clique enumeration with fast set intersections . . . . . . . . . . . 2676--2690 Johes Bater and Yongjoo Park and Xi He and Xiao Wang and Jennie Rogers SAQE: practical privacy-preserving approximate query processing for data federations . . . . . . . . . . . . . . 2691--2705 Caitlin Kuhlman and Elke Rundensteiner Rank aggregation algorithms for fair consensus . . . . . . . . . . . . . . . 2706--2719 Brad Glasbergen and Michael Abebe and Khuzaima Daudjee and Amit Levi Sentinel: universal analysis and insight for data systems . . . . . . . . . . . . 2720--2733 Jingzhi Fang and Yanyan Shen and Yue Wang and Lei Chen Optimizing DNN computation graph using graph substitutions . . . . . . . . . . 2734--2746 Jaydeep Sen and Chuan Lei and Abdul Quamar and Fatma Özcan and Vasilis Efthymiou and Ayushi Dalmia and Greg Stager and Ashish Mittal and Diptikalyan Saha and Karthik Sankaranarayanan ATHENA++: natural language querying for complex nested SQL queries . . . . . . . 2747--2759 Min Xu and Bolin Ding and Tianhao Wang and Jingren Zhou Collecting and analyzing data jointly from multiple services under local differential privacy . . . . . . . . . . 2760--2772 Yifan Gan and Xueyuan Ren and Drew Ripberger and Spyros Blanas and Yang Wang IsoDiff: debugging anomalies caused by weak isolation . . . . . . . . . . . . . 2773--2786 Marina Barsky and Jonathan Gabor and Mariano P. Consens and Alex Thomo Suffix rank: a new scalable algorithm for indexing large string collections 2787--2800
Yuxiang Zeng and Yongxin Tong and Yuguang Song and Lei Chen The simpler the better: an indexing approach for shared-route planning queries . . . . . . . . . . . . . . . . 3517--3530 Takayuki Tanabe and Takashi Hoshino and Hideyuki Kawashima and Osamu Tatebe An analysis of concurrency control protocols for in-memory databases with CCBench . . . . . . . . . . . . . . . . 3531--3544 Tianhao Wang and Bolin Ding and Min Xu and Zhicong Huang and Cheng Hong and Jingren Zhou and Ninghui Li and Somesh Jha Improving utility and security of the shuffler-based differential privacy . . 3545--3558 Andreas Kipf and Damian Chromejko and Alexander Hall and Peter Boncz and David G. Andersen Cuckoo index: a lightweight secondary index structure . . . . . . . . . . . . 3559--3572 Michael Abebe and Brad Glasbergen and Khuzaima Daudjee MorphoSys: automatic physical design metamorphosis for distributed database systems . . . . . . . . . . . . . . . . 3573--3587 Soheil Behnezhad and Laxman Dhulipala and Hossein Esfandiari and Jakub Lacki and Vahab Mirrokni and Warren Schudy Parallel graph algorithms in constant adaptive rounds: theory meets practice 3588--3602 Runhui Wang and Dong Deng DeltaPQ: lossless product quantization code compression for high dimensional similarity search . . . . . . . . . . . 3603--3616
Ryan Marcus and Andreas Kipf and Alexander van Renen and Mihail Stoian and Sanchit Misra and Alfons Kemper and Thomas Neumann and Tim Kraska Benchmarking learned indexes . . . . . . 1--13 Zuozhi Wang and Kai Zeng and Botong Huang and Wei Chen and Xiaozong Cui and Bo Wang and Ji Liu and Liya Fan and Dachuan Qu and Zhenyu Hou and Tao Guan and Chen Li and Jingren Zhou Tempura: a general cost-based optimizer framework for incremental data processing . . . . . . . . . . . . . . . 14--27 Geon Heo and Yuji Roh and Seonghyeon Hwang and Dayun Lee and Steven Euijong Whang Inspector gadget: a data programming-based labeling system for industrial images . . . . . . . . . . . 28--36 Renchi Yang and Jieming Shi and Xiaokui Xiao and Yin Yang and Juncheng Liu and Sourav S. Bhowmick Scaling attributed network embedding to massive graphs . . . . . . . . . . . . . 37--49 Yuliang Li and Jinfeng Li and Yoshihiko Suhara and AnHai Doan and Wang-Chiew Tan Deep entity matching with pre-trained language models . . . . . . . . . . . . 50--60 Zongheng Yang and Amog Kamsetty and Sifei Luan and Eric Liang and Yan Duan and Xi Chen and Ion Stoica NeuroCard: one cardinality estimator for all tables . . . . . . . . . . . . . . . 61--73
Jialin Ding and Vikram Nathan and Mohammad Alizadeh and Tim Kraska Tsunami: a learned multi-dimensional index for correlated data and skewed workloads . . . . . . . . . . . . . . . 74--86 Daniel Kang and Ankit Mathur and Teja Veeramacheneni and Peter Bailis and Matei Zaharia Jointly optimizing preprocessing and inference for DNN-based visual analytics 87--100 Prashanth Menon and Amadou Ngom and Lin Ma and Todd C. Mowry and Andrew Pavlo Permutable compiled queries: dynamically adapting compiled queries without recompiling . . . . . . . . . . . . . . 101--113 Seung Won Min and Vikram Sharma Mailthody and Zaid Qureshi and Jinjun Xiong and Eiman Ebrahimi and Wen-mei Hwu EMOGI: efficient memory-access for out-of-memory graph-traversal in GPUs 114--127 Yinda Zhang and Jinyang Li and Yutian Lei and Tong Yang and Zhetao Li and Gong Zhang and Bin Cui On-off sketch: a fast and accurate sketch on persistence . . . . . . . . . 128--140 Luan Tran and Min Y. Mun and Cyrus Shahabi Real-time distance-based outlier detection in data streams . . . . . . . 141--153 Olga Poppe and Tayo Amuneke and Dalitso Banda and Aritra De and Ari Green and Manon Knoertzer and Ehi Nosakhare and Karthik Rajendran and Deepak Shankargouda and Meina Wang and Alan Au and Carlo Curino and Qun Guo and Alekh Jindal and Ajay Kalhan and Morgan Oslake and Sonia Parchani and Vijay Ramani and Raj Sellappan and Saikat Sen and Sheetal Shrotri and Soundararajan Srinivasan and Ping Xia and Shize Xu and Alicia Yang and Yiwen Zhu Seagull: an infrastructure for load prediction and optimized resource allocation . . . . . . . . . . . . . . . 154--162 Sheng Wang and Yuan Sun and Zhifeng Bao On the efficiency of K-means clustering: evaluation, optimization, and algorithm selection . . . . . . . . . . . . . . . 163--175 Shixuan Sun and Xibo Sun and Yulin Che and Qiong Luo and Bingsheng He RapidMatch: a holistic approach to subgraph query processing . . . . . . . 176--188 Yu Xia and Xiangyao Yu and Andrew Pavlo and Srinivas Devadas Taurus: lightweight parallel logging for in-memory database management systems 189--201 Johns Paul and Bingsheng He and Shengliang Lu and Chiew Tong Lau Improving execution efficiency of just-in-time compilation based query processing on GPUs . . . . . . . . . . . 202--214 Shuang Wang and Hakan Ferhatosmanoglu PPQ-trajectory: spatio-temporal quantization for querying in large trajectory repositories . . . . . . . . 215--227 Xiao Hu and Shouzhuo Sun and Shweta Patwa and Debmalya Panigrahi and Sudeepa Roy Aggregated deletion propagation for counting conjunctive query answers . . . 228--240
Chen Luo and Michael J. Carey Breaking down memory walls: adaptive memory management in LSM-based storage systems . . . . . . . . . . . . . . . . 241--254 Bojan Karlas and Peng Li and Renzhi Wu and Nezihe Merve Gürel and Xu Chu and Wentao Wu and Ce Zhang Nearest neighbor classifiers over incomplete information: from certain answers to certain predictions . . . . . 255--267 Kyle Kingsbury and Peter Alvaro Elle: inferring isolation anomalies from experimental observations . . . . . . . 268--280 Martin Kiefer and Ilias Poulakis and Sebastian Breß and Volker Markl Scotch: generating FPGA-accelerators for sketching at line rate . . . . . . . . . 281--293 Mourad Khayati and Ines Arous and Zakhar Tymchenko and Philippe Cudré-Mauroux ORBITS: online recovery of missing values in multiple time series streams 294--306 Xiang Deng and Huan Sun and Alyssa Lees and You Wu and Cong Yu TURL: table understanding through representation learning . . . . . . . . 307--319 Long Guo and Lifeng Hua and Rongfei Jia and Fei Fang and Binqiang Zhao and Bin Cui EdgeDIPN: a unified deep intent prediction network deployed at the edge 320--328 Yiming Lin and Daokun Jiang and Roberto Yus and Georgios Bouloukakis and Andrew Chio and Sharad Mehrotra and Nalini Venkatasubramanian Locater: cleaning wifi connectivity datasets for semantic localization . . . 329--341 Hao Liu and Jindong Han and Yanjie Fu and Jingbo Zhou and Xinjiang Lu and Hui Xiong Multi-modal transportation recommendation with unified route representation learning . . . . . . . . 342--350 Yue Wang and Ruiqi Xu and Zonghao Feng and Yulin Che and Lei Chen and Qiong Luo and Rui Mao Disk: a distributed framework for single-source SimRank with accuracy guarantee . . . . . . . . . . . . . . . 351--363 Diego Didona and Nikolas Ioannou and Radu Stoica and Kornilios Kourtis Toward a better understanding and evaluation of tree structures on flash SSDs . . . . . . . . . . . . . . . . . . 364--377 Jianyu Yang and Tianhao Wang and Ninghui Li and Xiang Cheng and Sen Su Answering multi-dimensional range queries under local differential privacy 378--390 Dimitris Palyvos-Giannas and Bastian Havers and Marina Papatriantafilou and Vincenzo Gulisano Ananke: a streaming framework for live forward provenance . . . . . . . . . . . 391--403 Kartik Lakhotia and Rajgopal Kannan and Viktor Prasanna and Cesar A. F. De Rose Receipt: refine coarse-grained independent tasks for parallel tip decomposition of bipartite graphs . . . 404--417 Shaleen Deep and Anja Gruenheid and Paraschos Koutris and Jeffrey Naughton and Stratis Viglas Comprehensive and efficient workload compression . . . . . . . . . . . . . . 418--430 Yongjun He and Jiacheng Lu and Tianzheng Wang CoroBase: coroutine-oriented main-memory database engine . . . . . . . . . . . . 431--444 Jaclyn Smith and Michael Benedikt and Milos Nikolic and Amir Shaikhha Scalable querying of nested data . . . . 445--457
Long Gong and Ziheng Liu and Liang Liu and Jun Xu and Mitsunori Ogihara and Tong Yang Space- and computationally-efficient set reconciliation via parity bitmap sketch (PBS) . . . . . . . . . . . . . . . . . 458--470 Suraj Shetiya and Saravanan Thirumuruganathan and Nick Koudas and Gautam Das Astrid: accurate selectivity estimation for string predicates using deep learning . . . . . . . . . . . . . . . . 471--484 Nan Zheng and Zachary G. Ives Compact, tamper-resistant archival of fine-grained provenance . . . . . . . . 485--497 Ingo Müller and Ghislain Fourny and Stefan Irimescu and Can Berker Cikis and Gustavo Alonso Rumble: data independence for large messy data sets . . . . . . . . . . . . 498--506 Adriane Chapman and Paolo Missier and Giulia Simonelli and Riccardo Torlone Capturing and querying fine-grained provenance of preprocessing pipelines in data science . . . . . . . . . . . . . . 507--520 Victor A. E. Farias and Felipe T. Brito and Cheryl Flynn and Javam C. Machado and Subhabrata Majumdar and Divesh Srivastava Local dampening: differential privacy for non-numeric queries via local sensitivity . . . . . . . . . . . . . . 521--533 Tianyu Li and Matthew Butrovich and Amadou Ngom and Wan Shen Lim and Wes McKinney and Andrew Pavlo Mainlining databases: supporting fast transactional workloads on universal columnar data file formats . . . . . . . 534--546 Shengliang Lu and Bingsheng He and Yuchen Li and Hao Fu Accelerating exact constrained shortest paths on GPUs . . . . . . . . . . . . . 547--559 Songsong Mo and Zhifeng Bao and Ping Zhang and Zhiyong Peng Towards an efficient weighted random walk domination . . . . . . . . . . . . 560--572 Guimu Guo and Da Yan and M. Tamer Özsu and Zhe Jiang and Jalal Khalil Scalable mining of maximal quasi-cliques: an algorithm-system codesign approach . . . . . . . . . . . 573--585 Eleftherios Kokoris-Kogias and Enis Ceyhun Alp and Linus Gasser and Philipp Jovanovic and Ewa Syta and Bryan Ford CALYPSO: private data management for decentralized ledgers . . . . . . . . . 586--599 Kyle Deeds and Brian Hentschel and Stratos Idreos Stacked filters: learning to filter by structure . . . . . . . . . . . . . . . 600--612 Prithu Banerjee and Wei Chen and Laks V. S. Lakshmanan Maximizing social welfare in a competitive diffusion model . . . . . . 613--625 Shashank Gugnani and Arjun Kashyap and Xiaoyi Lu Understanding the idiosyncrasies of real persistent memory . . . . . . . . . . . 626--639 Abraham Gale and Amélie Marian Explaining monotonic ranking functions 640--652 Laxman Dhulipala and Changwan Hong and Julian Shun ConnectIt: a framework for static and incremental parallel graph connectivity algorithms . . . . . . . . . . . . . . . 653--667 Wissam Mammar Kouadri and Mourad Ouziri and Salima Benbernou and Karima Echihabi and Themis Palpanas and Iheb Ben Amor Quality of sentiment analysis tools: the reasons of inconsistency . . . . . . . . 668--681 Rolando Garcia and Eric Liu and Vikram Sreekanti and Bobby Yan and Anusha Dandamudi and Joseph E. Gonzalez and Joseph M. Hellerstein and Koushik Sen Hindsight logging for model training . . 682--693 Lin Jiang and Junqiao Qiu and Zhijia Zhao Scalable structural index construction for JSON analytics . . . . . . . . . . . 694--707 Ran Rui and Hao Li and Yi-Cheng Tu Efficient join algorithms for large database tables in a multi-GPU environment . . . . . . . . . . . . . . 708--720
Shuyuan Yan and Bolin Ding and Wei Guo and Jingren Zhou and Zhewei Wei and Xiaowei Jiang and Sheng Xu FlashP: an analytical pipeline for real-time forecasting of time-series relational data . . . . . . . . . . . . 721--729 Chi Thang Duong and Trung Dung Hoang and Hongzhi Yin and Matthias Weidlich and Quoc Viet Hung Nguyen and Karl Aberer Efficient streaming subgraph isomorphism with graph neural networks . . . . . . . 730--742 Yi Lu and Xiangyao Yu and Lei Cao and Samuel Madden Epoch-based commit and replication in distributed OLTP databases . . . . . . . 743--756 Zhe Lin and Fan Zhang and Xuemin Lin and Wenjie Zhang and Zhihong Tian Hierarchical core maintenance on large dynamic graphs . . . . . . . . . . . . . 757--770 Jayashree Mohan and Amar Phanishayee and Ashish Raniwala and Vijay Chidambaram Analyzing and mitigating data stalls in DNN training . . . . . . . . . . . . . . 771--784 Daokun Hu and Zhiwen Chen and Jianbing Wu and Jianhua Sun and Hao Chen Persistent memory hash indexes: an experimental evaluation . . . . . . . . 785--798 Cheng Chen and Jun Yang and Mian Lu and Taize Wang and Zhao Zheng and Yuqiang Chen and Wenyuan Dai and Bingsheng He and Weng-Fai Wong and Guoan Wu and Yuping Zhao and Andy Rudoff Optimizing in-memory database engine for AI-powered on-line decision augmentation using persistent memory . . . . . . . . 799--812 Arif Usta and Akifhan Karakayali and Özgür Ulusoy DBTagger: multi-task learning for keyword mapping in NLIDBs using bi-directional recurrent neural networks 813--821 Ritesh Sarkhel and Arnab Nandi Improving information extraction from visually rich documents using visual span representations . . . . . . . . . . 822--834 Gang Liu and Leying Chen and Shimin Chen Zen: a high-throughput log-free OLTP engine for non-volatile main memory . . 835--848 Tianxi Ji and Pan Li and Emre Yilmaz and Erman Ayday and Yanfang (Fanny) Ye and Jinyuan Sun Differentially private binary- and matrix-valued data query: an XOR mechanism . . . . . . . . . . . . . . . 849--862
Supun Nakandala and Yuhao Zhang and Arun Kumar Errata for ``Cerebro: a data system for optimized deep learning model selection'' . . . . . . . . . . . . . . 863--863 Lujia Yin and Yiming Zhang and Zhaoning Zhang and Yuxing Peng and Peng Zhao ParaX: boosting deep learning for big data analytics on many-core CPUs . . . . 864--877 Walter Cai and Philip A. Bernstein and Wentao Wu and Badrish Chandramouli Optimization of threshold functions over streams . . . . . . . . . . . . . . . . 878--889 Xuliang Zhu and Xin Huang and Byron Choi and Jiaxin Jiang and Zhaonian Zou and Jianliang Xu Budget constrained interactive search for multiple targets . . . . . . . . . . 890--902 Yangjun Chen and Hoang Hai Nguyen On the string matching with $k$ differences in DNA databases . . . . . . 903--915 Yasuhiro Fujiwara and Sekitoshi Kanai and Yasutoshi Ida and Atsutoshi Kumagai and Naonori Ueda Fast algorithm for anchor graph hashing 916--928 Wangda Zhang and Junyoung Kim and Kenneth A. Ross and Eric Sedlar and Lukas Stadler Adaptive code generation for data-intensive analytics . . . . . . . . 929--942 Efthymia Tsamoura and David Carral and Enrico Malizia and Jacopo Urbani Materializing knowledge bases via trigger graphs . . . . . . . . . . . . . 943--956 Jinfei Liu and Jian Lou and Junxu Liu and Li Xiong and Jian Pei and Jimeng Sun Dealer: an end-to-end model marketplace with differential privacy . . . . . . . 957--969 Sajjadur Rahman and Mangesh Bendre and Yuyang Liu and Shichu Zhu and Zhaoyuan Su and Karrie Karahalios and Aditya G. Parameswaran NOAH: interactive spreadsheet exploration with dynamic hierarchical overviews . . . . . . . . . . . . . . . 970--983 Yixing Yang and Yixiang Fang and Maria E. Orlowska and Wenjie Zhang and Xuemin Lin Efficient bi-triangle counting for large bipartite networks . . . . . . . . . . . 984--996 Sandeep Tata and Navneet Potti and James B. Wendt and Lauro Beltrão Costa and Marc Najork and Beliz Gunel Glean: structured extractions from templatic documents . . . . . . . . . . 997--1005 Jun Gao and Jiazun Chen and Zhao Li and Ji Zhang ICS-GNN: lightweight interactive community search via graph neural network . . . . . . . . . . . . . . . . 1006--1018 Yuanyuan Sun and Sheng Wang and Huorong Li and Feifei Li Building enclave-native storage engines for practical encrypted databases . . . 1019--1032 James Thorne and Majid Yazdani and Marzieh Saeidi and Fabrizio Silvestri and Sebastian Riedel and Alon Halevy From natural language processing to neural databases . . . . . . . . . . . . 1033--1039 Haibo Wang and Chaoyi Ma and Olufemi O. Odegbile and Shigang Chen and Jih-Kwon Peir Randomized error removal for online spread estimation in data streaming . . 1040--1052 Dean De Leo and Peter Boncz Teseo and the analysis of structural dynamic graphs . . . . . . . . . . . . . 1053--1066 Tim Gubner and Peter Boncz Charting the design space of query execution using VOILA . . . . . . . . . 1067--1079 Zhiqi Wang and Jin Xue and Zili Shao Heracles: an efficient storage model and data flushing for performance monitoring timeseries . . . . . . . . . . . . . . . 1080--1092 Stephen Macke and Hongpu Gong and Doris Jung-Lin Lee and Andrew Head and Doris Xin and Aditya Parameswaran Fine-grained lineage for safer notebook interactions . . . . . . . . . . . . . . 1093--1101 Anton Tsitsulin and Marina Munkhoeva and Davide Mottin and Panagiotis Karras and Ivan Oseledets and Emmanuel Müller FREDE: anytime graph embeddings . . . . 1102--1110 Xiaodong Li and Reynold Cheng and Kevin Chen-Chuan Chang and Caihua Shan and Chenhao Ma and Hongtai Cao On analyzing graphs with motif-paths . . 1111--1123
Dimitris Tsaras and George Trimponias and Lefteris Ntaflos and Dimitris Papadias Collective influence maximization for multiple competing products with an awareness-to-influence model . . . . . . 1124--1136 Yahui Sun and Xiaokui Xiao and Bin Cui and Saman Halgamuge and Theodoros Lappas and Jun Luo Finding group Steiner trees in graphs with both vertex and edge weights . . . 1137--1149 Tenindra Abeywickrama and Victor Liang and Kian-Lee Tan Optimizing bipartite matching in real-world applications by incremental cost computation . . . . . . . . . . . . 1150--1158 Immanuel Trummer The case for NLP-enhanced database tuning: towards tuning tools that ``read the manual'' . . . . . . . . . . . . . . 1159--1165 Sujaya Maiyya and Faisal Nawab and Divyakant Agrawal and Amr El Abbadi Errata for ``Unifying consensus and atomic commitment for effective cloud data management'' . . . . . . . . . . . 1166--1166 Zsolt István and Soujanya Ponnapalli and Vijay Chidambaram Software-defined data protection: low overhead policy compliance at the storage layer is within reach! . . . . . 1167--1174 Tianyi Li and Lu Chen and Christian S. Jensen and Torben Bach Pedersen TRACE: real-time compression of streaming trajectories in road networks 1175--1187 Arkaprava Saha and Ruben Brokkelkamp and Yllka Velaj and Arijit Khan and Francesco Bonchi Shortest paths and centrality in uncertain networks . . . . . . . . . . . 1188--1201 Tongyu Liu and Ju Fan and Yinqing Luo and Nan Tang and Guoliang Li and Xiaoyong Du Adaptive data augmentation for supervised learning over missing data 1202--1214 Fuheng Zhao and Sujaya Maiyya and Ryan Wiener and Divyakant Agrawal and Amr El Abbadi KLL$^\pm $ approximate quantile sketches over dynamic datasets . . . . . . . . . 1215--1227 Dimitrije Jankov and Binhang Yuan and Shangyu Luo and Chris Jermaine Distributed numerical and machine learning computations via two-phase execution of aggregated join trees . . . 1228--1240 Dana Van Aken and Dongsheng Yang and Sebastien Brillard and Ari Fiorino and Bohan Zhang and Christian Bilien and Andrew Pavlo An inquiry into machine learning-based automatic configuration tuning services on real-world database management systems . . . . . . . . . . . . . . . . 1241--1253
Nan Tang and Ju Fan and Fangyi Li and Jianhong Tu and Xiaoyong Du and Guoliang Li and Sam Madden and Mourad Ouzzani RPT: relational pre-trained transformer is almost all you need towards democratizing data preparation . . . . . 1254--1261 Jia Zou and Amitabh Das and Pratik Barhate and Arun Iyengar and Binhang Yuan and Dimitrije Jankov and Chris Jermaine Lachesis: automatic partitioning for UDF-centric analytics . . . . . . . . . 1262--1275 Jiacheng Wu and Yong Zhang and Shimin Chen and Jin Wang and Yu Chen and Chunxiao Xing Updatable learned index with precise positions . . . . . . . . . . . . . . . 1276--1288 Ziquan Fang and Lu Pan and Lu Chen and Yuntao Du and Yunjun Gao MDTP: a multi-source deep traffic prediction framework over spatio-temporal trajectory data . . . . 1289--1297 Seunghwan Min and Sung Gwan Park and Kunsoo Park and Dora Giammarresi and Giuseppe F. Italiano and Wook-Shin Han Symmetric continuous subgraph matching with bidirectional dynamic programming 1298--1310 Tomoya Suzuki and Kazuhiro Hiwada and Hirotsugu Kajihara and Shintaro Sano and Shuou Nomura and Tatsuo Shiozawa Approaching DRAM performance by using microsecond-latency flash memory for small-sized random read accesses: a new access method and its graph applications 1311--1324 Abdelghny Orogat and Isabelle Liu and Ahmed El-Roby CBench: towards better evaluation of question answering over knowledge graphs 1325--1337 Binhang Yuan and Dimitrije Jankov and Jia Zou and Yuxin Tang and Daniel Bourgeois and Chris Jermaine Tensor relational algebra for distributed machine learning system design . . . . . . . . . . . . . . . . . 1338--1350 Wenfei Fan and Chao Tian and Yanghao Wang and Qiang Yin Parallel discrepancy detection and incremental detection . . . . . . . . . 1351--1364 Tiantian Liu and Huan Li and Hua Lu and Muhammad Aamir Cheema and Lidan Shou Towards crowd-aware indoor path planning 1365--1377 Surabhi Gupta and Karthik Ramachandra Procedural extensions of SQL: understanding their usage in the wild 1378--1391 Sagar Bharadwaj and Praveen Gupta and Ranjita Bhagwan and Saikat Guha Discovering related data at scale . . . 1392--1400 Stefano Cereda and Stefano Valladares and Paolo Cremonesi and Stefano Doni CGPTuner: a contextual Gaussian process bandit approach for the automatic tuning of IT configurations under varying workload conditions . . . . . . . . . . 1401--1413 Filippo Schiavio and Daniele Bonetta and Walter Binder Language-agnostic integrated queries in a managed polyglot runtime . . . . . . . 1414--1426 Chinmay Kulkarni and Badrish Chandramouli and Ryan Stutsman Achieving high throughput and elasticity in a larger-than-memory store . . . . . 1427--1440 Kai Yao and Lijun Chang Efficient size-bounded community search over large networks . . . . . . . . . . 1441--1453
Jianwen Zhao and Yufei Tao Minimum vertex augmentation . . . . . . 1454--1466 Kevin P. Gaffney and Robert Claus and Jignesh M. Patel Database isolation by scheduling . . . . 1467--1480 Jong-Hyeok Park and Soyee Choi and Gihwan Oh and Sang-Won Lee SaS: SSD as SQL database system . . . . 1481--1488 Rong Zhu and Ziniu Wu and Yuxing Han and Kai Zeng and Andreas Pfadler and Zhengping Qian and Jingren Zhou and Bin Cui FLAT: fast, lightweight and accurate method for cardinality estimation . . . 1489--1502 Tsz Nam Chan and Zhe Li and Leong Hou U. and Jianliang Xu and Reynold Cheng Fast augmentation algorithms for network kernel density visualization . . . . . . 1503--1516 Jiawei Wang and Cheng Li and Kai Ma and Jingze Huo and Feng Yan and Xinyu Feng and Yinlong Xu AUTOGR: automated geo-replication with fast system performance and preserved application semantics . . . . . . . . . 1517--1530 Qing Liu and Xuliang Zhu and Xin Huang and Jianliang Xu Local algorithms for distance-generalized core decomposition over large dynamic graphs . . . . . . . 1531--1543 Lawrence Benson and Hendrik Makait and Tilmann Rabl Viper: an efficient hybrid PMem-DRAM key-value store . . . . . . . . . . . . 1544--1556 Sepanta Zeighami and Cyrus Shahabi and John Krumm Estimating spread of contact-based contagions in a population through sub-sampling . . . . . . . . . . . . . . 1557--1569 Herodotos Herodotou and Elena Kakoulli Trident: task scheduling over tiered storage systems in big data platforms 1570--1582 Zicun Cong and Lingyang Chu and Yu Yang and Jian Pei Comprehensible counterfactual explanation on Kolmogorov--Smirnov test 1583--1596 Hongkuan Zhou and Ajitesh Srivastava and Hanqing Zeng and Rajgopal Kannan and Viktor Prasanna Accelerating large scale real-time GNN inference using channel pruning . . . . 1597--1605 Viktor Leis and Maximilian Kuschewski Towards cost-optimal query processing in the cloud . . . . . . . . . . . . . . . 1606--1612 Shufeng Gong and Chao Tian and Qiang Yin and Wenyuan Yu and Yanfeng Zhang and Liang Geng and Song Yu and Ge Yu and Jingren Zhou Automating incremental graph processing with flexible memoization . . . . . . . 1613--1625 Theo Jepsen and Alberto Lerner and Fernando Pedone and Robert Soulé and Philippe Cudré-Mauroux In-network support for transaction triaging . . . . . . . . . . . . . . . . 1626--1639 Xiaoying Wang and Changbo Qu and Weiyuan Wu and Jiannan Wang and Qingqing Zhou Are we ready for learned cardinality estimation? . . . . . . . . . . . . . . 1640--1654 Jakub Lemiesz On the algebra of data sketches . . . . 1655--1667 Guanhao Hou and Xingguang Chen and Sibo Wang and Zhewei Wei Massively parallel algorithms for Personalized Pagerank . . . . . . . . . 1668--1680 Maximilian Schleich and Zixuan Geng and Yihong Zhang and Dan Suciu GeCo: quality counterfactual explanations in real time . . . . . . . 1681--1693 Ricardo Salazar and Felix Neutatz and Ziawasch Abedjan Automated feature engineering for algorithmic fairness . . . . . . . . . . 1694--1702
Raghavendra Addanki and Sainyam Galhotra and Barna Saha How to design robust algorithms using noisy comparison Oracle . . . . . . . . 1703--1716 Paul Boniol and John Paparrizos and Themis Palpanas and Michael J. Franklin SAND: streaming subsequence anomaly detection . . . . . . . . . . . . . . . 1717--1729 Yingtai Xiao and Zeyu Ding and Yuxin Wang and Danfeng Zhang and Daniel Kifer Optimizing fitness-for-use of differentially private linear queries 1730--1742 Xinle Cao and Jian Liu and Hao Lu and Kui Ren Cryptanalysis of an encrypted database in SIGMOD '14 . . . . . . . . . . . . . 1743--1755 Tianyuan Jin and Yu Yang and Renchi Yang and Jieming Shi and Keke Huang and Xiaokui Xiao Unconstrained submodular maximization with modular costs: tight approximation and application to profit maximization 1756--1768 Yuhao Zhang and Frank McQuillan and Nandish Jayaram and Nikhil Kak and Ekta Khanna and Orhan Kislal and Domino Valdano and Arun Kumar Distributed deep learning on data systems: a comparative analysis of approaches . . . . . . . . . . . . . . . 1769--1782 Siyuan Sheng and Qun Huang and Sa Wang and Yungang Bao PR-sketch: monitoring per-key aggregation of streaming data with nearly full accuracy . . . . . . . . . . 1783--1796 Dimitrios Koutsoukos and Supun Nakandala and Konstantinos Karanasos and Karla Saur and Gustavo Alonso and Matteo Interlandi Tensors: an abstraction for general data processing . . . . . . . . . . . . . . . 1797--1804 David Pujol and Yikai Wu and Brandon Fain and Ashwin Machanavajjhala Budget sharing for multi-analyst differential privacy . . . . . . . . . . 1805--1817 Rudi Poepsel-Lemaitre and Martin Kiefer and Joscha von Hein and Jorge-Arnulfo Quiané-Ruiz and Volker Markl In the land of data streams where synopses are missing, one framework to bring them all . . . . . . . . . . . . . 1818--1831 Yifan Li and Xiaohui Yu and Nick Koudas Data acquisition for improving machine learning models . . . . . . . . . . . . 1832--1844 Xiaoshuang Chen and Kai Wang and Xuemin Lin and Wenjie Zhang and Lu Qin and Ying Zhang Efficiently answering reachability and path queries on temporal bipartite graphs . . . . . . . . . . . . . . . . . 1845--1858 Paolo Ciaccia and Davide Martinenghi and Riccardo Torlone Preference queries over taxonomic domains . . . . . . . . . . . . . . . . 1859--1871 Baoyue Yan and Xuntao Cheng and Bo Jiang and Shibin Chen and Canfang Shang and Jianying Wang and Gui Huang and Xinjun Yang and Wei Cao and Feifei Li Revisiting the design of LSM-tree Based OLTP storage engine with persistent memory . . . . . . . . . . . . . . . . . 1872--1885 Chang Ge and Shubhankar Mohapatra and Xi He and Ihab F. Ilyas Kamino: constraint-aware differentially private data synthesis . . . . . . . . . 1886--1899 Yingqiang Zhang and Chaoyi Ruan and Cheng Li and Xinjun Yang and Wei Cao and Feifei Li and Bo Wang and Jing Fang and Yuhui Wang and Jingze Huo and Chao Bi Towards cost-effective and elastic cloud database deployment via memory disaggregation . . . . . . . . . . . . . 1900--1912 Ralph Peeters and Christian Bizer Dual-objective fine-tuning of BERT for entity matching . . . . . . . . . . . . 1913--1921
Maciej Besta and Zur Vonarburg-Shmaria and Yannick Schaffner and Leonardo Schwarz and Grzegorz Kwasniewski and Lukas Gianinazzi and Jakub Beranek and Kacper Janda and Tobias Holenstein and Sebastian Leisinger and Peter Tatkowski and Esref Ozdemir and Adrian Balla and Marcin Copik and Philipp Lindenberger and Marek Konieczny and Onur Mutlu and Torsten Hoefler GraphMineSuite: enabling high-performance and programmable graph mining algorithms with set algebra . . . 1922--1935 Keita Takenouchi and Takashi Ishio and Joji Okada and Yuji Sakata PATSQL: efficient synthesis of SQL queries from example tables with quick inference of projected columns . . . . . 1937--1949 Jie Liu and Wenqian Dong and Qingqing Zhou and Dong Li Fauce: fast and accurate deep ensembles with uncertainty for cardinality estimation . . . . . . . . . . . . . . . 1950--1963 Mengzhao Wang and Xiaoliang Xu and Qiang Yue and Yuxiang Wang A comprehensive survey and experimental comparison of graph-based approximate nearest neighbor search . . . . . . . . 1964--1978 Zifeng Yuan and Huey Eng Chua and Sourav S. Bhowmick and Zekun Ye and Wook-Shin Han and Byron Choi Towards plug-and-play visual graph query interfaces: data-driven selection of canned patterns for large networks . . . 1979--1991 Shixuan Sun and Yuhang Chen and Shengliang Lu and Bingsheng He and Yuchen Li ThunderRW: an in-memory graph random walk engine . . . . . . . . . . . . . . 1992--2005 Zheng Dong and Xin Huang and Guorui Yuan and Hengshu Zhu and Hui Xiong Butterfly-core community search over labeled graphs . . . . . . . . . . . . . 2006--2018 Parimarjan Negi and Ryan Marcus and Andreas Kipf and Hongzi Mao and Nesime Tatbul and Tim Kraska and Mohammad Alizadeh Flow-loss: learning cardinality estimates that matter . . . . . . . . . 2019--2032 Michael Yu and Dong Wen and Lu Qin and Ying Zhang and Wenjie Zhang and Xuemin Lin On querying historical $k$-cores . . . . 2033--2045 Graham Cormode and Samuel Maddock and Carsten Maple Frequency estimation under local differential privacy . . . . . . . . . . 2046--2058 Fatjon Zogaj and José Pablo Cambronero and Martin C. Rinard and Jürgen Cito Doing more with less: characterizing dataset downsampling for AutoML . . . . 2059--2072 Yifan Li and Xiaohui Yu and Nick Koudas LES 3: learning-based exact set similarity search . . . . . . . . . . . 2073--2086 Seung Won Min and Kun Wu and Sitao Huang and Mert Hidayetoglu and Jinjun Xiong and Eiman Ebrahimi and Deming Chen and Wen-mei Hwu Large graph convolutional network training with GPU-oriented data communication architecture . . . . . . . 2087--2100 Yifei Yang and Matt Youill and Matthew Woicik and Yizhou Liu and Xiangyao Yu and Marco Serafini and Ashraf Aboulnaga and Michael Stonebraker FlexPushdownDB: hybrid pushdown and caching in a cloud DBMS . . . . . . . . 2101--2113 Zhiwei Chen and Shaoxu Song and Ziheng Wei and Jingyun Fang and Jiang Long Approximating median absolute deviation with bounded error . . . . . . . . . . . 2114--2126 Mengxuan Zhang and Lei Li and Xiaofang Zhou An experimental evaluation and guideline for path finding in weighted dynamic network . . . . . . . . . . . . . . . . 2127--2140 Brecht Vandevoort and Bas Ketsman and Christoph Koch and Frank Neven Robustness against read committed for transaction templates . . . . . . . . . 2141--2153 Huayi Zhang and Lei Cao and Samuel Madden and Elke Rundensteiner LANCET: labeling complex data at scale 2154--2166 Yang Li and Yu Shen and Wentao Zhang and Jiawei Jiang and Bolin Ding and Yaliang Li and Jingren Zhou and Zhi Yang and Wentao Wu and Ce Zhang and Bin Cui VolcanoML: speeding up end-to-end AutoML via scalable search space decomposition 2167--2176 Peng Cheng and Jiabao Jin and Lei Chen and Xuemin Lin and Libin Zheng A queueing-theoretic framework for vehicle dispatching in dynamic car-hailing . . . . . . . . . . . . . . 2177--2189 Kuntai Cai and Xiaoyu Lei and Jianxin Wei and Xiaokui Xiao Data synthesis via differentially private Markov random fields . . . . . . 2190--2202 Michael Whittaker and Ailidani Ailijiang and Aleksey Charapko and Murat Demirbas and Neil Giridharan and Joseph M. Hellerstein and Heidi Howard and Ion Stoica and Adriana Szekeres Scaling replicated state machines with compartmentalization . . . . . . . . . . 2203--2215 Subhadeep Sarkar and Dimitris Staratzis and Ziehen Zhu and Manos Athanassoulis Constructing and analyzing the LSM compaction design space . . . . . . . . 2216--2229 Jelle Hellings and Mohammad Sadoghi ByShard: sharding in a Byzantine environment . . . . . . . . . . . . . . 2230--2243 Otmar Ertl SetSketch: filling the gap between MinHash and HyperLogLog . . . . . . . . 2244--2257 Ergute Bao and Yin Yang and Xiaokui Xiao and Bolin Ding CGM: an enhanced mechanism for streaming data collection with local differential privacy . . . . . . . . . . . . . . . . 2258--2270 Dean De Leo and Per Fuchs and Peter Boncz Errata for ``Teseo and the analysis of structural dynamic graphs'': (PVLDB \bf 14(6):1053--1066) . . . . . . . . . . . 2271--2272 Mashaal Musleh and Sofiane Abbar and Rade Stanojevic and Mohamed Mokbel QARTA: an ML-based system for accurate map services . . . . . . . . . . . . . . 2273--2282 Teddy Cunningham and Graham Cormode and Hakan Ferhatosmanoglu and Divesh Srivastava Real-world trajectory sharing with local differential privacy . . . . . . . . . . 2283--2295 Phanwadee Sinthong and Michael J. Carey PolyFrame: a retargetable query-based approach to scaling dataframes . . . . . 2296--2304 Jessica Shi and Laxman Dhulipala and David Eisenstat and Jakub Lacki and Vahab Mirrokni Scalable community detection via parallel correlation clustering . . . . 2305--2313 Cheng Xu and Ce Zhang and Jianliang Xu and Jian Pei SlimChain: scaling blockchain transactions through off-chain storage and parallel processing . . . . . . . . 2314--2326 Side Li and Arun Kumar Towards an optimized GROUP by abstraction for large-scale machine learning . . . . . . . . . . . . . . . . 2327--2340 Daniel Kang and John Guibas and Peter Bailis and Tatsunori Hashimoto and Yi Sun and Matei Zaharia Accelerating approximate aggregation queries with expensive predicates . . . 2341--2354 Tobias Schmidt and Maximilian Bandle and Jana Giceva A four-dimensional analysis of partitioned approximate filters . . . . 2355--2368 Monica Chiosa and Thomas B. Preußer and Gustavo Alonso SKT: a one-pass multi-sketch data analytics accelerator . . . . . . . . . 2369--2382 Philipp Fent and Thomas Neumann A practical approach to groupjoin and nested aggregates . . . . . . . . . . . 2383--2396 Ziyun Wei and Immanuel Trummer and Connor Anderson Robust voice querying with MUVE: optimally visualizing results of phonetically similar queries . . . . . . 2397--2409 Yinjun Wu and James Weimer and Susan B. Davidson CHEF: a cheap and fast pipeline for iteratively cleaning label uncertainties 2410--2418 Tarique Siddiqui and Surajit Chaudhuri and Vivek Narasayya COMPARE: accelerating groupwise comparison in relational databases for data analytics . . . . . . . . . . . . . 2419--2431 Dominik Durner and Badrish Chandramouli and Yinan Li Crystal: a unified cache storage system for analytical databases . . . . . . . . 2432--2444 Valerio Cetorelli and Paolo Atzeni and Valter Crescenzi and Franco Milicchio The smallest extraction problem . . . . 2445--2458 Saravanan Thirumuruganathan and Han Li and Nan Tang and Mourad Ouzzani and Yash Govind and Derek Paulsen and Glenn Fung and AnHai Doan Deep learning for blocking in entity matching: a design space exploration . . 2459--2472 Wentao Zhang and Zhi Yang and Yexin Wang and Yu Shen and Yang Li and Liang Wang and Bin Cui GRAIN: improving data efficiency of \em graph neural networks via diversified \em influence maximization . . . . . . . 2473--2482 Maximilian Bandle and Jana Giceva Database technology for the masses: sub-operators as first-class entities 2483--2490 Pranjal Gupta and Amine Mhedhbi and Semih Salihoglu Columnar storage and list-based processing for graph database management systems . . . . . . . . . . . . . . . . 2491--2504 Yiwen Zhu and Matteo Interlandi and Abhishek Roy and Krishnadhan Das and Hiren Patel and Malay Bag and Hitesh Sharma and Alekh Jindal Phoebe: a learning-based checkpoint optimizer . . . . . . . . . . . . . . . 2505--2518 Fatemeh Nargesian and Abolfazl Asudeh and H. V. Jagadish Tailoring data source distributions for fairness-aware data integration . . . . 2519--2532 Parikshit Bansal and Prathamesh Deshpande and Sunita Sarawagi Missing value imputation on multidimensional time series . . . . . . 2533--2545 El Kindi Rezig and Mourad Ouzzani and Walid G. Aref and Ahmed K. Elmagarmid and Ahmed R. Mahmood and Michael Stonebraker Horizon: scalable dependency-driven data cleaning . . . . . . . . . . . . . . . . 2546--2554 Ted Shaowang and Nilesh Jain and Dennis D. Matthews and Sanjay Krishnan Declarative data serving: the future of machine learning inference on the edge 2555--2562 Junwen Yang and Yeye He and Surajit Chaudhuri Auto-pipeline: synthesizing complex data pipelines by-target using reinforcement learning and search . . . . . . . . . . 2563--2575 Brandon Lockhart and Jinglin Peng and Weiyuan Wu and Jiannan Wang and Eugene Wu Explaining inference queries with Bayesian optimization . . . . . . . . . 2576--2585 Chunwei Liu and Hao Jiang and John Paparrizos and Aaron J. Elmore Decomposed bounded floats for fast compression and queries . . . . . . . . 2586--2598 Nikolaos Tziavelis and Wolfgang Gatterbauer and Mirek Riedewald Beyond equi-joins: ranking, enumeration and factorization . . . . . . . . . . . 2599--2612 Vincent Jacob and Fei Song and Arnaud Stiegler and Bijan Rad and Yanlei Diao and Nesime Tatbul Exathlon: a benchmark for explainable anomaly detection over time series . . . 2613--2626 Michael Kuchnik and George Amvrosiadis and Virginia Smith Progressive compressed records: taking a byte out of deep learning data . . . . . 2627--2641 Abdulrahman Alsaudi and Yasser Altowim and Sharad Mehrotra and Yaming Yu TQEL: framework for query-driven linking of top-$k$ entities in social media blogs . . . . . . . . . . . . . . . . . 2642--2654
Tsz Nam Chan and Pak Lon Ip and Leong Hou U. and Weng Hou Tong and Shivansh Mittal and Ye Li and Reynold Cheng KDV-explorer: a near real-time kernel density visualization system for spatial analysis . . . . . . . . . . . . . . . . 2655--2658 Zhebin Zhang and Dajie Dong and Yuhang Ma and Yilong Ying and Dawei Jiang and Ke Chen and Lidan Shou and Gang Chen Refiner: a reliable incentive-driven federated learning system powered by blockchain . . . . . . . . . . . . . . . 2659--2662 Valter Uotila and Jiaheng Lu and Dieter Gawlick and Zhen Hua Liu and Souripriya Das and Gregory Pogossiants MultiCategory: multi-model query processing meets category theory and functional programming . . . . . . . . . 2663--2666 Qichen Wang and Chaoqi Zhang and Danish Alsayed and Ke Yi and Bin Wu and Feifei Li and Chaoqun Zhan Cquirrel: continuous query processing over acyclic relational schemas . . . . 2667--2670 Yuetian Mao and Shuai Yuan and Nan Cui and Tianjiao Du and Beijun Shen and Yuting Chen DeFiHap: detecting and fixing HiveQL anti-patterns . . . . . . . . . . . . . 2671--2674 Ahmed Helal and Mossad Helali and Khaled Ammar and Essam Mansour A demonstration of KGLac: a data discovery and enrichment platform for data science . . . . . . . . . . . . . . 2675--2678 Pierre Faure-Giovagnoli and Marie Le Guilly and Jean-Marc Petit and Vasile-Marian Scuturici ADESIT: visualize the limits of your data in a machine learning process . . . 2679--2682 Yinzhao Yan and Raymond Chi-Wing Wong Path advisor: a multi-functional campus map tool for shortest path . . . . . . . 2683--2686 Liangde Li and Supun Nakandala and Arun Kumar Intermittent human-in-the-loop model selection using Cerebro: a demonstration 2687--2690 Henning Funke and Jens Teubner Low-latency compilation of SQL queries to machine code . . . . . . . . . . . . 2691--2694 Sven Groppe and Rico Klinckenberg and Benjamin Warnke Sound of databases: sonification of a semantic web database engine . . . . . . 2695--2698 Zihao Chen and Zhizhen Xu and Chen Xu and Juan Soto and Volker Markl and Weining Qian and Aoying Zhou HyMAC: a hybrid matrix computation system . . . . . . . . . . . . . . . . . 2699--2702 Jingbo Xu and Zhanning Bai and Wenfei Fan and Longbin Lai and Xue Li and Zhao Li and Zhengping Qian and Lei Wang and Lei Wang and Yanyan Wang and Wenyuan Yu and Jingren Zhou GraphScope: a one-stop large graph processing system . . . . . . . . . . . 2703--2706 Alexander Renz-Wieland and Tobias Drobisch and Zoi Kaoudi and Rainer Gemulla and Volker Markl Just move it!: dynamic parameter allocation in action . . . . . . . . . . 2707--2710 Abdelghny Orogat and Ahmed El-Roby CBench: demonstrating comprehensive evaluation of question answering systems over knowledge graphs through deep analysis of benchmarks . . . . . . . . . 2711--2714 Lucas Woltmann and Dominik Olwig and Claudio Hartmann and Dirk Habich and Wolfgang Lehner PostCENN: postgreSQL with machine learning models for cardinality estimation . . . . . . . . . . . . . . . 2715--2718 Jinyang Li and Yuval Moskovitch and H. V. Jagadish DENOUNCER: detection of unfairness in classifiers . . . . . . . . . . . . . . 2719--2722 Sofiane Abbar and Rade Stanojevic and Mashaal Musleh and Mohamed ElShrif and Mohamed Mokbel A demonstration of QARTA: an ML-based system for accurate map services . . . . 2723--2726 Jaclyn Smith and Michael Benedikt and Brandon Moore and Milos Nikolic TraNCE: transforming nested collections efficiently . . . . . . . . . . . . . . 2727--2730 Ralf Diestelkämper and Seokki Lee and Boris Glavic and Melanie Herschel Debugging missing answers for spark queries over nested data with Breadcrumb 2731--2734 Renzhi Wu and Prem Sakala and Peng Li and Xu Chu and Yeye He Demonstration of Panda: a weakly supervised entity matching system . . . 2735--2738 Jiabin Liu and Fu Zhu and Chengliang Chai and Yuyu Luo and Nan Tang Automatic data acquisition for deep learning . . . . . . . . . . . . . . . . 2739--2742 Xuanhe Zhou and Lianyuan Jin and Ji Sun and Xinyang Zhao and Xiang Yu and Jianhua Feng and Shifu Li and Tianqing Wang and Kun Li and Luyang Liu DBMind: a self-driving platform in openGauss . . . . . . . . . . . . . . . 2743--2746 Qiongqiong Lin and Jiayao Zhang and Jinfei Liu and Kui Ren and Jian Lou and Junxu Liu and Li Xiong and Jian Pei and Jimeng Sun Demonstration of dealer: an end-to-end model marketplace with differential privacy . . . . . . . . . . . . . . . . 2747--2750 Tianyu Mu and Hongzhi Wang and Shenghe Zheng and Shaoqing Zhang and Cheng Liang and Haoyun Tang Assassin: an automatic classification system based on algorithm selection . . 2751--2754 Lei Cao and Dongqing Xiao and Yizhou Yan and Samuel Madden and Guoliang Li ATLANTIC: making database differentially private and faster with accuracy guarantee . . . . . . . . . . . . . . . 2755--2758 Anze Xie and Anders Carlsson and Jason Mohoney and Roger Waleffe and Shanan Peters and Theodoros Rekatsinas and Shivaram Venkataraman Demo of Marius: a system for large-scale graph embeddings . . . . . . . . . . . . 2759--2762 Heiko Müller and Sonia Castelo and Munaf Qazi and Juliana Freire From papers to practice: the \pkgopenclean open-source data cleaning library . . . . . . . . . . . . . . . . 2763--2766 Yongming Ge and Vanessa Lin and Maureen Daum and Brandon Haynes and Alvin Cheung and Magdalena Balazinska Demonstration of apperception: a database management system for geospatial video data . . . . . . . . . 2767--2770 Mary Karatzoglidi and Paraskevas Kerasiotis and Verena Kantere Automated energy consumption forecasting with EnForce . . . . . . . . . . . . . . 2771--2774 Myung-Hwan Jang and Yong-Yeon Jo and Sang-Wook Kim RealGraph web: a graph analysis platform on the web . . . . . . . . . . . . . . . 2775--2778 Arthita Ghosh and Deven Bansod and Arpit Narechania and Prashanth Dintyala and Su Timurturkan and Joy Arulraj Interactive demonstration of SQLCheck 2779--2782 Yiming Lin and Pramod Khargonekar and Sharad Mehrotra and Nalini Venkatasubramanian T-cove: an exposure tracing system based on cleaning wi-fi events on organizational premises . . . . . . . . 2783--2786 Paul Y. Wang and Sainyam Galhotra and Romila Pradhan and Babak Salimi Demonstration of generating explanations for black-box algorithms using Lewis . . 2787--2790 Sonia Castelo and Rémi Rampin and Aécio Santos and Aline Bessa and Fernando Chirigati and Juliana Freire Auctus: a dataset search engine for data discovery and augmentation . . . . . . . 2791--2794 Mohammed Suhail Rehman and Silu Huang and Aaron J. Elmore A demonstration of RELIC: a system for retrospective lineage inference of data workflows . . . . . . . . . . . . . . . 2795--2798 Zhihao Chen and Haizhen Zhuo and Quanqing Xu and Xiaodong Qi and Chengyu Zhu and Zhao Zhang and Cheqing Jin and Aoying Zhou and Ying Yan and Hui Zhang SChain: a scalable consortium blockchain exploiting intra- and inter-block concurrency . . . . . . . . . . . . . . 2799--2802 Chrysovalantis Anastasiou and Constantinos Costa and Panos K. Chrysanthis and Cyrus Shahabi EPICGen: an experimental platform for indoor congestion generation and forecasting . . . . . . . . . . . . . . 2803--2806 Hiba Arnaout and Simon Razniewski and Gerhard Weikum and Jeff Z. Pan Wikinegata: a knowledge base with interesting negative statements . . . . 2807--2810 Jinwei Zhu and Kun Cheng and Jiayang Liu and Liang Guo Full encryption: an end to end encryption mechanism in GaussDB . . . . 2811--2814 Antonis Mandamadiotis and Stavroula Eleftherakis and Apostolos Glenis and Dimitrios Skoutas and Yannis Stavrakas and Georgia Koutrika DatAgent: the imminent age of intelligent data assistants . . . . . . 2815--2818 El Kindi Rezig and Anshul Bhandari and Anna Fariha and Benjamin Price and Allan Vanterpool and Vijay Gadepally and Michael Stonebraker DICE: data discovery by example . . . . 2819--2822 Felix Schuhknecht and Aaron Priesterroth and Justus Henneberg and Reza Salkhordeh AnyOLAP: analytical processing of arbitrary data-intensive applications without ETL . . . . . . . . . . . . . . 2823--2826 Vincent Jacob and Fei Song and Arnaud Stiegler and Bijan Rad and Yanlei Diao and Nesime Tatbul A demonstration of the Exathlon benchmarking platform for explainable anomaly detection . . . . . . . . . . . 2827--2830 Amir Shaikhha and Maximilian Schleich and Dan Olteanu An intermediate representation for hybrid database and machine learning workloads . . . . . . . . . . . . . . . 2831--2834 Eliana Pastor and Andrew Gavgavian and Elena Baralis and Luca de Alfaro How divergent is your data? . . . . . . 2835--2838 Auday Berro and Mohammad-Ali Yaghub Zade Fard and Marcos Baez and Boualem Benatallah and Khalid Benabdeslem An extensible and reusable pipeline for automated utterance paraphrases . . . . 2839--2842 Kaustubh Beedkar and David Brekardin and Jorge-Anulfo Quiané-Ruiz and Volker Markl Compliant geo-distributed data processing in action . . . . . . . . . . 2843--2846 Piyush Yadav and Dhaval Salwala and Felipe Arruda Pontes and Praneet Dhingra and Edward Curry Query-driven video event processing for the Internet of Multimedia Things . . . 2847--2850 Nikolaos Koutroumanis and Nikolaos Kousathanas and Christos Doulkeridis and Akrivi Vlachou A demonstration of NoDA: unified access to NoSQL stores . . . . . . . . . . . . 2851--2854 Rathijit Sen and Abhishek Roy and Alekh Jindal and Rui Fang and Jeff Zheng and Xiaolei Liu and Ruiping Li AutoExecutor: predictive parallelism for Spark SQL queries . . . . . . . . . . . 2855--2858 Jiaxiang Liu and Karl Knopf and Yiqing Tan and Bolin Ding and Xi He Catch a blowfish alive: a demonstration of policy-aware differential privacy for interactive data exploration . . . . . . 2859--2862 Paul Ouellette and Aidan Sciortino and Fatemeh Nargesian and Bahar Ghadiri Bashardoost and Erkang Zhu and Ken Q. Pu and Renée J. Miller RONIN: data lake exploration . . . . . . 2863--2866 Paul Boniol and John Paparrizos and Themis Palpanas and Michael J. Franklin SAND in action: subsequence anomaly detection for streams . . . . . . . . . 2867--2870 Christos Koutras and Kyriakos Psarakis and George Siachamis and Andra Ionescu and Marios Fragkoulis and Angela Bonifati and Asterios Katsifodimos Valentine in action: matching tabular data at scale . . . . . . . . . . . . . 2871--2874 Sheng Guan and Hanchao Ma and Sutanay Choudhury and Yinghui Wu GEDet: detecting erroneous nodes with a few examples . . . . . . . . . . . . . . 2875--2878 Wenfei Fan and Tao He and Longbin Lai and Xue Li and Yong Li and Zhao Li and Zhengping Qian and Chao Tian and Lei Wang and Jingbo Xu and Youyang Yao and Qiang Yin and Wenyuan Yu and Jingren Zhou and Diwen Zhu and Rong Zhu GraphScope: a unified engine for big graph processing . . . . . . . . . . . . 2879--2892 Zeyuan Shang and Emanuel Zgraggen and Benedetto Buratti and Philipp Eichmann and Navid Karimeddiny and Charlie Meyer and Wesley Runnels and Tim Kraska Davos: a system for interactive data-driven decision making . . . . . . 2893--2905 An Qin and Mengbai Xiao and Yongwei Wu and Xinjie Huang and Xiaodong Zhang Mixer: efficiently understanding and retrieving visual content at web-scale 2906--2917 David Justo and Shaoqing Yi and Lukas Stadler and Nadia Polikarpova and Arun Kumar Towards a polyglot framework for factorized ML . . . . . . . . . . . . . 2918--2931 Niv Dayan and Moshe Twitto and Yuval Rochman and Uri Beitler and Itai Ben Zion and Edward Bortnikov and Shmuel Dashevsky and Ofer Frishman and Evgeni Ginzburg and Igal Maly and Avraham (Poza) Meir and Mark Mokryn and Iddo Naiss and Noam Rabinovich The end of Moore's law and the rise of the data processor . . . . . . . . . . . 2932--2944 Derek G. Murray and Jirí Simsa and Ana Klimovic and Ihor Indyk \pkgtf.data: a machine learning data processing framework . . . . . . . . . . 2945--2958 Mohamed Eltabakh and Anantha Subramanian and Awny Al-Omari and Mohammed Al-Kateb and Sanjay Nair and Mahbub Hasan and Wellington Cabrera and Charles Zhang and Amit Kishore and Snigdha Prasad Not black-box anymore!: enabling analytics-aware optimizations in Teradata Vantage . . . . . . . . . . . . 2959--2971 Yingda Chen and Jiamang Wang and Yifeng Lu and Ying Han and Zhiqiang Lv and Xuebin Min and Hua Cai and Wei Zhang and Haochuan Fan and Chao Li and Tao Guan and Wei Lin and Yangqing Jia and Jingren Zhou Fangorn: adaptive execution framework for heterogeneous workloads on shared clusters . . . . . . . . . . . . . . . . 2972--2985 Ankur Agiwal and Kevin Lai and Gokul Nath Babu Manoharan and Indrajit Roy and Jagan Sankaranarayanan and Hao Zhang and Tao Zou and Min Chen and Zongchang (Jim) Chen and Ming Dai and Thanh Do and Haoyu Gao and Haoyan Geng and Raman Grover and Bo Huang and Yanlai Huang and Zhi (Adam) Li and Jianyi Liang and Tao Lin and Li Liu and Yao Liu and Xi Mao and Yalan (Maya) Meng and Prashant Mishra and Jay Patel and Rajesh S. R. and Vijayshankar Raman and Sourashis Roy and Mayank Singh Shishodia and Tianhang Sun and Ye (Justin) Tang and Junichi Tatemura and Sagar Trehan and Ramkumar Vadali and Prasanna Venkatasubramanian and Gensheng Zhang and Kefei Zhang and Yupu Zhang and Zeleng Zhuang and Goetz Graefe and Divyakant Agrawal and Jeff Naughton and Sujata Kosalge and Hakan Hacìgümü\cs Napa: powering scalable data warehousing with robust query performance at Google 2986--2997 Rubao Lee and Minghong Zhou and Chi Li and Shenggang Hu and Jianping Teng and Dongyang Li and Xiaodong Zhang The art of balance: a RateupDBTM experience of building a CPU/GPU hybrid database product . . . . . . . . . . . . 2999--3013 Audrey Cheng and Xiao Shi and Lu Pan and Anthony Simpson and Neil Wheaton and Shilpa Lawande and Nathan Bronson and Peter Bailis and Natacha Crooks and Ion Stoica RAMP-TAO: layering atomic transactions on Facebook's online TAO data store . . 3014--3027 Guoliang Li and Xuanhe Zhou and Ji Sun and Xiang Yu and Yue Han and Lianyuan Jin and Wenbo Li and Tianqing Wang and Shifu Li openGauss: an autonomous database system 3028--3042 Rahul Potharaju and Terry Kim and Eunjin Song and Wentao Wu and Lev Novik and Apoorve Dave and Andrew Fogarty and Pouria Pirzadeh and Vidip Acharya and Gurleen Dhody and Jiying Li and Sinduja Ramanujam and Nicolas Bruno and César A. Galindo-Legaria and Vivek Narasayya and Surajit Chaudhuri and Anil K. Nori and Tomas Talius and Raghu Ramakrishnan Hyperspace: the indexing subsystem of Azure Synapse . . . . . . . . . . . . . 3043--3055 Bolong Zheng and Lei Bi and Juan Cao and Hua Chai and Jun Fang and Lu Chen and Yunjun Gao and Xiaofang Zhou and Christian S. Jensen SpeakNav: voice-based route description language understanding for template-driven path search . . . . . . 3056--3068 Ana Sofia Gomes and João Oliveirinha and Pedro Cardoso and Pedro Bizarro Railgun: managing large streaming windows under MAD requirements . . . . . 3069--3082 Pavan Edara and Mosha Pasumansky Big metadata: when metadata is big data 3083--3095 Josh Stoddard and Adam Mustafa and Naveen Goela Tanium Reveal: a federated search engine for querying unstructured file data on large enterprise networks . . . . . . . 3096--3109 Can Gencer and Marko Topolnik and Viliam \vDurina and Emin Demirci and Ensar B. Kahveci and Ali Gürbüz and Ond\vrej Luká\vs and József Bartók and Grzegorz Gierlach and Franti\vsek Hartman and Ufuk Yìlmaz and Mehmet Do\ugan and Mohamed Mandouh and Marios Fragkoulis and Asterios Katsifodimos Hazelcast Jet: low-latency stream processing at the 99.99-th percentile 3110--3121 Abhishek Roy and Alekh Jindal and Priyanka Gomatam and Xiating Ouyang and Ashit Gosalia and Nishkam Ravi and Swinky Mann and Prakhar Jain SparkCruise: workload optimization in managed spark clusters at Microsoft . . 3122--3134 Tyler Akidau and Edmon Begoli and Slava Chernyak and Fabian Hueske and Kathryn Knight and Kenneth Knowles and Daniel Mills and Dan Sotolongo Watermarks in stream processing systems: semantics and comparative analysis of Apache Flink and Google cloud dataflow 3135--3147 Conor Power and Hiren Patel and Alekh Jindal and Jyoti Leeka and Bob Jenkins and Michael Rys and Ed Triou and Dexin Zhu and Lucky Katahanas and Chakrapani Bhat Talapady and Joshua Rowe and Fan Zhang and Rich Draves and Marc Friedman and Ivan Santa Maria Filho and Amrish Kumar The Cosmos big data platform at Microsoft: over a decade of progress and a decade to look forward . . . . . . . . 3148--3161 Ippokratis Pandis The evolution of Amazon Redshift . . . . 3162--3174 Simon Razniewski and Hiba Arnaout and Shrestha Ghosh and Fabian Suchanek On the limits of machine knowledge: completeness, recall and negation in web-scale knowledge bases . . . . . . . 3175--3177 Laurel Orr and Atindriyo Sanyal and Xiao Ling and Karan Goel and Megan Leszczynski Managing ML pipelines: feature stores and the coming wave of embedding ecosystems . . . . . . . . . . . . . . . 3178--3181 Yuliang Li and Xiaolan Wang and Zhengjie Miao and Wang-Chiew Tan Data augmentation for ML-driven data preparation and integration . . . . . . 3182--3185 Ramon Antonio Rodriges Zalipynis Array DBMS: past, present, and (near) future . . . . . . . . . . . . . . . . . 3186--3189 Guoliang Li and Xuanhe Zhou and Lei Cao Machine learning for databases . . . . . 3190--3193 Saeed Kargar and Faisal Nawab Extending the lifetime of NVM: challenges and opportunities . . . . . . 3194--3197 Karima Echihabi and Kostas Zoumpatianos and Themis Palpanas New trends in high-D vector similarity search: AI-driven, progressive, and distributed . . . . . . . . . . . . . . 3198--3201 Alekh Jindal and Matteo Interlandi Machine learning for cloud data systems: the progress so far and the path forward 3202--3205 Susan B. Davidson It's not just cookies and tea . . . . . 3206--3206 Thomas Neumann Evolution of a compiling query engine 3207--3210 Andrew Pavlo and Matthew Butrovich and Lin Ma and Prashanth Menon and Wan Shen Lim and Dana Van Aken and William Zhang Make your database system dream of electric sheep: towards self-driving operation . . . . . . . . . . . . . . . 3211--3221 Tim Kraska Towards instance-optimized data systems 3222--3232 Gerhard Weikum Knowledge graphs 2021: a data odyssey 3233--3238 Zachary G. Ives and Rachel Pottinger and Arun Kumar and Johannes Gehrke and Jana Giceva The future of data(base) education: is the ``cow book'' dead? . . . . . . . . . 3239--3239 Luis Remis and Chaunté W. Lacewell Using VDMS to index and search 100M images . . . . . . . . . . . . . . . . . 3240--3252
Jian Liu and Kefei Wang and Feng Chen TSCache: an efficient flash-based caching scheme for time-series data workloads . . . . . . . . . . . . . . . 3253--3266 Huayi Wang and Jingfan Meng and Long Gong and Jun Xu and Mitsunori Ogihara MP-RW-LSH: an efficient multi-probe LSH solution to ANNS-L$_1$ . . . . . . . . . 3267--3280 Theofilos Mailis and Yannis Kotidis and Stamatis Christoforidis and Evgeny Kharlamov and Yannis Ioannidis View selection over knowledge graphs in triple stores . . . . . . . . . . . . . 3281--3294 Dongjie Li and Siyi Lv and Yanyu Huang and Yijing Liu and Tong Li and Zheli Liu and Liang Guo Frequency-hiding order-preserving encryption with small client storage . . 3295--3307 Dimitrios Koutsoukos and Ingo Müller and Renato Marroquín and Ana Klimovic and Gustavo Alonso Modularis: modular relational analytics over heterogeneous distributed platforms 3308--3321 Yunkai Lou and Chaokun Wang and Tiankai Gu and Hao Feng and Jun Chen and Jeffrey Xu Yu Time-topology analysis . . . . . . . . . 3322--3334 Daniel Bernau and Günther Eibl and Philip W. Grassal and Hannah Keller and Florian Kerschbaum Quantifying identifiability to choose and audit $\epsilon$ in differentially private deep learning . . . . . . . . . 3335--3347 Rodrigo Laigner and Yongluan Zhou and Marcos Antonio Vaz Salles and Yijian Liu and Marcos Kalinowski Data management in microservices: state of the practice, challenges, and research directions . . . . . . . . . . 3348--3361 Remmelt Ammerlaan and Gilbert Antonius and Marc Friedman and H. M. Sajjad Hossain and Alekh Jindal and Peter Orenberg and Hiren Patel and Shi Qiao and Vijay Ramani and Lucas Rosenblatt and Abhishek Roy and Irene Shaffer and Soundarajan Srinivasan and Markus Weimer PerfGuard: deploying ML-for-systems without performance regressions, almost! 3362--3375 Bailu Ding and Surajit Chaudhuri and Johannes Gehrke and Vivek Narasayya DSB: a decision support benchmark for workload-driven and traditional database systems . . . . . . . . . . . . . . . . 3376--3388 Daniel Hernández and Luis Galárraga and Katja Hose Computing how-provenance for SPARQL queries via query rewriting . . . . . . 3389--3401 Junxiong Wang and Immanuel Trummer and Debabrota Basu UDO: universal database optimization using reinforcement learning . . . . . . 3402--3414 Anja Feldmann Internet traffic analysis at scale . . . 3415--3415 Danai Koutra The power of summarization in graph mining and learning: smaller data, faster methods, more interpretability 3416--3416 Nigam Shah Summarizing patients like mine via an on-demand consultation service . . . . . 3417--3417 Joaquin Vanschoren Towards scalable online machine learning collaborations with OpenML . . . . . . . 3418--3418 Manasi Vartak From ML models to intelligent applications: the rise of MLOps . . . . 3419--3419 Matei Zaharia Designing production-friendly machine learning . . . . . . . . . . . . . . . . 3420--3420
Kang Zhao and Liuyihan Song and Yingya Zhang and Pan Pan and Yinghui Xu and Rong Jin ANN softmax: acceleration of extreme classification training . . . . . . . . 1--10 Gyeong-In Yu and Saeed Amizadeh and Sehoon Kim and Artidoro Pagnoni and Ce Zhang and Byung-Gon Chun and Markus Weimer and Matteo Interlandi WindTunnel: towards differentiable ML pipelines beyond a single model . . . . 11--20 Athinagoras Skiadopoulos and Qian Li and Peter Kraft and Kostis Kaffes and Daniel Hong and Shana Mathew and David Bestor and Michael Cafarella and Vijay Gadepally and Goetz Graefe and Jeremy Kepner and Christos Kozyrakis and Tim Kraska and Michael Stonebraker and Lalith Suresh and Matei Zaharia DBOS: a DBMS-oriented operating system 21--30 Arjit Jain and Sunita Sarawagi and Prithviraj Sen Deep indexed active learning for matching heterogeneous entity representations . . . . . . . . . . . . 31--45 Xuanhe Zhou and Guoliang Li and Chengliang Chai and Jianhua Feng A learned query rewrite system using Monte Carlo tree search . . . . . . . . 46--58 Yin Lin and Brit Youngmann and Yuval Moskovitch and H. V. Jagadish and Tova Milo On detecting cherry-picked generalizations . . . . . . . . . . . . 59--71 Jiayi Wang and Chengliang Chai and Jiabin Liu and Guoliang Li FACE: a normalizing flow based cardinality estimator . . . . . . . . . 72--84 Ji Sun and Jintao Zhang and Zhaoyan Sun and Guoliang Li and Nan Tang Learned cardinality estimation: a design space exploration and a comparative evaluation . . . . . . . . . . . . . . . 85--97 Dong He and Maureen Daum and Walter Cai and Magdalena Balazinska DeepEverest: accelerating declarative top-$K$ queries for deep neural network interpretation . . . . . . . . . . . . . 98--111 Subarna Chatterjee and Meena Jagadeesan and Wilson Qin and Stratos Idreos Cosine: a cloud-cost optimized self-designing key--value storage engine 112--126 Muhammad Adnan and Yassaman Ebrahimzadeh Maboud and Divya Mahajan and Prashant J. Nair Accelerating recommendation system training by leveraging popular choices 127--140
Jianye Yang and Yun Peng and Wenjie Zhang $ (p, q) $-biclique counting and enumeration for large sparse bipartite graphs . . . . . . . . . . . . . . . . . 141--153 Dan Graur and Ingo Müller and Mason Proffitt and Ghislain Fourny and Gordon T. Watts and Gustavo Alonso Evaluating query languages and systems for high-energy physics data . . . . . . 154--168 Kongzhang Hao and Long Yuan and Wenjie Zhang Distributed hop-constrained $s$--$t$ simple path enumeration at billion scale 169--182 Jingzhi Fang and Yanyan Shen and Yue Wang and Lei Chen ETO: accelerating optimization of DNN operators by high-performance tensor program reuse . . . . . . . . . . . . . 183--195 Philipp Marian Grulich and Steffen Zeuch and Volker Markl Babelfish: efficient execution of polyglot queries . . . . . . . . . . . . 196--210 Alexander Zhou and Yue Wang and Lei Chen Butterfly counting on uncertain bipartite graphs . . . . . . . . . . . . 211--223 Yue Cui and Kai Zheng and Dingshan Cui and Jiandong Xie and Liwei Deng and Feiteng Huang and Xiaofang Zhou METRO: a generic graph neural network framework for multivariate time series forecasting . . . . . . . . . . . . . . 224--236 Congcong Ge and Xiaoze Liu and Lu Chen and Yunjun Gao and Baihua Zheng LargeEA: aligning entities for large-scale knowledge graphs . . . . . . 237--245 Kejing Lu and Mineichi Kudo and Chuan Xiao and Yoshiharu Ishikawa HVS: hierarchical graph structure based on Voronoi diagrams for solving approximate nearest neighbor search . . 246--258 Arif Arman and Dmitri Loguinov Origami: a high-performance mergesort framework . . . . . . . . . . . . . . . 259--271 Renzhi Wu and Bolin Ding and Xu Chu and Zhewei Wei and Xiening Dai and Tao Guan and Jingren Zhou Learning to be a statistician: learned estimator for number of distinct values 272--284 Shangdi Yu and Yiqiu Wang and Yan Gu and Laxman Dhulipala and Julian Shun ParChain: a framework for parallel hierarchical agglomerative clustering using nearest-neighbor chain . . . . . . 285--298 Komal Chauhan and Kartik Jain and Sayan Ranu and Srikanta Bedathur and Amitabha Bagchi Answering regular path queries through exemplars . . . . . . . . . . . . . . . 299--311 Xupeng Miao and Hailin Zhang and Yining Shi and Xiaonan Nie and Zhi Yang and Yangyu Tao and Bin Cui HET: scaling out huge embedding model training via cache-enabled distributed framework . . . . . . . . . . . . . . . 312--320 Pengfei Li and Yu Hua and Jingnan Jia and Pengfei Zuo FINEdex: a fine-grained learned index scheme for scalable and concurrent memory systems . . . . . . . . . . . . . 321--334 Jiyang Bai and Peixiang Zhao TaGSim: type-aware graph similarity learning and computation . . . . . . . . 335--347 Yuqing Zhu and Jing Tang and Xueyan Tang and Lei Chen Analysis of influence contribution in social advertising . . . . . . . . . . . 348--360 Georgios Theodorakis and Fotios Kounelis and Peter Pietzuch and Holger Pirk Scabbard: single-node fault-tolerant stream processing . . . . . . . . . . . 361--374 George Konstantinidis and Jet Holt and Adriane Chapman Enabling personal consent in databases 375--387
Yejia Liu and Weiyuan Wu and Lampros Flokas and Jiannan Wang and Eugene Wu Enabling SQL-based training data debugging for federated learning . . . . 388--400 Kapil Vaidya and Anshuman Dutt and Vivek Narasayya and Surajit Chaudhuri Leveraging query logs and machine learning for parametric query optimization . . . . . . . . . . . . . . 401--413 Yao Lu and Srikanth Kandula and Arnd Christian König and Surajit Chaudhuri Pre-training summarization models of structured datasets for cardinality estimation . . . . . . . . . . . . . . . 414--426 Susie Xi Rao and Shuai Zhang and Zhichao Han and Zitao Zhang and Wei Min and Zhiyao Chen and Yinan Shan and Yang Zhao and Ce Zhang xFraud: explainable fraud transaction detection . . . . . . . . . . . . . . . 427--436 Ye Yuan and Delong Ma and Zhenyu Wen and Zhiwei Zhang and Guoren Wang Subgraph matching over graph federation 437--450 Xing Niu and Boris Glavic and Ziyu Liu and Pengyuan Li and Dieter Gawlick and Vasudha Krishnaswamy and Zhen Hua Liu and Danica Porobic Provenance-based data skipping . . . . . 451--464 Di Jin and Bunyamin Sisman and Hao Wei and Xin Luna Dong and Danai Koutra Deep transfer learning for multi-source entity linkage via domain adaptation . . 465--477 Lu Xing and Eric Lee and Tong An and Bo-Cheng Chu and Ahmed Mahmood and Ahmed M. Aly and Jianguo Wang and Walid G. Aref An experimental evaluation and investigation of waves of misery in $r$-trees . . . . . . . . . . . . . . . 478--490 Yongyi Liu and Ahmed R. Mahmood and Amr Magdy and Sergio Rey PRUC: P-regions with user-defined constraint . . . . . . . . . . . . . . . 491--503 Yile Chen and Xiucheng Li and Gao Cong and Cheng Long and Zhifeng Bao and Shang Liu and Wanli Gu and Fuzheng Zhang Points-of-interest relationship inference with spatial-enriched graph neural networks . . . . . . . . . . . . 504--512 Tsz Nam Chan and Pak Lon Ip and Leong Hou U. and Byron Choi and Jianliang Xu SAFE: a share-and-aggregate bandwidth exploration framework for kernel density visualization . . . . . . . . . . . . . 513--526 Jens Dittrich and Joris Nix and Christian Schön The next 50 years in database indexing or: the case for automatically generated index structures . . . . . . . . . . . . 527--540 Koral Chapnik and Ilya Kolchinsky and Assaf Schuster DARLING: data-aware load shedding in complex event processing systems . . . . 541--554 Danyang Zhuo and Kaiyuan Zhang and Zhuohan Li and Siyuan Zhuang and Stephanie Wang and Ang Chen and Ion Stoica Rearchitecting in-memory object stores for low latency . . . . . . . . . . . . 555--568 Pingchuan Ma and Shuai Wang MT-teql: evaluating and augmenting neural NLIDB on real-world linguistic and schema variations . . . . . . . . . 569--582 Jessica Shi and Laxman Dhulipala and Julian Shun Theoretically and practically efficient parallel nucleus decomposition . . . . . 583--596 Baotong Lu and Jialin Ding and Eric Lo and Umar Farooq Minhas and Tianzheng Wang APEX: a high-performance learned index on persistent memory . . . . . . . . . . 597--610 David Campos and Tung Kieu and Chenjuan Guo and Feiteng Huang and Kai Zheng and Bin Yang and Christian S. Jensen Unsupervised time series outlier detection with diversity-driven convolutional ensembles . . . . . . . . 611--623 Xiaoye Miao and Yangyang Wu and Lu Chen and Yunjun Gao and Jun Wang and Jianwei Yin Efficient and effective data imputation with influence functions . . . . . . . . 624--632 Adrian Kochsiek and Rainer Gemulla Parallel training of knowledge graph embedding models: a comparison of techniques . . . . . . . . . . . . . . . 633--645 Gerardo Vitagliano and Lan Jiang and Felix Naumann Detecting layout templates in complex multiregion files . . . . . . . . . . . 646--658 Kajetan Maliszewski and Jorge-Arnulfo Quiané-Ruiz and Jonas Traub and Volker Markl What is the price for joining securely?: benchmarking equi-joins in trusted execution environments . . . . . . . . . 659--672 Van Long Ho and Nguyen Ho and Torben Bach Pedersen Efficient temporal pattern mining in big time series using mutual information . . 673--685 Junhua Zhang and Long Yuan and Wentao Li and Lu Qin and Ying Zhang Efficient label-constrained shortest path queries on road networks: a tree decomposition approach . . . . . . . . . 686--698 Sahaana Suri and Ihab F. Ilyas and Christopher Ré and Theodoros Rekatsinas Ember: no-code context enrichment via similarity-based keyless joins . . . . . 699--712 Tin Vu and Ahmed Eldawy and Vagelis Hristidis and Vassilis Tsotras Incremental partitioning for efficient spatial data analytics . . . . . . . . . 713--726 Doris Jung-Lin Lee and Dixin Tang and Kunal Agarwal and Thyne Boonmark and Caitlyn Chen and Jake Kang and Ujjaini Mukhopadhyay and Jerry Song and Micah Yong and Marti A. Hearst and Aditya G. Parameswaran Lux: always-on visualization recommendations for exploratory dataframe workflows . . . . . . . . . . 727--738 Devin Petersohn and Dixin Tang and Rehan Durrani and Areg Melik-Adamyan and Joseph E. Gonzalez and Anthony D. Joseph and Aditya G. Parameswaran Flexible rule-based decomposition and metadata independence in modin: a parallel dataframe system . . . . . . . 739--751
Yuxing Han and Ziniu Wu and Peizhi Wu and Rong Zhu and Jingyi Yang and Liang Wei Tan and Kai Zeng and Gao Cong and Yanzhao Qin and Andreas Pfadler and Zhengping Qian and Jingren Zhou and Jiangneng Li and Bin Cui Cardinality estimation in DBMS: a comprehensive benchmark evaluation . . . 752--765 Qizhen Zhang and Philip A. Bernstein and Daniel S. Berger and Badrish Chandramouli Redy: remote dynamic memory cache . . . 766--779 Martin Boissier Robust and budget-constrained encoding configurations for in-memory database systems . . . . . . . . . . . . . . . . 780--793 Shulong Tan and Weijie Zhao and Ping Li Fast neural ranking on bipartite graph indices . . . . . . . . . . . . . . . . 794--803 Shaoduo Gan and Jiawei Jiang and Binhang Yuan and Ce Zhang and Xiangru Lian and Rui Wang and Jianbin Chang and Chengjun Liu and Hongmei Shi and Shengzhuo Zhang and Xianghong Li and Tengxu Sun and Sen Yang and Ji Liu Bagua: scaling up distributed learning with system relaxations . . . . . . . . 804--813 Tsz Nam Chan and Pak Lon Ip and Leong Hou U. and Byron Choi and Jianliang Xu SWS: a complexity-optimized solution for spatial-temporal kernel density visualization . . . . . . . . . . . . . 814--827 Junxu Liu and Jian Lou and Li Xiong and Jinfei Liu and Xiaofeng Meng Projected federated averaging with heterogeneous differential privacy . . . 828--840 Daniel Haimovich and Dima Karamshuk and Thomas J. Leeper and Evgeniy Riabenko and Milan Vojnovic Popularity prediction for social media over arbitrary time horizons . . . . . . 841--849 Ishita Doshi and Dhritiman Das and Ashish Bhutani and Rajeev Kumar and Rushi Bhatt and Niranjan Balasubramanian LANNS: a web-scale approximate nearest neighbor lookup system . . . . . . . . . 850--858 Eduardo H. M. Pena and Eduardo C. de Almeida and Felix Naumann Fast detection of denial constraint violations . . . . . . . . . . . . . . . 859--871 Bowen Yu and Guanyu Feng and Huanqi Cao and Xiaohan Li and Zhenbo Sun and Haojie Wang and Xiaowei Zhu and Weimin Zheng and Wenguang Chen Chukonu: a fully-featured high-performance big data framework that integrates a native compute engine into Spark . . . . . . . . . . . . . . . . . 872--885 Sian Jin and Chengming Zhang and Xintong Jiang and Yunhe Feng and Hui Guan and Guanpeng Li and Shuaiwen Leon Song and Dingwen Tao COMET: a novel memory-efficient deep learning training framework by using error-bounded lossy compression . . . . 886--899 Zitao Li and Bolin Ding and Ce Zhang and Ninghui Li and Jingren Zhou Federated matrix factorization with privacy guarantee . . . . . . . . . . . 900--913 Chi Thang Duong and Trung Dung Hoang and Hongzhi Yin and Matthias Weidlich and Quoc Viet Hung Nguyen and Karl Aberer Scalable robust graph embedding with Spark . . . . . . . . . . . . . . . . . 914--922 Debjyoti Paul and Jie Cao and Feifei Li and Vivek Srikumar Database workload characterization with query plan encoders . . . . . . . . . . 923--935 Abhishek Modi and Kaushik Rajan and Srinivas Thimmaiah and Prakhar Jain and Swinky Mann and Ayushi Agarwal and Ajith Shetty and Shahid K. I. and Ashit Gosalia and Partho Sarthi New query optimization techniques in the Spark engine of Azure synapse . . . . . 936--948 Phanwadee Sinthong and Dhaval Patel and Nianjun Zhou and Shrey Shrivastava and Arun Iyengar and Anuradha Bhamidipaty DQDF: data-quality-aware dataframes . . 949--957 Archita Agarwal and Marilyn George and Aaron Jeyaraj and Malte Schwarzkopf Retrofitting GDPR compliance onto legacy databases . . . . . . . . . . . . . . . 958--970 Xinle Wu and Dalin Zhang and Chenjuan Guo and Chaoyang He and Bin Yang and Christian S. Jensen AutoCTS: automated correlated time series forecasting . . . . . . . . . . . 971--983 Sivaprasad Sudhir and Michael Cafarella and Samuel Madden Replicated layout for in-memory database systems . . . . . . . . . . . . . . . . 984--997
Anupam Sanghi and Shadab Ahmed and Jayant R. Haritsa Projection-compliant database generation 998--1010 Guodong Jin and Semih Salihoglu Making RDBMSs efficient on graph workloads through predefined joins . . . 1011--1023 Shaleen Deep and Xiao Hu and Paraschos Koutris Ranked enumeration of join queries with projections . . . . . . . . . . . . . . 1024--1037 Ahnjae Shin and Joo Seong Jeong and Do Yoon Kim and Soyoung Jung and Byung-Gon Chun Hippo: sharing computations in hyper-parameter optimization . . . . . . 1038--1052 Arik Rinberg and Tomer Solomon and Roee Shlomo and Guy Khazma and Gal Lushi and Idit Keidar and Paula Ta-Shma DSON: JSON CRDT using delta-mutations for document stores . . . . . . . . . . 1053--1065 Sepanta Zeighami and Ritesh Ahuja and Gabriel Ghinita and Cyrus Shahabi A neural database for differentially private spatial range queries . . . . . 1066--1078 Marcel Maltry and Jens Dittrich A critical analysis of recursive model indexes . . . . . . . . . . . . . . . . 1079--1091 Zerui Ge and Dumitrel Loghin and Beng Chin Ooi and Pingcheng Ruan and Tianwen Wang Hybrid blockchain database systems: design and performance . . . . . . . . . 1092--1104 Angela Bonifati and Stefania Dumbrava and George Fletcher and Jan Hidders and Matthias Hofer and Wim Martens and Filip Murlak and Joshua Shinavier and S\lawek Staworko and Dominik Tomaszuk Threshold queries in theory and in the wild . . . . . . . . . . . . . . . . . . 1105--1118 Moritz Sichert and Thomas Neumann User-defined operators: efficiently integrating custom algorithms into modern databases . . . . . . . . . . . . 1119--1131
Yue Wang and Vivek Narasayya and Yeye He and Surajit Chaudhuri PACk: an efficient partition-based distributed agglomerative hierarchical clustering algorithm for deduplication 1132--1145 Lijun Chang and Zhiyi Wang A near-optimal approach to edge connectivity-based hierarchical graph decomposition . . . . . . . . . . . . . 1146--1158 Yongxin Tong and Xuchen Pan and Yuxiang Zeng and Yexuan Shi and Chunbo Xue and Zimu Zhou and Xiaofei Zhang and Lei Chen and Yi Xu and Ke Xu and Weifeng Lv Hu-Fu: efficient and secure spatial queries over data federation . . . . . . 1159--1172 Per Fuchs and Domagoj Margan and Jana Giceva Sortledton: a universal, transactional graph data structure . . . . . . . . . . 1173--1186 Bowen Zhang and Shengan Zheng and Zhenlin Qi and Linpeng Huang NBTree: a lock-free PM-friendly persistent B+-tree for eADR-enabled PM systems . . . . . . . . . . . . . . . . 1187--1200 Shreshth Tuli and Giuliano Casale and Nicholas R. Jennings TranAD: deep transformer networks for anomaly detection in multivariate time series data . . . . . . . . . . . . . . 1201--1214 Fuheng Zhao and Divyakant Agrawal and Amr El Abbadi and Ahmed Metwally SpaceSaving$ \pm $: an optimal algorithm for frequency estimation and frequent items in the bounded-deletion model . . 1215--1227 Chenguang Zheng and Hongzhi Chen and Yuxuan Cheng and Zhezheng Song and Yifan Wu and Changji Li and James Cheng and Hao Yang and Shuai Zhang ByteGNN: efficient graph neural network training at large scale . . . . . . . . 1228--1242 Yuli Jiang and Yu Rong and Hong Cheng and Xin Huang and Kangfei Zhao and Junzhou Huang Query driven-graph neural networks for community search: from non-attributed, attributed, to interactive attributed 1243--1255 Yang Li and Yu Shen and Huaijun Jiang and Wentao Zhang and Jixiang Li and Ji Liu and Ce Zhang and Bin Cui Hyper-tune: towards efficient hyper-parameter tuning at scale . . . . 1256--1265 Koen Minartz and Jens E. d'Hondt and Odysseas Papapetrou Multivariate correlations discovery in static and streaming data . . . . . . . 1266--1278 Olga Poppe and Qun Guo and Willis Lang and Pankaj Arora and Morgan Oslake and Shize Xu and Ajay Kalhan Moneyball: proactive auto-scaling in Microsoft Azure SQL database serverless 1279--1287 Kewei Cheng and Xian Li and Yifan Ethan Xu and Xin Luna Dong and Yizhou Sun PGE: robust product graph embedding learning for error detection . . . . . . 1288--1296 Naga Nithin Manne and Shilvi Satpati and Tanu Malik and Amitabha Bagchi and Ashish Gehani and Amitabh Chaudhary CHEX: multiversion replay with ordered checkpoints . . . . . . . . . . . . . . 1297--1310
Tomer Even and Guy Even and Adam Morrison Prefix filter: practically and theoretically better than bloom . . . . 1311--1323 Hiroyuki Yamada and Jun Nemoto Scalar DL: scalable and practical Byzantine fault detection for transactional database systems . . . . . 1324--1336 Gyuyeong Kim and Wonjun Lee In-network leaderless replication for distributed data stores . . . . . . . . 1337--1349 Xin Sun and Xin Huang and Di Jin Fast algorithms for core maximization on large graphs . . . . . . . . . . . . . . 1350--1362 Shuye Pan and Peng Wang and Chen Wang and Wei Wang and Jianmin Wang NLC: search correlated window pairs on long time series . . . . . . . . . . . . 1363--1375 Hanzhi Wang and Zhewei Wei and Junhao Gan and Ye Yuan and Xiaoyong Du and Ji-Rong Wen Edge-based local push for personalized PageRank . . . . . . . . . . . . . . . . 1376--1389 Harry Kai-Ho Chan and Huan Li and Xiao Li and Hua Lu Continuous social distance monitoring in indoor space . . . . . . . . . . . . . . 1390--1402 Xibo Sun and Shixuan Sun and Qiong Luo and Bingsheng He An in-depth study of continuous subgraph matching . . . . . . . . . . . . . . . . 1403--1416 Abhinav Mishra and Ram Sriharsha and Sichen Zhong OnlineSTL: scaling time series decomposition by $100 \times$ . . . . . 1417--1425 Haoyu Li and Qizhi Chen and Yixin Zhang and Tong Yang and Bin Cui Stingy sketch: a sketch framework for accurate and fast frequency estimation 1426--1438 Yang Wang and Miao Yu and Yujie Hui and Fang Zhou and Yuyang Huang and Rui Zhu and Xueyuan Ren and Tianxi Li and Xiaoyi Lu A study of database performance sensitivity to experiment settings . . . 1439--1452 Zemin Chao and Hong Gao and Yinan An and Jianzhong Li The inherent time complexity and an efficient algorithm for subsequence matching problem . . . . . . . . . . . . 1453--1465 Chengliang Chai and Jiabin Liu and Nan Tang and Guoliang Li and Yuyu Luo Selective data acquisition in the wild for model charging . . . . . . . . . . . 1466--1478 Wenfei Fan and Wenzhi Fu and Ruochun Jin and Ping Lu and Chao Tian Discovering association rules from big graphs . . . . . . . . . . . . . . . . . 1479--1492 Xiaolin Han and Reynold Cheng and Chenhao Ma and Tobias Grubenmann DeepTEA: effective and efficient online time-dependent trajectory outlier detection . . . . . . . . . . . . . . . 1493--1505 Giovanni Simonini and Luca Zecchini and Sonia Bergamaschi and Felix Naumann Entity resolution on-demand . . . . . . 1506--1518
Afnan Alhazmi and Tom Blount and George Konstantinidis ForBackBench: a benchmark for chasing vs. query-rewriting . . . . . . . . . . 1519--1532 Jeremy Chen and Yuqing Huang and Mushi Wang and Semih Salihoglu and Ken Salem Accurate summary-based cardinality estimation through the lens of cardinality estimation graphs . . . . . 1533--1545 Xuankun Liao and Qing Liu and Jiaxin Jiang and Xin Huang and Jianliang Xu and Byron Choi Distributed D-core decomposition over large directed graphs . . . . . . . . . 1546--1558 Lu Chen and Chengfei Liu and Rui Zhou and Jiajie Xu and Jianxin Li Efficient maximal biclique enumeration for large sparse bipartite graphs . . . 1559--1571 Hongkuan Zhou and Da Zheng and Israt Nisa and Vasileios Ioannidis and Xiang Song and George Karypis TGL: a general framework for temporal GNN training on billion-scale graphs . . 1572--1580 Binhang Yuan and Cameron R. Wolfe and Chen Dun and Yuxin Tang and Anastasios Kyrillidis and Chris Jermaine Distributed learning of fully connected neural networks using independent subnet training . . . . . . . . . . . . . . . . 1581--1590 Sebastian Burckhardt and Badrish Chandramouli and Chris Gillum and David Justo and Konstantinos Kallas and Connor McMahon and Christopher S. Meiklejohn and Xiangfeng Zhu Netherite: efficient execution of serverless workflows . . . . . . . . . . 1591--1604 Andy Huynh and Harshal A. Chaudhari and Evimaria Terzi and Manos Athanassoulis Endure: a robust tuning paradigm for LSM trees under workload uncertainty . . . . 1605--1618 Hongzheng Li and Yingxia Shao and Junping Du and Bin Cui and Lei Chen An I/O-efficient disk-based graph system for scalable second-order random walk of large graphs . . . . . . . . . . . . . . 1619--1631 Kapil Vaidya and Subarna Chatterjee and Eric Knorr and Michael Mitzenmacher and Stratos Idreos and Tim Kraska SNARF: a learning-enhanced range filter 1632--1644 Xin Chen and You Peng and Sibo Wang and Jeffrey Xu Yu DLCR: efficient indexing for label-constrained reachability queries on large dynamic graphs . . . . . . . . 1645--1657 Yue Zhao and Gao Cong and Jiachen Shi and Chunyan Miao QueryFormer: a tree transformer model for query plan representation . . . . . 1658--1670 Leon Lee and Siphrey Xie and Yunus Ma and Shimin Chen Index checkpoints for instant recovery in in-memory database systems . . . . . 1671--1683 Mahdi Esmailoghli and Jorge-Arnulfo Quiané-Ruiz and Ziawasch Abedjan MATE: multi-attribute table extraction 1684--1696 John Paparrizos and Yuhao Kang and Paul Boniol and Ruey S. Tsay and Themis Palpanas and Michael J. Franklin TSB-UAD: an end-to-end benchmark suite for univariate time-series anomaly detection . . . . . . . . . . . . . . . 1697--1711 Manuel Leone and Stefano Huber and Akhil Arora and Alberto García-Durán and Robert West A critical re-evaluation of neural methods for entity alignment . . . . . . 1712--1725 Matteo Paganelli and Francesco Del Buono and Andrea Baraldi and Francesco Guerra Analyzing how BERT performs entity matching . . . . . . . . . . . . . . . . 1726--1738
Balaji Arun and Binoy Ravindran Scalable Byzantine fault tolerance via partial decentralization . . . . . . . . 1739--1752 Huan Li and Lanjing Yi and Bo Tang and Hua Lu and Christian S. Jensen Efficient and error-bounded spatiotemporal quantile monitoring in edge computing environments . . . . . . 1753--1765 Fumiyuki Kato and Tsubasa Takahashi and Shun Takagi and Yang Cao and Seng Pei Liew and Masatoshi Yoshikawa HDPView: differentially private materialized view for exploring high dimensional relational data . . . . . . 1766--1778 Sebastian Schmidl and Phillip Wenig and Thorsten Papenbrock Anomaly detection in time series: a comprehensive evaluation . . . . . . . . 1779--1797 Brit Youngmann and Sihem Amer-Yahia and Aurelien Personnaz Guided exploration of data summaries . . 1798--1807 Xinyi Zhang and Zhuo Chang and Yang Li and Hong Wu and Jian Tan and Feifei Li and Bin Cui Facilitating database tuning with hyper-parameter optimization: a comprehensive experimental evaluation 1808--1821 Zuan Wang and Xiaofeng Ding and Hai Jin and Pan Zhou Efficient secure and verifiable location-based skyline queries over encrypted data . . . . . . . . . . . . . 1822--1834 Zhuoyue Zhao and Dong Xie and Feifei Li AB-tree: index for concurrent random sampling and updates . . . . . . . . . . 1835--1847 Chenguang Fang and Shaoxu Song and Yinan Mei On repairing timestamps for regular interval time series . . . . . . . . . . 1848--1860 Wenfei Fan and Ruochun Jin and Ping Lu and Chao Tian and Ruiqi Xu Towards event prediction in temporal graphs . . . . . . . . . . . . . . . . . 1861--1874 Yihuai Liang and Yan Li and Byeong-Seok Shin Decentralized crowdsourcing for human intelligence tasks with efficient on-chain cost . . . . . . . . . . . . . 1875--1888 Yue Wang and Ruiqi Xu and Xun Jian and Alexander Zhou and Lei Chen Towards distributed bitruss decomposition on bipartite graphs . . . 1889--1901 Luca Gagliardelli and George Papadakis and Giovanni Simonini and Sonia Bergamaschi and Themis Palpanas Generalized supervised meta-blocking . . 1902--1910 Mijin An and Soojun Im and Dawoon Jung and Sang-Won Lee Your read is our priority in flash storage . . . . . . . . . . . . . . . . 1911--1923 Arindam Bhattacharya and Chathur Gudesa and Amitabha Bagchi and Srikanta Bedathur New wine in an old bottle: data-aware hash functions for Bloom filters . . . . 1924--1936 Jingshu Peng and Zhao Chen and Yingxia Shao and Yanyan Shen and Lei Chen and Jiannong Cao Sancus: staleness-aware communication-avoiding full-graph decentralized training in large-scale graph neural networks . . . . . . . . . 1937--1950 Marco Bucchi and Alejandro Grez and Andrés Quintana and Cristian Riveros and Stijn Vansummeren CORE: a complex event recognition engine 1951--1964 Audrey Cheng and Xiao Shi and Aaron Kabcenell and Shilpa Lawande and Hamza Qadeer and Jason Chan and Harrison Tin and Ryan Zhao and Peter Bailis and Mahesh Balakrishnan and Nathan Bronson and Natacha Crooks and Ion Stoica TAOBench: an end-to-end benchmark for social network workloads . . . . . . . . 1965--1977
Aarati Kakaraparthy and Jignesh M. Patel and Brian P. Kroth and Kwanghyun Park VIP hashing: adapting to skew in popularity of data on the fly . . . . . 1978--1990 Tobias Vinçon and Christian Knödler and Leonardo Solis-Vasquez and Arthur Bernhardt and Sajjad Tamimi and Lukas Weber and Florian Stock and Andreas Koch and Ilia Petrov Near-data processing in database systems on native computational storage under HTAP workloads . . . . . . . . . . . . . 1991--2004 Karima Echihabi and Panagiota Fatourou and Kostas Zoumpatianos and Themis Palpanas and Houda Benbrahim Hercules against data series similarity search . . . . . . . . . . . . . . . . . 2005--2018 Tarique Siddiqui and Wentao Wu and Vivek Narasayya and Surajit Chaudhuri DISTILL: low-overhead data-driven techniques for filtering and costing indexes for scalable index tuning . . . 2019--2031 Zhihui Yang and Zuozhi Wang and Yicong Huang and Yao Lu and Chen Li and X. Sean Wang Optimizing machine learning inference queries with correlative proxy models 2032--2044 Li Su and Xiaoming Qin and Zichao Zhang and Rui Yang and Le Xu and Indranil Gupta and Wenyuan Yu and Kai Zeng and Jingren Zhou Banyan: a scoped dataflow engine for graph query service . . . . . . . . . . 2045--2057 Ziyue Huang and Yuan Qiu and Ke Yi and Graham Cormode Frequency estimation under multiparty differential privacy: one-shot and streaming . . . . . . . . . . . . . . . 2058--2070 Ahsan Ali and Riccardo Pinciroli and Feng Yan and Evgenia Smirni Optimizing inference serving on serverless platforms . . . . . . . . . . 2071--2084 Wail Y. Alkowaileet and Michael J. Carey Columnar formats for schemaless LSM-based document stores . . . . . . . 2085--2097 Yu-Xuan Qiu and Dong Wen and Lu Qin and Wentao Li and Rong-Hua Li and Ying Zhang Efficient shortest path counting on large road networks . . . . . . . . . . 2098--2110 Fangcheng Fu and Xupeng Miao and Jiawei Jiang and Huanran Xue and Bin Cui Towards communication-efficient vertical federated learning training via cache-enabled local updates . . . . . . 2111--2120 Yifan Zhu and Lu Chen and Yunjun Gao and Baihua Zheng and Pengfei Wang DESIRE: an efficient dynamic cluster-based forest indexing for similarity search in multi-metric spaces 2121--2133 Junghoon Kim and Kaiyu Feng and Gao Cong and Diwen Zhu and Wenyuan Yu and Chunyan Miao ABC: attributed bipartite co-clustering 2134--2147 Jinzhao Xiao and Yuxiang Huang and Changyu Hu and Shaoxu Song and Xiangdong Huang and Jianmin Wang Time series data encoding for efficient storage: a comparative analysis in Apache IoTDB . . . . . . . . . . . . . . 2148--2160 Teng Zhang and Jian Tan and Xin Cai and Jianying Wang and Feifei Li and Jianling Sun SA-LSM: optimize data layout for LSM-tree based storage using survival analysis . . . . . . . . . . . . . . . . 2161--2174 Paolo Ferragina and Giovanni Manzini and Travis Gagie and Dominik Köppl and Gonzalo Navarro and Manuel Striani and Francesco Tosoni Improving matrix-vector multiplication via lossless grammar-compressed matrices 2175--2187 Shangyu Wu and Yufei Cui and Jinghuan Yu and Xuan Sun and Tei-Wei Kuo and Chun Jason Xue NFL: robust learned index via distribution transformation . . . . . . 2188--2200 Hamidreza Zare and Viveck Ramesh Cadambe and Bhuvan Urgaonkar and Nader Alfares and Praneet Soni and Chetan Sharma and Arif A. Merchant LEGOStore: a linearizable geo-distributed store combining replication and erasure coding . . . . . 2201--2215 Michael Simpson and Farnoosh Hashemi and Laks V. S. Lakshmanan Misinformation mitigation under differential propagation rates and temporal penalties . . . . . . . . . . . 2216--2229 Lixi Zhou and Jiaqing Chen and Amitabh Das and Hong Min and Lei Yu and Ming Zhao and Jia Zou Serving deep learning models with deduplication from relational databases 2230--2243 Zichun Huang and Shimin Chen Density-optimized intersection-free mapping and matrix multiplication for join-project operations . . . . . . . . 2244--2256 Shiva Jahangiri and Michael J. Carey and Johann-Christoph Freytag Design trade-offs for a robust dynamic hybrid hash join . . . . . . . . . . . . 2257--2269 Yannis Foufoulas and Alkis Simitsis and Lefteris Stamatogiannakis and Yannis Ioannidis YeSQL: ``you extend SQL'' with rich and highly performant user-defined functions in relational databases . . . . . . . . 2270--2283 Shqiponja Ahmetaj and Bianca Löhnert and Magdalena Ortiz and Mantas Simkus Magic shapes for SHACL validation . . . 2284--2296
Arpit Merchant and Aristides Gionis and Michael Mathioudakis Succinct graph representations as distance oracles: an experimental evaluation . . . . . . . . . . . . . . . 2297--2306 Yangqin Jiang and Yixiang Fang and Chenhao Ma and Xin Cao and Chunshan Li Effective community search over large star-schema heterogeneous information networks . . . . . . . . . . . . . . . . 2307--2320 Kai Ming Ting and Zongyou Liu and Hang Zhang and Ye Zhu A new distributional treatment for time series and an anomaly detection investigation . . . . . . . . . . . . . 2321--2333 Benjamin Denham and Edmund M-K. Lai and Roopak Sinha and M. Asif Naeem Witan: unsupervised labelling function generation for assisted data programming 2334--2347 Ergute Bao and Yizheng Zhu and Xiaokui Xiao and Yin Yang and Beng Chin Ooi and Benjamin Hong Meng Tan and Khin Mi Mi Aung Skellam mixture mechanism: a novel approach to federated learning with differential privacy . . . . . . . . . . 2348--2360 Benjamin Hilprecht and Carsten Binnig Zero-shot cost models for out-of-the-box learned cost prediction . . . . . . . . 2361--2374 Dalsu Choi and Hyunsik Yoon and Hyubjin Lee and Yon Dohn Chung Waffle: in-memory grid index for moving objects with reinforcement learning-based configuration tuning system . . . . . . . . . . . . . . . . . 2375--2388 Michael Jungmair and André Kohn and Jana Giceva Designing an open framework for query optimization and compilation . . . . . . 2389--2401 Lam-Duy Nguyen and Sang-Won Lee and Beomseok Nam In-page shadowing and two-version timestamp ordering for mobile DBMSs . . 2402--2414 Shixuan Sun and Xibo Sun and Bingsheng He and Qiong Luo RapidFlow: an efficient approach to continuous subgraph matching . . . . . . 2415--2427 Mossad Helali and Essam Mansour and Ibrahim Abdelaziz and Julian Dolby and Kavitha Srinivas A scalable AutoML approach based on graph neural networks . . . . . . . . . 2428--2436 Primal Pappachan and Shufan Zhang and Xi He and Sharad Mehrotra Don't be a tattle-tale: preventing leakages through data dependencies on access control protected data . . . . . 2437--2449 Qingyu Xu and Feng Zhang and Zhiming Yao and Lv Lu and Xiaoyong Du and Dong Deng and Bingsheng He Efficient load-balanced butterfly counting on GPU . . . . . . . . . . . . 2450--2462 Lawrence Benson and Leon Papke and Tilmann Rabl PerMA-bench: benchmarking persistent memory access . . . . . . . . . . . . . 2463--2476 Yuliang He and Duo Lu and Kaisong Huang and Tianzheng Wang Evaluating persistent memory range indexes: part two . . . . . . . . . . . 2477--2490 Bobbi W. Yogatama and Weiwei Gong and Xiangyao Yu Orchestrating data placement and query execution in heterogeneous CPU-GPU DBMS 2491--2503 Weicheng Wang and Raymond Chi-Wing Wong Interactive mining with ordered and unordered attributes . . . . . . . . . . 2504--2516 Wenzhe Yang and Sheng Wang and Yuan Sun and Zhiyong Peng Fast dataset search with earth mover's distance . . . . . . . . . . . . . . . . 2517--2529 João L. M. Pereira and João Casanova and Helena Galhardas and Dennis Shasha AcX: system, techniques, and experiments for acronym expansion . . . . . . . . . 2530--2544 Hongzhi Chen and Changji Li and Chenguang Zheng and Chenghuan Huang and Juncheng Fang and James Cheng and Jian Zhang G-tran: a high performance distributed graph database with a decentralized architecture . . . . . . . . . . . . . . 2545--2558 Arnd Christian König and Yi Shan and Tobias Ziegler and Aarati Kakaraparthy and Willis Lang and Justin Moeller and Ajay Kalhan and Vivek Narasayya Tenant placement in over-subscribed database-as-a-service clusters . . . . . 2559--2571 Yue Chen and Kaiyu Feng and Gao Cong and Han Mao Kiah Example-based spatial pattern matching 2572--2584 Zeshun Peng and Yanfeng Zhang and Qian Xu and Haixu Liu and Yuxiao Gao and Xiaohua Li and Ge Yu NeuChain: a fast permissioned blockchain system with deterministic ordering . . . 2585--2598 Ryan McKenna and Brett Mullins and Daniel Sheldon and Gerome Miklau AIM: an adaptive and iterative mechanism for differentially private synthetic data . . . . . . . . . . . . . . . . . . 2599--2612 Etienne Toussaint and Paolo Guagliardo and Leonid Libkin and Juan Sequeda Troubles with nulls, views from the users . . . . . . . . . . . . . . . . . 2613--2625 Yeonhong Park and Sunhong Min and Jae W. Lee Ginex: SSD-enabled billion-scale graph neural network training on a single machine via provably optimal in-memory caching . . . . . . . . . . . . . . . . 2626--2639 Junhua Zhang and Wentao Li and Long Yuan and Lu Qin and Ying Zhang and Lijun Chang Shortest-path queries on complex networks: experiments, analyses, and improvement . . . . . . . . . . . . . . 2640--2652 Sameera Ghayyur and Dhrubajyoti Ghosh and Xi He and Sharad Mehrotra MIDE: accuracy aware minimally invasive data exploration for decision support 2653--2665 Dhrubajyoti Ghosh and Peeyush Gupta and Sharad Mehrotra and Roberto Yus and Yasser Altowim JENNER: just-in-time enrichment in query processing . . . . . . . . . . . . . . . 2666--2678 Jiaoyi Zhang and Yihan Gao CARMI: a cache-aware learned index with a cost-based construction algorithm . . 2679--2691
Monica Chiosa and Fabio Maschi and Ingo Müller and Gustavo Alonso and Norman May Hardware acceleration of compression and encryption in SAP HANA . . . . . . . . . 3277--3291 Martin Graf and Lukas Laskowski and Florian Papsdorf and Florian Sold and Roland Gremmelspacher and Felix Naumann and Fabian Panse Frost: a platform for benchmarking and exploring data matching results . . . . 3292--3305 Changji Li and Hongzhi Chen and Shuai Zhang and Yingqian Hu and Chao Chen and Zhenjie Zhang and Meng Li and Xiangchen Li and Dongqing Han and Xiaohui Chen and Xudong Wang and Huiming Zhu and Xuwei Fu and Tingwei Wu and Hongfei Tan and Hengtian Ding and Mengjin Liu and Kangcheng Wang and Ting Ye and Lei Li and Xin Li and Yu Wang and Chenguang Zheng and Hao Yang and James Cheng ByteGraph: a high-performance distributed graph database in ByteDance 3306--3318 Prakash Das and Shivangi Srivastava and Valentin Moskovich and Anmol Chaturvedi and Anant Mittal and Yongqin Xiao and Mosharaf Chowdhury CDI-E: an elastic cloud service for data engineering . . . . . . . . . . . . . . 3319--3331 Sheng Wang and Yiran Li and Huorong Li and Feifei Li and Chengjin Tian and Le Su and Yanshan Zhang and Yubing Ma and Lie Yan and Yuanyuan Sun and Xuntao Cheng and Xiaolong Xie and Yu Zou Operon: an encrypted database for ownership-preserving data management . . 3332--3345 Caixin Gong and Chengjin Tian and Zhengheng Wang and Sheng Wang and Xiyu Wang and Qiulei Fu and Wu Qin and Long Qian and Rui Chen and Jiang Qi and Ruo Wang and Guoyun Zhu and Chenghu Yang and Wei Zhang and Feifei Li Tair-PMem: a fully durable non-volatile memory database . . . . . . . . . . . . 3346--3358 Branimir Lambov Trie memtables in Cassandra . . . . . . 3359--3371 Pedro Pedreira and Orri Erling and Masha Basmanova and Kevin Wilfong and Laith Sakka and Krishna Pai and Wei He and Biswapesh Chattopadhyay Velox: meta's unified execution engine 3372--3384 Zhenkun Yang and Chuanhui Yang and Fusheng Han and Mingqiang Zhuang and Bing Yang and Zhifeng Yang and Xiaojun Cheng and Yuzhong Zhao and Wenhui Shi and Huafeng Xi and Huang Yu and Bin Liu and Yi Pan and Boxue Yin and Junquan Chen and Quanqing Xu OceanBase: a 707 million tpmC distributed relational database system 3385--3397 Hai Lan and Jiong Xie and Zhifeng Bao and Feifei Li and Wei Tian and Fang Wang and Sheng Wang and Ailin Zhang VRE: a versatile, robust, and economical trajectory data system . . . . . . . . . 3398--3410 Jianjun Chen and Yonghua Ding and Ye Liu and Fangshi Li and Li Zhang and Mingyi Zhang and Kui Wei and Lixun Cao and Dan Zou and Yang Liu and Lei Zhang and Rui Shi and Wei Ding and Kai Wu and Shangyu Luo and Jason Sun and Yuming Liang ByteHTAP: Bytedance's HTAP system with high data freshness and strong data consistency . . . . . . . . . . . . . . 3411--3424 Wolfram Wingerath and Benjamin Wollmer and Markus Bestehorn and Stephan Succo and Sophie Ferrlein and Florian Bücklers and Jörn Domnik and Fabian Panse and Erik Witt and Anil Sener and Felix Gessert and Norbert Ritter Beaconnect: continuous web performance A/B testing at scale . . . . . . . . . . 3425--3431 Zongzhi Chen and Xinjun Yang and Feifei Li and Xuntao Cheng and Qingda Hu and Zheyu Miao and Rongbiao Xie and Xiaofei Wu and Kang Wang and Zhao Song and Haiqing Sun and Zechao Zhuang and Yuming Yang and Jie Xu and Liang Yin and Wenchao Zhou and Sheng Wang CloudJump: optimizing cloud databases for cloud storages . . . . . . . . . . . 3432--3444 Kaiping Zheng and Shaofeng Cai and Horng Ruey Chua and Melanie Herschel and Meihui Zhang and Beng Chin Ooi DyHealth: making neural networks dynamic for effective healthcare analytics . . . 3445--3458 Andrey Mishchenko and Dominique Danco and Abhilash Jindal and Adrian Blue Blueprint: a constraint-solving approach for document extraction . . . . . . . . 3459--3471 Muzhi Yu and Zhaoxiang Lin and Jinan Sun and Runyun Zhou and Guoqiang Jiang and Hua Huang and Shikun Zhang TencentCLS: the cloud log service with high query performances . . . . . . . . 3472--3482 Jiong Xie and Zhen Chen and Jianwei Liu and Fang Wang and Feifei Li and Zhida Chen and Yinpei Liu and Songlu Cai and Zhenhua Fan and Fei Xiao and Yue Chen Ganos: a multidimensional, dynamic, and scene-oriented cloud-native spatial database engine . . . . . . . . . . . . 3483--3495 Sarath Lakshman and Apaar Gupta and Rohan Suri and Scott Lashley and John Liang and Srinath Duvuru and Ravi Mayuram Magma: a high data density storage engine used in Couchbase . . . . . . . . 3496--3508 Joyce Cahoon and Wenjing Wang and Yiwen Zhu and Katherine Lin and Sean Liu and Raymond Truong and Neetu Singh and Chengcheng Wan and Alexandra Ciortea and Sreraman Narasimhan and Subru Krishnan Doppler: automated SKU recommendation in migrating SQL workloads to the cloud . . 3509--3521 Stavros Harizopoulos and Taylor Hopper and Morton Mo and Shyam Sundar Chandrasekaran and Tongguang Chen and Yan Cui and Nandini Ganesh and Gary Helmling and Hieu Pham and Sebastian Wong Meta's next-generation realtime monitoring and analytics platform . . . 3522--3534 Kevin P. Gaffney and Martin Prammer and Larry Brasfield and D. Richard Hipp and Dan Kennedy and Jignesh M. Patel SQLite: past, present, and future . . . 3535--3547 Rentong Guo and Xiaofan Luan and Long Xiang and Xiao Yan and Xiaomeng Yi and Jigao Luo and Qianya Cheng and Weizhi Xu and Jiarui Luo and Frank Liu and Zhenshan Cao and Yanliang Qiao and Ting Wang and Bo Tang and Charles Xie Manu: a cloud native vector database management system . . . . . . . . . . . 3548--3561 Sainyam Galhotra and Udayan Khurana Automated relational data explanation using external semantic knowledge . . . 3562--3565 Andrea Rossi and Donatella Firmani and Paolo Merialdo and Tommaso Teofili Kelpie: an explainability framework for embedding-based link prediction models 3566--3569 Yin Lin and Brit Youngmann and Yuval Moskovitch and H. V. Jagadish and Tova Milo OREO: detection of cherry-picked generalizations . . . . . . . . . . . . 3570--3573 André Kohn and Dominik Moritz and Mark Raasveldt and Hannes Mühleisen and Thomas Neumann DuckDB-wasm: fast analytical processing for the web . . . . . . . . . . . . . . 3574--3577 Yihai Xi and Ning Wang and Xinyu Chen and Yiyi Zhang and Zilong Wang and Zhihong Xu and Yue Wang EasyDR: a human-in-the-loop error detection & repair platform for holistic table cleaning . . . . . . . . . . . . . 3578--3581 Xuchen Pan and Yongxin Tong and Chunbo Xue and Zimu Zhou and Junping Du and Yuxiang Zeng and Yexuan Shi and Xiaofei Zhang and Lei Chen and Yi Xu and Ke Xu and Weifeng Lv Hu-fu: a data federation system for secure spatial queries . . . . . . . . . 3582--3585 Marius Gassen and Benjamin Hättasch and Benjamin Hilprecht and Nadja Geisler and Alexander Fraser and Carsten Binnig Demonstrating CAT: synthesizing data-aware conversational agents for transactional databases . . . . . . . . 3586--3589 Aurélien Personnaz and Brit Youngmann and Sihem Amer-Yahia EDA4SUM: guided exploration of data summaries . . . . . . . . . . . . . . . 3590--3593 Chenjie Li and Juseung Lee and Zhengjie Miao and Boris Glavic and Sudeepa Roy CaJaDE: explaining query results by augmenting provenance with context . . . 3594--3597 Yuki Asada and Victor Fu and Apurva Gandhi and Advitya Gemawat and Lihao Zhang and Dong He and Vivek Gupta and Ehi Nosakhare and Dalitso Banda and Rathijit Sen and Matteo Interlandi Share the tensor tea: how databases can leverage the machine learning ecosystem 3598--3601 Jess Tan and Desmond Yeo and Rachael Neoh and Huey-Eng Chua and Sourav S Bhowmick MOCHA: a tool for visualizing impact of operator choices in query execution plans for database education . . . . . . 3602--3605 Tsz Nam Chan and Pak Lon Ip and Kaiyan Zhao and Leong Hou U and Byron Choi and Jianliang Xu LIBKDV: a versatile kernel density visualization library for geospatial analytics . . . . . . . . . . . . . . . 3606--3609 Arul Ajmani and Aayush Shah and Alexander Shraer and Adam Storm and Rebecca Taft and Oliver Tan and Nathan VanBenschoten A demonstration of multi-region CockroachDB . . . . . . . . . . . . . . 3610--3613 Adriane Chapman and Luca Lauro and Paolo Missier and Riccardo Torlone DPDS: assisting data science with data provenance . . . . . . . . . . . . . . . 3614--3617 Vargha Dadvar and Lukasz Golab and Divesh Srivastava POEM: pattern-oriented explanations of CNN models . . . . . . . . . . . . . . . 3618--3621 Ramon Antonio Rodriges Zalipynis and Nikita Terlych WebArrayDB: a geospatial array DBMS in your web browser . . . . . . . . . . . . 3622--3625 Hai Lan and Yuanjia Zhang and Zhifeng Bao and Yu Dong and Dongxu Huang and Liu Tang and Jian Zhang AutoDI: towards an automatic plan regression analysis . . . . . . . . . . 3626--3629 Susan B. Davidson and Shay Gershtein and Tova Milo and Slava Novgorodov and May Shoshan PHOcus: efficiently archiving photos . . 3630--3633 Kai Huang and Qingqing Ye and Jing Zhao and Xi Zhao and Haibo Hu and Xiaofang Zhou VINCENT: towards efficient exploratory subgraph search in graph databases . . . 3634--3637 Osnat Drien and Matanya Freiman and Yael Amsterdamer ActivePDB: active probabilistic databases . . . . . . . . . . . . . . . 3638--3641 Tommaso Teofili and Donatella Firmani and Nick Koudas and Paolo Merialdo and Divesh Srivastava CERTEM: explaining and debugging black-box entity resolution systems with CERTA . . . . . . . . . . . . . . . . . 3642--3645 Ahmet Kerem Aksoy and Pavel Dushev and Eleni Tzirita Zacharatou and Holmer Hemsen and Marcela Charfuelan and Jorge-Arnulfo Quiané-Ruiz and Begüm Demir and Volker Markl Satellite image search in AgoraEO . . . 3646--3649 Li Yan and Nerissa Xu and Guozhong Li and Sourav S Bhowmick and Byron Choi and Jianliang Xu SENSOR: data-driven construction of sketch-based visual query interfaces for time series data . . . . . . . . . . . . 3650--3653 Angela Bonifati and Stefania Dumbrava and Emile Martinez and Fatemeh Ghasemi and Malo Jaffré and Pacôme Luton and Thomas Pickles DiscoPG: property graph schema discovery and exploration . . . . . . . . . . . . 3654--3657 Wissam Maamar-Kouadri and Salima Benbernou and Mourad Ouziri and Themis Palpanas and Iheb Ben Amor SA-Q: observing, evaluating, and enhancing the quality of the results of sentiment analysis tools . . . . . . . . 3658--3661 Abdelghny Orogat and Ahmed El-Roby SmartBench: demonstrating automatic generation of comprehensive benchmarks for question answering over knowledge graphs . . . . . . . . . . . . . . . . . 3662--3665 Jianhong Tu and Xiaoyue Han and Ju Fan and Nan Tang and Chengliang Chai and Guoliang Li and Xiaoyong Du DADER: hands-off entity resolution with domain adaptation . . . . . . . . . . . 3666--3669 James Gale and Max Seiden and Deepanshu Utkarsh and Jason Frantz and Rob Woollen and Ça\ugatay Demiralp Sigma workbook: a spreadsheet for cloud data warehouses . . . . . . . . . . . . 3670--3673 Zihao Chen and Zhizhen Xu and Baokun Han and Chen Xu and Weining Qian and Aoying Zhou ReMac: a matrix computation system with redundancy elimination . . . . . . . . . 3674--3677 Phillip Wenig and Sebastian Schmidl and Thorsten Papenbrock TimeEval: a benchmarking toolkit for time series anomaly detection algorithms 3678--3681 Alberto Lerner and Matthias Jasny and Theo Jepsen and Carsten Binnig and Philippe Cudré-Mauroux DBMS annihilator: a high-performance database workload generator in action 3682--3685 Zhiyu Liang and Hongzhi Wang FedTSC: a secure federated learning system for interpretable time series classification . . . . . . . . . . . . . 3686--3689 Qingshun Wu and Yafei Li and Huiling Li and Di Zhang and Guanglei Zhu AMRAS: a visual analysis system for spatial crowdsourcing . . . . . . . . . 3690--3693 Hani Al-Sayeh and Muhammad Attahir Jibril and Muhammad Waleed Bin Saeed and Kai-Uwe Sattler SparkCAD: caching anomalies detector for Spark applications . . . . . . . . . . . 3694--3697 Wilco v. Leeuwen and Thomas Mulder and Bram van de Wall and George Fletcher and Nikolay Yakovets AvantGraph query processing engine . . . 3698--3701 Paul Boniol and John Paparrizos and Yuhao Kang and Themis Palpanas and Ruey S. Tsay and Aaron J. Elmore and Michael J. Franklin Theseus: navigating the labyrinth of time-series anomaly detection . . . . . 3702--3705 Dennis Hofmann and Peter VanNostrand and Huayi Zhang and Yizhou Yan and Lei Cao and Samuel Madden and Elke Rundensteiner A demonstration of AutoOD: a self-tuning anomaly detection system . . . . . . . . 3706--3709 Sunny Gakhar and Joyce Cahoon and Wangchao Le and Xiangnan Li and Kaushik Ravichandran and Hiren Patel and Marc Friedman and Brandon Haynes and Shi Qiao and Alekh Jindal and Jyoti Leeka Pipemizer: an optimizer for analytics data pipelines . . . . . . . . . . . . . 3710--3713 Sergey Redyuk and Zoi Kaoudi and Sebastian Schelter and Volker Markl DORIAN in action: assisted design of data science pipelines . . . . . . . . . 3714--3717 Yuntian He and Yue Zhang and Saket Gurukar and Srinivasan Parthasarathy WebMILE: democratizing network representation learning at scale . . . . 3718--3721 Nadja Geisler and Benjamin Hättasch and Carsten Binnig Demonstrating quest: a query-driven framework to explain classification models on tabular data . . . . . . . . . 3722--3725 Drew Ripberger and Yifan Gan and Xueyuan Ren and Spyros Blanas and Yang Wang IsoBugView: interactively debugging isolation bugs in database applications 3726--3729 Yannis Foufoulas and Alkis Simitsis and Yannis Ioannidis YeSQL: rich user-defined functions without the overhead . . . . . . . . . . 3730--3733 Zhihui Yang and Yicong Huang and Zuozhi Wang and Feng Gao and Yao Lu and Chen Li and X. Sean Wang Demonstration of accelerating machine learning inference queries with correlative proxy models . . . . . . . . 3734--3737 Xiaozhen Liu and Zuozhi Wang and Shengquan Ni and Sadeem Alsudais and Yicong Huang and Avinash Kumar and Chen Li Demonstration of collaborative and interactive workflow-based data analytics in Texera . . . . . . . . . . 3738--3741 Ramon Antonio Rodriges Zalipynis SimDB in action: road traffic simulations completely inside array DBMS 3742--3745 Gilbert Badaro and Paolo Papotti Transformers for tabular data representation: a tutorial on models and applications . . . . . . . . . . . . . . 3746--3749 Felix Kiehn and Mareike Schmidt and Daniel Glake and Fabian Panse and Wolfram Wingerath and Benjamin Wollmer and Martin Poppinga and Norbert Ritter Polyglot data management: state of the art & open challenges . . . . . . . . . . 3750--3753 Abdul Wasay and Nesime Tatbul and Justin Gottschlich Machine programming: turning data into programmer productivity . . . . . . . . 3754--3757 Guoliang Li and Haowen Dong and Chao Zhang Cloud databases: new techniques, challenges, and opportunities . . . . . 3758--3761 Amine Mhedhbi and Semih Salihoglu Modern techniques for querying graph-structured relations: foundations, system implementations, and open challenges . . . . . . . . . . . . . . . 3762--3765 Yixiang Fang and Wensheng Luo and Chenhao Ma Densest subgraph discovery on large graphs: applications, challenges, and techniques . . . . . . . . . . . . . . . 3766--3769 Immanuel Trummer From BERT to GPT-3 codex: harnessing the potential of very large language models for data management . . . . . . . . . . 3770--3773 Kaisong Huang and Yuliang He and Tianzheng Wang The past, present and future of indexing on persistent memory . . . . . . . . . . 3774--3777 Zoi Kaoudi and Jorge-Arnulfo Quiané-Ruiz Unified data analytics: state-of-the-art and open problems . . . . . . . . . . . 3778--3781 Wenfei Fan Big graphs: challenges and opportunities 3782--3797 Sihem Amer-Yahia Towards AI-powered data-driven education 3798--3806 Yizhou Sun and Jiawei Han and Xifeng Yan and Philip S. Yu and Tianyi Wu Heterogeneous information networks: the past, the present, and the future . . . 3807--3811 Sudeepa Roy Toward interpretable and actionable data analysis with explanations and causality 3812--3820 Fatma Özcan Reflections on my data management research journey (VLDB women in database research award talk) . . . . . . . . . . 3821--3822 C. Mohan Panel: startups founded by database researchers . . . . . . . . . . . . . . 3823--3825 Magdalena Balazinska and Surajit Chaudhuri and AnHai Doan and Joseph M. Hellerstein and Hanuma Kodavalla and Ippokratis Pandis and Matei Zaharia Cloud data systems: what are the opportunities for the database research community? . . . . . . . . . . . . . . . 3826--3827
Sachin Basil John and Christoph Koch High-Dimensional Data Cubes . . . . . . 3828--3840 Matteo Ceccarello and Johann Gamper Fast and Scalable Mining of Time Series Motifs with Probabilistic Guarantees . . 3841--3853 Daniel Deutch and Amir Gilad and Tova Milo and Amit Mualem and Amit Somech FEDEX: an Explainability Framework for Data Exploration Steps . . . . . . . . . 3854--3868 Maria Xekalaki and Juan Fumero and Athanasios Stratikopoulos and Katerina Doka and Christos Katsakioris and Constantinos Bitsakos and Nectarios Koziris and Christos Kotselidis Enabling Transparent Acceleration of Big Data Frameworks Using Heterogeneous Hardware . . . . . . . . . . . . . . . . 3869--3882 Adriano Fazzone and Tommaso Lanciano and Riccardo Denni and Charalampos E. Tsourakakis and Francesco Bonchi Discovering Polarization Niches via Dense Subgraphs with Attractors and Repulsers . . . . . . . . . . . . . . . 3883--3896 Eunjae Lee and Sam H. Noh and Jiwon Seo Sage: a System for Uncertain Network Analysis . . . . . . . . . . . . . . . . 3897--3910 Hongchao Qin and Rong-Hua Li and Ye Yuan and Guoren Wang and Lu Qin and Zhiwei Zhang Mining Bursting Core in Large Temporal Graphs . . . . . . . . . . . . . . . . . 3911--3923 Xiang Yu and Chengliang Chai and Guoliang Li and Jiabin Liu Cost-Based or Learning-Based?: a Hybrid Query Optimizer for Query Plan Selection 3924--3936 Jingfan Meng and Huayi Wang and Jun Xu and Mitsunori Ogihara ONe Index for All Kernels (ONIAK): a Zero Re-Indexing LSH Solution to ANNS-ALT (After Linear Transformation) 3937--3949 Jiachen Shi and Gao Cong and Xiao-Li Li Learned Index Benefits: Machine Learning Based Index Performance Estimation . . . 3950--3962 Jiachuan Wang and Peng Cheng and Libin Zheng and Lei Chen and Wenjie Zhang Online Ridesharing with Meeting Points 3963--3975 Luigi Bellomarini and Davide Benedetto and Matteo Brandetti and Emanuel Sallinger Exploiting the Power of Equality-Generating Dependencies in Ontological Reasoning . . . . . . . . . 3976--3988 Anders Aamand and Debarati Das and Evangelos Kipouridis and Jakob B. T. Knudsen and Peter M. R. Rasmussen and Mikkel Thorup No Repetition: Fast and Reliable Sampling with Highly Concentrated Hashing . . . . . . . . . . . . . . . . 3989--4001 Lyes Attouche and Mohamed-Amine Baazizi and Dario Colazzo and Giorgio Ghelli and Carlo Sartiani and Stefanie Scherzinger Witness Generation for JSON Schema . . . 4002--4014 Shreya Shankar and Aditya G. Parameswaran Towards Observability for Production Machine Learning Pipelines . . . . . . . 4015--4022 Sekwon Lee and Soujanya Ponnapalli and Sharad Singhal and Marcos K. Aguilera and Kimberly Keeton and Vijay Chidambaram DINOMO: an Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory . . . . 4023--4037 Shreya Shankar and Stephen Macke and Sarah Chasins and Andrew Head and Aditya Parameswaran Bolt-on, Compact, and Rapid Program Slicing for Notebooks . . . . . . . . . 4038--4047 Weijie Sun and Zihuan Xu and Lei Chen Fairness Matters: a Tit-for-Tat Strategy Against Selfish Mining . . . . . . . . . 4048--4061 Jialin Ding and Ryan Marcus and Andreas Kipf and Vikram Nathan and Aniruddha Nrusimha and Kapil Vaidya and Alexander van Renen and Tim Kraska SageDB: an Instance-Optimized Data Analytics System . . . . . . . . . . . . 4062--4078 Keven Richly and Rainer Schlosser and Martin Boissier Budget-Conscious Fine-Grained Configuration Optimization for Spatio-Temporal Applications . . . . . . 4079--4092 Cheng-Yu Hsieh and Jieyu Zhang and Alexander Ratner Nemo: Guiding and Contextualizing Weak Supervision for Interactive Data Programming . . . . . . . . . . . . . . 4093--4105
Jeffrey Helt and Abhinav Sharma and Daniel J. Abadi and Wyatt Lloyd and Jose M. Faleiro C5: cloned concurrency control that always keeps up . . . . . . . . . . . . 1--14 Ruihong Wang and Jianguo Wang and Stratos Idreos and M. Tamer Özsu and Walid G. Aref The case for distributed shared-memory databases with RDMA-enabled memory disaggregation . . . . . . . . . . . . . 15--22 Chenyuan Wu and Mohammad Javad Amiri and Jared Asch and Heena Nagda and Qizhen Zhang and Boon Thau Loo FlexChain: an elastic disaggregated blockchain . . . . . . . . . . . . . . . 23--36 Zhen Zhang and Shuai Zheng and Yida Wang and Justin Chiu and George Karypis and Trishul Chilimbi and Mu Li and Xin Jin MiCS: near-linear scaling for training gigantic model on public cloud . . . . . 37--50 Yi Yang and Yurong Cheng and Ye Yuan and Guoren Wang and Lei Chen and Yongjiao Sun Privacy-preserving cooperative online matching over spatial crowdsourcing platforms . . . . . . . . . . . . . . . 51--63 Jiayi Wang and Chengliang Chai and Nan Tang and Jiabin Liu and Guoliang Li Coresets over multiple tables for feature-rich and data-efficient machine learning . . . . . . . . . . . . . . . . 64--76 Zihao Zhang and Huiqi Hu and Xuan Zhou and Jiang Wang Starry: multi-master transaction processing on semi-leader architecture 77--89 Konstantinos Skitsas and Ioannis G. Papageorgiou and Mohammad Sadegh Talebi and Verena Kantere and Michael N. Katehakis and Panagiotis Karras SIFTER: space-efficient value iteration for finite-horizon MDPs . . . . . . . . 90--98 Geoffrey X. Yu and Markos Markakis and Andreas Kipf and Per-Åke Larson and Umar Farooq Minhas and Tim Kraska TreeLine: an update-in-place key-value store for modern storage . . . . . . . . 99--112 Yuchao Tao and Amir Gilad and Ashwin Machanavajjhala and Sudeepa Roy DPXPlain: privately explaining aggregate query answers . . . . . . . . . . . . . 113--126
Lijun Chang and Mouyi Xu and Darren Strash Efficient maximum $k$-plex computation over large sparse graphs . . . . . . . . 127--139 Tianxun Hu and Tianzheng Wang and Qingqing Zhou Online schema evolution is (almost) free for snapshot databases . . . . . . . . . 140--153 Yifan Wang and Haodi Ma and Daisy Zhe Wang LIDER: an efficient high-dimensional learned index for large-scale dense passage retrieval . . . . . . . . . . . 154--166 Sina Shaham and Gabriel Ghinita and Cyrus Shahabi Models and mechanisms for spatial data fairness . . . . . . . . . . . . . . . . 167--179 Shixun Huang and Wenqing Lin and Zhifeng Bao and Jiachen Sun Influence maximization in real-world closed social networks . . . . . . . . . 180--192 Angela Bonifati and Francesco Del Buono and Francesco Guerra and Donato Tiano Time2Feat: learning interpretable representations for multivariate time series clustering . . . . . . . . . . . 193--201 Xiaochen Li and Yuke Hu and Weiran Liu and Hanwen Feng and Li Peng and Yuan Hong and Kui Ren and Zhan Qin OpBoost: a vertical federated tree boosting framework based on order-preserving desensitization . . . . 202--215 R. Malinga Perera and Bastian Oetomo and Benjamin I. P. Rubinstein and Renata Borovica-Gajic HMAB: self-driving hierarchy of bandits for integrated physical database design tuning . . . . . . . . . . . . . . . . . 216--229 Dimitris Palyvos-Giannas and Katerina Tzompanaki and Marina Papatriantafilou and Vincenzo Gulisano Erebus: explaining the outputs of data streaming queries . . . . . . . . . . . 230--242 Zhou Zhang and Zhaole Chu and Peiquan Jin and Yongping Luo and Xike Xie and Shouhong Wan and Yun Luo and Xufei Wu and Peng Zou and Chunyang Zheng and Guoan Wu and Andy Rudoff PLIN: a persistent learned index for non-volatile memory with high performance and instant recovery . . . . 243--255 Zuozhi Wang and Shengquan Ni and Avinash Kumar and Chen Li Fries: fast and consistent runtime reconfiguration in dataflow systems with transactional guarantees . . . . . . . . 256--268 Renjie Xiao and Zijing Tan and Haojin Wang and Shuai Ma Fast approximate denial constraint discovery . . . . . . . . . . . . . . . 269--281 Haoyu Wang and Shaoxu Song Frequency domain data encoding in Apache IoTDB . . . . . . . . . . . . . . . . . 282--290 Jiping Zheng and Yuan Ma and Wei Ma and Yanhao Wang and Xiaoyang Wang Happiness maximizing sets under group fairness constraints . . . . . . . . . . 291--303 Cedric Renggli and Xiaozhe Yao and Luka Kolar and Luka Rimanic and Ana Klimovic and Ce Zhang SHiFT: an efficient, flexible search engine for transfer learning . . . . . . 304--316 Md. Mouinul Islam and Dong Wei and Baruch Schieber and Senjuti Basu Roy Satisfying complex top-$k$ fairness constraints by preference substitutions 317--329 Nikolai Karpov and Qin Zhang SyncSignature: a simple, efficient, parallelizable framework for tree similarity joins . . . . . . . . . . . . 330--342 Shuang Yang and Yahui Sun and Jiesong Liu and Xiaokui Xiao and Rong-Hua Li and Zhewei Wei Approximating probabilistic group Steiner trees in graphs . . . . . . . . 343--355 Serafeim Papadias and Zoi Kaoudi and Jorge-Arnulfo Quiané-Ruiz and Volker Markl Space-efficient random walks on streaming graphs . . . . . . . . . . . . 356--368 Pengfei Wang and Xiaocan Zeng and Lu Chen and Fan Ye and Yuren Mao and Junhao Zhu and Yunjun Gao PromptEM: prompt-tuning for low-resource generalized entity matching . . . . . . 369--378 Zhihan Guo and Xinyu Zeng and Kan Wu and Wuh-Chwen Hwang and Ziwei Ren and Xiangyao Yu and Mahesh Balakrishnan and Philip A. Bernstein Cornus: atomic commit for a cloud DBMS with storage disaggregation . . . . . . 379--392
Haitao Yuan and Guoliang Li and Zhifeng Bao Route Travel Time Estimation on a Road Network Revisited: Heterogeneity, Proximity, Periodicity and Dynamicity 393--405 Yongji Wu and Matthew Lentz and Danyang Zhuo and Yao Lu Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures . . . . . . . . . . . . 406--419 Zixuan Geng and Maximilian Schleich and Dan Suciu Computing Rule-Based Explanations by Leveraging Counterfactuals . . . . . . . 420--432 Jinfeng Peng and Derong Shen and Nan Tang and Tieying Liu and Yue Kou and Tiezheng Nie and Hang Cui and Ge Yu Self-Supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks . . . . . . . . . . 433--446 Francisco Romero and Johann Hauswald and Aditi Partap and Daniel Kang and Matei Zaharia and Christos Kozyrakis Optimizing Video Analytics with Declarative Model Relationships . . . . 447--460 Jiaxin Jiang and Yuan Li and Bingsheng He and Bryan Hooi and Jia Chen and Johan Kok Zhi Kang Spade: a Real-Time Fraud Detection Framework on Evolving Graphs . . . . . . 461--469 Xupeng Miao and Yujie Wang and Youhe Jiang and Chunan Shi and Xiaonan Nie and Hailin Zhang and Bin Cui Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism . . . . . . . . . 470--479 Qitong Wang and Stephen Whitmarsh and Vincent Navarro and Themis Palpanas iEDeaL: a Deep Learning Framework for Detecting Highly Imbalanced Interictal Epileptiform Discharges . . . . . . . . 480--490 Eleni Zapridou and Ioannis Mytilinis and Anastasia Ailamaki Dalton: Learned Partitioning for Distributed Data Streams . . . . . . . . 491--504 Ali Behrouz and Farnoosh Hashemi and Laks V. S. Lakshmanan FirmTruss Community Search in Multilayer Networks . . . . . . . . . . . . . . . . 505--518 Tianyang Xu and Zhao Lu and Yuanyuan Zhu Efficient Triangle-Connected Truss Community Search in Dynamic Graphs . . . 519--531 Ibrahim Sabek and Kapil Vaidya and Dominik Horn and Andreas Kipf and Michael Mitzenmacher and Tim Kraska Can Learned Models Replace Hash Functions? . . . . . . . . . . . . . . . 532--545 Yue Zhao and George H. Chen and Zhihao Jia TOD: GPU-Accelerated Outlier Detection via Tensor Operations . . . . . . . . . 546--560 Chaohong Ma and Xiaohui Yu and Yifan Li and Xiaofeng Meng and Aishan Maoliniyazi FILM: a Fully Learned Index for Larger-Than-Memory Databases . . . . . . 561--573
Miti Mazmudar and Thomas Humphries and Jiaxiang Liu and Matthew Rafuse and Xi He Cache Me If You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration . . . . . . . . 574--586 Francesca Falzon and Evangelia Anna Markatou and Zachary Espiritu and Roberto Tamassia Range Search over Encrypted Multi-Attribute Data . . . . . . . . . . 587--600 Xuanle Ren and Le Su and Zhen Gu and Sheng Wang and Feifei Li and Yuan Xie and Song Bian and Chao Li and Fan Zhang HEDA: Multi-Attribute Unbounded Aggregation over Homomorphically Encrypted Database . . . . . . . . . . . 601--614 Chih-Ya Shen and Shao-Heng Ko and Guang-Siang Lee and Wang-Chien Lee and De-Nian Yang Density Personalized Group Query . . . . 615--628 Jinkun Geng and Anirudh Sivaraman and Balaji Prabhakar and Mendel Rosenblum Nezha: Deployable and High-Performance Consensus Using Synchronized Clocks . . 629--642 Ishtiyaque Ahmad and Divyakant Agrawal and Amr El Abbadi and Trinabh Gupta Pantheon: Private Retrieval from Public Key--Value Store . . . . . . . . . . . . 643--656 Francesco Da Dalt and Simon Scherrer and Adrian Perrig Bayesian Sketches for Volume Estimation in Data Streams . . . . . . . . . . . . 657--669 Moin Hussain Moti and Panagiotis Simatis and Dimitris Papadias Waffle: a Workload-Aware and Query-Sensitive Framework for Disk-Based Spatial Indexing . . . . . . . . . . . . 670--683 Eduardo H. M. Pena and Fabio Porto and Felix Naumann Fast Algorithms for Denial Constraint Discovery . . . . . . . . . . . . . . . 684--696 Pu Jiao and Sheng Di and Hanqi Guo and Kai Zhao and Jiannan Tian and Dingwen Tao and Xin Liang and Franck Cappello Toward Quantity-of-Interest Preserving Lossy Compression for Scientific Data 697--710 Gunduz Vehbi Demirci and Aparajita Haldar and Hakan Ferhatosmanoglu Scalable Graph Convolutional Network Training on Distributed-Memory Systems 711--724 Patrick Schäfer and Ulf Leser Motiflets: Simple and Accurate Detection of Motifs in Time Series . . . . . . . . 725--737 Avanika Narayan and Ines Chami and Laurel Orr and Christopher Ré Can Foundation Models Wrangle Your Data? 738--746 Bogyeong Kim and Kyoseung Koo and Undraa Enkhbat and Sohyun Kim and Juhun Kim and Bongki Moon M2Bench: a Database Benchmark for Multi-Model Analytic Workloads . . . . . 747--759 Nirvik Baruah and Peter Kraft and Fiodar Kazhamiaka and Peter Bailis and Matei Zaharia Parallelism-Optimizing Data Placement for Faster Data-Parallel Computations 760--771 Teddy Lazebnik and Amit Somech and Abraham Itzhak Weinberg SubStrat: a Subset-Based Optimization Strategy for Faster AutoML . . . . . . . 772--780 Saket Gurukar and Nikil Pancha and Andrew Zhai and Eric Kim and Samson Hu and Srinivasan Parthasarathy and Charles Rosenberg and Jure Leskovec MultiBiSage: a Web-Scale Recommendation System Using Multiple Bipartite Graphs at Pinterest . . . . . . . . . . . . . . 781--789 Alexandros Zeakis and Dimitrios Skoutas and Dimitris Sacharidis and Odysseas Papapetrou and Manolis Koubarakis TokenJoin: Efficient Filtering for Set Similarity Join with Maximum Weighted Bipartite Matching . . . . . . . . . . . 790--802 Moe Kayali and Dan Suciu Quasi-Stable Coloring for Graph Compression: Approximating Max-Flow, Linear Programs, and Centrality . . . . 803--815 David Pujol and Albert Sun and Brandon Fain and Ashwin Machanavajjhala Multi-Analyst Differential Privacy for Online Query Answering . . . . . . . . . 816--828 Tim Gubner and Peter Boncz Excalibur: a Virtual Machine for Adaptive Fine-grained JIT-Compiled Query Execution based on VOILA . . . . . . . . 829--841 Lianke Qin and Rajesh Jayaram and Elaine Shi and Zhao Song and Danyang Zhuo and Shumo Chu Adore: Differentially Oblivious Relational Database Operators . . . . . 842--855 Shadaj Laddad and Conor Power and Mae Milano and Alvin Cheung and Natacha Crooks and Joseph M. Hellerstein Keep CALM and CRDT On . . . . . . . . . 856--863 Kejing Lu and Yoshiharu Ishikawa and Chuan Xiao MQH: Locality Sensitive Hashing on Multi-level Quantization Errors for Point-to-Hyperplane Distances . . . . . 864--876 Gábor Szárnyas and Jack Waudby and Benjamin A. Steer and Dávid Szakállas and Altan Birler and Mingxi Wu and Yuchen Zhang and Peter Boncz The LDBC Social Network Benchmark: Business Intelligence Workload . . . . . 877--890 Shuai An and Yang Cao Making Cache Monotonic and Consistent 891--904 Ziyun Wei and Immanuel Trummer SkinnerMT: Parallelizing for Efficiency and Robustness in Adaptive Query Processing on Multicore Platforms . . . 905--917 Dujian Ding and Sihem Amer-Yahia and Laks Lakshmanan On Efficient Approximate Queries over Machine Learning Models . . . . . . . . 918--931 Aamod Khatiwada and Roee Shraga and Wolfgang Gatterbauer and Renée J. Miller Integrating Data Lake Tables . . . . . . 932--945 Hongbo Kang and Yiwei Zhao and Guy E. Blelloch and Laxman Dhulipala and Yan Gu and Charles McGuffey and Phillip B. Gibbons PIM-Tree: a Skew-Resistant Index for Processing-in-Memory . . . . . . . . . . 946--958
Mingxuan Li and Yazhe Wang and Shuai Ma and Chao Liu and Dongdong Huo and Yu Wang and Zhen Xu Auto-Tuning with Reinforcement Learning for Permissioned Blockchain Systems . . 1000--1012 Minhui Xie and Youyou Lu and Qing Wang and Yangyang Feng and Jiaqiang Liu and Kai Ren and Jiwu Shu PetPS: Supporting Huge Embedding Models with Persistent Memory . . . . . . . . . 1013--1022 Kashif Rabbani and Matteo Lissandrini and Katja Hose Extraction of Validating Shapes from Very Large Knowledge Graphs . . . . . . 1023--1032 Pu Pang and Gang Deng and Kaihao Bai and Quan Chen and Shixuan Sun and Bo Liu and Yu Xu and Hongbo Yao and Zhengheng Wang and Xiyu Wang and Zheng Liu and Zhuo Song and Yong Yang and Tao Ma and Minyi Guo Async-Fork: Mitigating Query Latency Spikes Incurred by the Fork-based Snapshot Mechanism from the OS Level . . 1033--1045 Qichen Wang and Xiao Hu and Binyang Dai and Ke Yi Change Propagation Without Joins . . . . 1046--1058 Yuexiang Xie and Zhen Wang and Dawei Gao and Daoyuan Chen and Liuyi Yao and Weirui Kuang and Yaliang Li and Bolin Ding and Jingren Zhou FederatedScope: a Flexible Federated Learning Platform for Heterogeneity . . 1059--1072 Boyang Li and Yurong Cheng and Ye Yuan and Yi Yang and QianQian Jin and Guoren Wang ACTA: Autonomy and Coordination Task Assignment in Spatial Crowdsourcing Platforms . . . . . . . . . . . . . . . 1073--1085 Taegeon Um and Byungsoo Oh and Byeongchan Seo and Minhyeok Kweun and Goeun Kim and Woo-Yeon Lee FastFlow: Accelerating Deep Learning Model Training with Smart Offloading of Input Data Pipeline . . . . . . . . . . 1086--1099 Xi Zhao and Bolong Zheng and Xiaomeng Yi and Xiaofan Luan and Charles Xie and Xiaofang Zhou and Christian S. Jensen FARGO: Fast Maximum Inner Product Search via Global Multi-Probing . . . . . . . . 1100--1112 Martin Kiefer and Ilias Poulakis and Eleni Tzirita Zacharatou and Volker Markl Optimistic Data Parallelism for FPGA-Accelerated Sketching . . . . . . . 1113--1125 Héber H. Arcolezi and Sébastien Gambs and Jean-François Couchot and Catuscia Palamidessi On the Risks of Collecting Multidimensional Data Under Local Differential Privacy . . . . . . . . . . 1126--1139 Manos Chatzakis and Panagiota Fatourou and Eleftherios Kosmas and Themis Palpanas and Botao Peng Odyssey: a Journey in the Land of Distributed Data Series Similarity Search . . . . . . . . . . . . . . . . . 1140--1153 Lanting Fang and Kaiyu Feng and Jie Gui and Shanshan Feng and Aiqun Hu Anonymous Edge Representation for Inductive Anomaly Detection in Dynamic Bipartite Graph . . . . . . . . . . . . 1154--1167 Junyong Yang and Ming Zhong and Yuanyuan Zhu and Tieyun Qian and Mengchi Liu and Jeffrey Xu Yu Scalable Time-Range $k$-Core Query on Temporal Graphs . . . . . . . . . . . . 1168--1180 Erkang Zhu and Silu Huang and Surajit Chaudhuri High-Performance Row Pattern Recognition Using Joins . . . . . . . . . . . . . . 1181--1195 Kelin Luo and Alexandre M. Florio and Syamantak Das and Xiangyu Guo A Hierarchical Grouping Algorithm for the Multi-Vehicle Dial-a-Ride Problem 1195--1207
Xiaoxuan Liu and Shuxian Wang and Mengzhu Sun and Sicheng Pan and Ge Li and Siddharth Jha and Cong Yan and Junwen Yang and Shan Lu and Alvin Cheung Leveraging Application Data Constraints to Optimize Database-Backed Web Applications . . . . . . . . . . . . . . 1208--1221 Ferdinand Gruber and Maximilian Bandle and Alexis Engelke and Thomas Neumann and Jana Giceva Bringing Compiling Databases to RISC Architectures . . . . . . . . . . . . . 1222--1234 Hokeun Cha and Xiangpeng Hao and Tianzheng Wang and Huanchen Zhang and Aditya Akella and Xiangyao Yu B$^{\rm link}$-hash: an Adaptive Hybrid Index for In-Memory Time-Series Databases . . . . . . . . . . . . . . . 1235--1248 Wentao Huang and Yunhong Ji and Xuan Zhou and Bingsheng He and Kian-Lee Tan A Design Space Exploration and Evaluation for Main-Memory Hash Joins in Storage Class Memory . . . . . . . . . . 1249--1263 Kaile Huang and Si Liu and Zhenge Chen and Hengfeng Wei and David Basin and Haixiang Li and Anqun Pan Efficient Black-Box Checking of Snapshot Isolation in Databases . . . . . . . . . 1264--1276 Zitao Li and Tianhao Wang and Ninghui Li Differentially Private Vertical Federated Clustering . . . . . . . . . . 1277--1290 Fuheng Zhao and Punnal Ismail Khan and Divyakant Agrawal and Amr El Abbadi and Arpit Gupta and Zaoxing Liu Panakos: Chasing the Tails for Multidimensional Data Streams . . . . . 1291--1304 Jonathan Fürst and Mauricio Fadel Argerich and Bin Cheng VersaMatch: Ontology Matching with Weak Supervision . . . . . . . . . . . . . . 1305--1318 Yushi Sun and Hao Xin and Lei Chen RECA: Related Tables Enhanced Column Semantic Type Annotation Framework . . . 1319--1331 Yiming Li and Yanyan Shen and Lei Chen and Mingxuan Yuan Zebra: When Temporal Graph Neural Networks Meet Temporal Personalized PageRank . . . . . . . . . . . . . . . . 1332--1345 Su Feng and Boris Glavic and Oliver Kennedy Efficient Approximation of Certain and Possible Answers for Ranking and Window Queries over Uncertain Data . . . . . . 1346--1358 Cong Yue and Tien Tuan Anh Dinh and Zhongle Xie and Meihui Zhang and Gang Chen and Beng Chin Ooi and Xiaokui Xiao GlassDB: an Efficient Verifiable Ledger Database System Through Transparency . . 1359--1371 Qian Zhang and Jingyao Li and Hongyao Zhao and Quanqing Xu and Wei Lu and Jinliang Xiao and Fusheng Han and Chuanhui Yang and Xiaoyong Du Efficient Distributed Transaction Processing in Heterogeneous Networks . . 1372--1385 Zhiguo Jiang and Hanhua Chen and Hai Jin Auxo: a Scalable and Efficient Graph Stream Summarization Structure . . . . . 1386--1398 Xiao He and Ye Li and Jian Tan and Bin Wu and Feifei Li OneShotSTL: One-Shot Seasonal-Trend Decomposition For Online Time Series Anomaly Detection And Forecasting . . . 1399--1412 Alexander van Renen and Viktor Leis Cloud Analytics Benchmark . . . . . . . 1413--1425 Adnan Alhomssi and Viktor Leis Scalable and Robust Snapshot Isolation for High-Performance Storage Engines . . 1426--1438 Xiang Li and Fabing Li and Mingyu Gao Flare: a Fast, Secure, and Memory-Efficient Distributed Analytics Framework . . . . . . . . . . . . . . . 1439--1452 Mijin An and Jonghyeok Park and Tianzheng Wang and Beomseok Nam and Sang-Won Lee NV-SQL: Boosting OLTP Performance with Non-Volatile DIMMs . . . . . . . . . . . 1453--1465 Rong Zhu and Wei Chen and Bolin Ding and Xingguang Chen and Andreas Pfadler and Ziniu Wu and Jingren Zhou Lero: a Learning-to-Rank Query Optimizer 1466--1479 Kitaek Lee and Insoon Jo and Jaechan Ahn and Hyuk Lee and Hwang Lee and Woong Sul and Hyungsoo Jung Deploying Computational Storage for HTAP DBMSs Takes More Than Just Computation Offloading . . . . . . . . . . . . . . . 1480--1493 Dixin Tang and Alan Fekete and Indranil Gupta and Aditya G. Parameswaran Transactional Panorama: a Conceptual Framework for User Perception in Analytical Visual Interfaces . . . . . . 1494--1506 Derek Paulsen and Yash Govind and AnHai Doan Sparkly: a Simple yet Surprisingly Strong TF/IDF Blocker for Entity Matching . . . . . . . . . . . . . . . . 1507--1519 Parimarjan Negi and Ziniu Wu and Andreas Kipf and Nesime Tatbul and Ryan Marcus and Sam Madden and Tim Kraska and Mohammad Alizadeh Robust Query Driven Cardinality Estimation under Changing Workloads . . 1520--1533 Han Fu and Chang Liu and Bin Wu and Feifei Li and Jian Tan and Jianling Sun CatSQL: Towards Real World Natural Language to SQL Applications . . . . . . 1534--1547 Ilias Azizi and Karima Echihabi and Themis Palpanas ELPIS: Graph-Based Similarity Search for Scalable Data Science . . . . . . . . . 1548--1559 Carl Duffy and Jaehoon Shim and Sang-Hoon Kim and Jin-Soo Kim Dotori: a Key--Value SSD Based KV Store 1560--1572 David Pujol and Amir Gilad and Ashwin Machanavajjhala PreFair: Privately Generating Justifiably Fair Synthetic Data . . . . 1573--1586 Roee Shraga and Renée J. Miller Explaining Dataset Changes for Semantic Data Versioning with Explain-Da-V . . . 1587--1600
Mihai Budiu and Tej Chajed and Frank McSherry and Leonid Ryzhyk and Val Tannen DBSP: Automatic Incremental View Maintenance for Rich Query Languages . . 1601--1614 Ling Liang and Jilan Lin and Zheng Qu and Ishtiyaque Ahmad and Fengbin Tu and Trinabh Gupta and Yufei Ding and Yuan Xie SPG: Structure-Private Graph Database via SqueezePIR . . . . . . . . . . . . . 1615--1628 Jingyuan Zhang and Ao Wang and Xiaolong Ma and Benjamin Carver and Nicholas John Newman and Ali Anwar and Lukas Rupprecht and Vasily Tarasov and Dimitrios Skourtis and Feng Yan and Yue Cheng InfiniStore: Elastic Serverless Cloud Storage . . . . . . . . . . . . . . . . 1629--1642 Peng Fang and Arijit Khan and Siqiang Luo and Fang Wang and Dan Feng and Zhenli Li and Wei Yin and Yuchao Cao Distributed Graph Embedding with Information-Oriented Random Walks . . . 1643--1656 Shuyuan Zheng and Yang Cao and Masatoshi Yoshikawa Secure Shapley Value for Cross-Silo Federated Learning . . . . . . . . . . . 1657--1670 Xiang Li and Nuozhou Sun and Yunqian Luo and Mingyu Gao SODA: a Set of Fast Oblivious Algorithms in Distributed Secure Data Analytics . . 1671--1684 Zicong Hong and Song Guo and Enyuan Zhou and Wuhui Chen and Huawei Huang and Albert Zomaya GriDB: Scaling Blockchain Database via Sharding and Off-Chain Cross-Shard Mechanism . . . . . . . . . . . . . . . 1685--1698 Xun Jian and Zhiyuan Li and Lei Chen SUFF: Accelerating Subgraph Matching with Historical Data . . . . . . . . . . 1699--1711 Haochen He and Erci Xu and Shanshan Li and Zhouyang Jia and Si Zheng and Yue Yu and Jun Ma and Xiangke Liao When Database Meets New Storage Devices: Understanding and Exposing Performance Mismatches via Configurations . . . . . 1712--1725 Grace Fan and Jin Wang and Yuliang Li and Dan Zhang and Renée J. Miller Semantics-Aware Dataset Discovery from Data Lakes with Contextualized Column-Based Representation Learning . . 1726--1739 Kasper Overgaard Mortensen and Fatemeh Zardbani and Mohammad Ahsanul Haque and Steinn Ymir Agustsson and Davide Mottin and Philip Hofmann and Panagiotis Karras Marigold: Efficient $k$-Means Clustering in High Dimensions . . . . . . . . . . . 1740--1748 Ibrahim Sabek and Tim Kraska The Case for Learned In-Memory Joins . . 1749--1762 Ruiyuan Li and Zheng Li and Yi Wu and Chao Chen and Yu Zheng Elf: Erasing-Based Lossless Floating-Point Compression . . . . . . . 1763--1776 Tianyi Chen and Jun Gao and Hedui Chen and Yaofeng Tu LOGER: a Learned Optimizer Towards Generating Efficient and Robust Query Execution Plans . . . . . . . . . . . . 1777--1789 Wim Martens and Matthias Niewerth and Tina Popp and Carlos Rojas and Stijn Vansummeren and Domagoj Vrgoc Representing Paths in Graph Database Pattern Matching . . . . . . . . . . . . 1790--1803
Xiling Li and Chenkai Weng and Yongxin Xu and Xiao Wang and Jennie Rogers ZKSQL: Verifiable and Efficient Query Evaluation with Zero-Knowledge Proofs 1804--1816 Chengzhi Piao and Tingyang Xu and Xiangguo Sun and Yu Rong and Kangfei Zhao and Hong Cheng Computing Graph Edit Distance via Neural Graph Matching . . . . . . . . . . . . . 1817--1829 Christine Schäler and Thomas Hütter and Martin Schäler Benchmarking the Utility of $w$-Event Differential Privacy Mechanisms --- When Baselines Become Mighty Competitors . . 1830--1842 Eriq Augustine and Lise Getoor Collective Grounding: Applying Database Techniques to Grounding Templated Models 1843--1855 Jan Niklas Adams and Cameron Pitsch and Tobias Brockhoff and Wil M. P. van der Aalst An Experimental Evaluation of Process Concept Drift Detection . . . . . . . . 1856--1869 Gerardo Vitagliano and Mazhar Hameed and Lan Jiang and Lucas Reisener and Eugene Wu and Felix Naumann Pollock: a Data Loading Benchmark . . . 1870--1882 Yingtai Xiao and Guanhong Wang and Danfeng Zhang and Daniel Kifer Answering Private Linear Queries Adaptively Using the Common Mechanism 1883--1896 Yuntao Du and Yujia Hu and Zhikun Zhang and Ziquan Fang and Lu Chen and Baihua Zheng and Yunjun Gao LDPTrace: Locally Differentially Private Trajectory Synthesis . . . . . . . . . . 1897--1909 Xenophon Kitsios and Panagiotis Liakos and Katia Papakonstantinopoulou and Yannis Kotidis Sim-Piece: Highly Accurate Piecewise Linear Approximation through Similar Segment Merging . . . . . . . . . . . . 1910--1922 Eugenio Marinelli and Yiqing Yan and Virginie Magnone and Charlotte Dumargne and Pascal Barbry and Thomas Heinis and Raja Appuswamy Towards Migration-Free ``Just-in-Case'' Data Archival for Future Cloud Data Lakes Using Synthetic DNA . . . . . . . 1923--1929 Zhiyuan Dong and Zhaoguo Wang and Xiaodong Zhang and Xian Xu and Changgeng Zhao and Haibo Chen and Aurojit Panda and Jinyang Li Fine-Grained Re-Execution for Efficient Batched Commit of Distributed Transactions . . . . . . . . . . . . . . 1930--1943 Wenfei Fan and Resul Tugay and Yaoshu Wang and Min Xie and Muhammad Asif Ali Learning and Deducing Temporal Orders 1944--1957 Xu Chen and Zhen Wang and Shuncheng Liu and Yaliang Li and Kai Zeng and Bolin Ding and Jingren Zhou and Han Su and Kai Zheng BASE: Bridging the Gap between Cost and Latency for Query Optimization . . . . . 1958--1966 Jakub Lemiesz Efficient Framework for Operating on Data Sketches . . . . . . . . . . . . . 1967--1978 Xi Zhao and Yao Tian and Kai Huang and Bolong Zheng and Xiaofang Zhou Towards Efficient Index Construction and Approximate Nearest Neighbor Search in High-Dimensional Spaces . . . . . . . . 1979--1991 Zhaoyan Sun and Xuanhe Zhou and Guoliang Li Learned Index: a Comprehensive Experimental Evaluation . . . . . . . . 1992--2004 Yanping Zhang and Johes Bater and Kartik Nayak and Ashwin Machanavajjhala Longshot: Indexing Growing Databases Using MPC and Differential Privacy . . . 2005--2018 John Paparrizos and Kaize Wu and Aaron Elmore and Christos Faloutsos and Michael J. Franklin Accelerating Similarity Search for Elastic Measures: a Study and New Generalization of Lower Bounding Distances . . . . . . . . . . . . . . . 2019--2032 Chenyuan Wu and Bhavana Mehta and Mohammad Javad Amiri and Ryan Marcus and Boon Thau Loo AdaChain: a Learned Adaptive Blockchain 2033--2046 Yingli Zhou and Yixiang Fang and Wensheng Luo and Yunming Ye Influential Community Search over Large Heterogeneous Information Networks . . . 2047--2060
Naheed Anjum Arafat and Arijit Khan and Arpit Kumar Rai and Bishwamittra Ghosh Neighborhood-Based Hypergraph Core Decomposition . . . . . . . . . . . . . 2061--2074 Janet Layne and Justin Carpenter and Edoardo Serra and Francesco Gullo Temporal SIR-GN: Efficient and Effective Structural Representation Learning for Temporal Graphs . . . . . . . . . . . . 2075--2089 Gabriel Haas and Viktor Leis What Modern NVMe Storage Can Do, and How to Exploit it: High-Performance I/O for High-Performance Storage Engines . . . . 2090--2102 Vinay Banakar and Kan Wu and Yuvraj Patel and Kimberly Keeton and Andrea C. Arpaci-Dusseau and Remzi H. Arpaci-Dusseau WiscSort: External Sorting for Byte-Addressable Storage . . . . . . . . 2103--2116 Lorraine A. K. Ayad and Grigorios Loukides and Solon P. Pissis Text Indexing for Long Patterns: Anchors are All you Need . . . . . . . . . . . . 2117--2131 Azim Afroozeh and Peter Boncz The FastLanes Compression Layout: Decoding $> 100$ Billion Integers per Second with Scalar Code . . . . . . . . 2132--2144 Cong Yue and Meihui Zhang and Changhao Zhu and Gang Chen and Dumitrel Loghin and Beng Chin Ooi VeriBench: Analyzing the Performance of Database Systems with Verifiability . . 2145--2157 Jiangneng Li and Zheng Wang and Gao Cong and Cheng Long and Han Mao Kiah and Bin Cui Towards Designing and Learning Piecewise Space-Filling Curves . . . . . . . . . . 2158--2171 Xiaoke Zhu and Yang Liu and Shuhao Liu and Wenfei Fan MiniGraph: Querying Big Graphs with a Single Machine . . . . . . . . . . . . . 2172--2185 Yunyoung Choi and Kunsoo Park and Hyunjoon Kim BICE: Exploring Compact Search Space by Using Bipartite Matching and Cell-Wide Verification . . . . . . . . . . . . . . 2186--2198 Anxin Tian and Alexander Zhou and Yue Wang and Lei Chen Maximal D-Truss Search in Dynamic Directed Graphs . . . . . . . . . . . . 2199--2211 Pengfei Li and Hua Lu and Rong Zhu and Bolin Ding and Long Yang and Gang Pan DILI: a Distribution-Driven Learned Index . . . . . . . . . . . . . . . . . 2212--2224 Alexandros Zeakis and George Papadakis and Dimitrios Skoutas and Manolis Koubarakis Pre-Trained Embeddings for Entity Resolution: an Experimental Analysis . . 2225--2238 Yanping Zheng and Zhewei Wei and Jiajun Liu Decoupled Graph Neural Networks for Large Dynamic Graphs . . . . . . . . . . 2239--2247 Fatemeh Zardbani and Nikos Mamoulis and Stratos Idreos and Panagiotis Karras Adaptive Indexing of Objects with Spatial Extent . . . . . . . . . . . . . 2248--2260 Xu Chen and Haitian Chen and Zibo Liang and Shuncheng Liu and Jinghong Wang and Kai Zeng and Han Su and Kai Zheng LEON: a New Framework for ML-Aided Query Optimization . . . . . . . . . . . . . . 2261--2273 Nuno Faria and José Pereira and Ana Nunes Alonso and Ricardo Vilaça and Yunus Koning and Niels Nes TiQuE: Improving the Transactional Performance of Analytical Systems for True Hybrid Workloads . . . . . . . . . 2274--2288 Jaeho Bang and Gaurav Tarlok Kakkar and Pramod Chunduri and Subrata Mitra and Joy Arulraj Seiden: Revisiting Query Processing in Video Database Systems . . . . . . . . . 2289--2301 Ferdi Kossmann and Ziniu Wu and Eugenie Lai and Nesime Tatbul and Lei Cao and Tim Kraska and Sam Madden Extract-Transform-Load for Video Streams 2302--2315 Sivaprasad Sudhir and Wenbo Tao and Nikolay Laptev and Cyrille Habis and Michael Cafarella and Samuel Madden Pando: Enhanced Data Skipping with Logical Data Partitioning . . . . . . . 2316--2329 Kajetan Maliszewski and Jorge-Arnulfo Quiané-Ruiz and Volker Markl Cracking-Like Join for Trusted Execution Environments . . . . . . . . . . . . . . 2330--2343 Umut Çalikyilmaz and Sven Groppe and Jinghua Groppe and Tobias Winker and Stefan Prestel and Farida Shagieva and Daanish Arya and Florian Preis and Le Gruenwald Opportunities for Quantum Acceleration of Databases: Optimization of Queries and Transaction Schedules . . . . . . . 2344--2353 Xupeng Miao and Yining Shi and Zhi Yang and Bin Cui and Zhihao Jia SDPipe: a Semi-Decentralized Framework for Heterogeneity-Aware Pipeline-parallel Training . . . . . . . 2354--2363 Bohyun Lee and Mijin An and Sang-Won Lee LRU-C: Parallelizing Database I/Os for Flash SSDs . . . . . . . . . . . . . . . 2364--2376 Zixuan Chen and Panagiotis Manolios and Mirek Riedewald Why Not Yet: Fixing a Top-$k$ Ranking that is Not Fair to Individuals . . . . 2377--2390
Shantanu Sharma and Yin Li and Sharad Mehrotra and Nisha Panwar and Komal Kumari and Swagnik Roychoudhury Information-Theoretically Secure and Highly Efficient Search and Row Retrieval . . . . . . . . . . . . . . . 2391--2403 Fumiyuki Kato and Yang Cao and Masatoshi Yoshikawa Olive: Oblivious Federated Learning on Trusted Execution Environment against the Risk of Sparsification . . . . . . . 2404--2417 Chengyang Luo and Qing Liu and Yunjun Gao and Lu Chen and Ziheng Wei and Congcong Ge Task: an Efficient Framework for Instant Error-Tolerant Spatial Keyword Queries on Road Networks . . . . . . . . . . . . 2418--2430 Besat Kassaie and Frank Wm. Tompa Autonomously Computable Information Extraction . . . . . . . . . . . . . . . 2431--2443 Dimitrios Koutsoukos and Raghav Bhartia and Michal Friedman and Ana Klimovic and Gustavo Alonso NVM: Is it Not Very Meaningful for Databases? . . . . . . . . . . . . . . . 2444--2457 Yuyang Dong and Chuan Xiao and Takuma Nozawa and Masafumi Enomoto and Masafumi Oyamada DeepJoin: Joinable Table Discovery with Pre-Trained Language Models . . . . . . 2458--2470 Yuncheng Wu and Naili Xing and Gang Chen and Tien Tuan Anh Dinh and Zhaojing Luo and Beng Chin Ooi and Xiaokui Xiao and Meihui Zhang Falcon: a Privacy-Preserving and Interpretable Vertical Federated Learning System . . . . . . . . . . . . 2471--2484 Haotian Gao and Cong Yue and Tien Tuan Anh Dinh and Zhiyong Huang and Beng Chin Ooi Enabling Secure and Efficient Data Analytics Pipeline Evolution with Trusted Execution Environment . . . . . 2485--2498 Harish Doraiswamy and Vikas Kalagi and Karthik Ramachandra and Jayant R. Haritsa A Case for Graphics-Driven Query Processing . . . . . . . . . . . . . . . 2499--2511 Wei Tian and Jieming Shi and Siqiang Luo and Hui Li and Xike Xie and Yuanhang Zou Effective and Efficient Route Planning Using Historical Trajectories on Road Networks . . . . . . . . . . . . . . . . 2512--2524 Konstantinos Lampropoulos and Fatemeh Zardbani and Nikos Mamoulis and Panagiotis Karras Adaptive Indexing in High-Dimensional Metric Spaces . . . . . . . . . . . . . 2525--2537 Sen Gao and Hongchao Qin and Rong-Hua Li and Bingsheng He Parallel Colorful $h$-Star Core Maintenance in Dynamic Graphs . . . . . 2538--2550 Jia Li and Wenyue Zhao and Nikos Ntarmos and Yang Cao and Peter Buneman MITra: a Framework for Multi-Instance Graph Traversal . . . . . . . . . . . . 2551--2564 Jiazun Chen and Yikuan Xia and Jun Gao CommunityAF: an Example-Based Community Search Method via Autoregressive Flow 2565--2577 Yiming Lin and Yeye He and Surajit Chaudhuri Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph . . . . . . . . . . 2578--2590 Yuemin Zhang and Qingqing Ye and Rui Chen and Haibo Hu and Qilong Han Trajectory Data Collection with Local Differential Privacy . . . . . . . . . . 2591--2604 Jian Gao and Xin Cao and Xin Yao and Gong Zhang and Wei Wang LMSFC: a Novel Multidimensional Index Based on Learned Monotonic Space Filling Curves . . . . . . . . . . . . . . . . . 2605--2617 Kexin Rong and Mihai Budiu and Athinagoras Skiadopoulos and Lalith Suresh and Amy Tai Scaling a Declarative Cluster Manager Architecture with Query Optimization Techniques . . . . . . . . . . . . . . . 2618--2631 Mukul Singh and José Cambronero Sánchez and Sumit Gulwani and Vu Le and Carina Negreanu and Mohammad Raza and Gust Verbruggen Cornet: Learning Table Formatting Rules By Example . . . . . . . . . . . . . . . 2632--2644 Chaoji Zuo and Dong Deng ARKGraph: All-Range Approximate $K$-Nearest-Neighbor Graph . . . . . . . 2645--2658 Brit Youngmann and Michael Cafarella and Babak Salimi and Anna Zeng Causal Data Integration . . . . . . . . 2659--2665 Michael Martini and Daniel Schuster and Wil M. P. van der Aalst Mining Frequent Infix Patterns from Concurrency-Aware Process Execution Variants . . . . . . . . . . . . . . . . 2666--2678 Pedro Pedreira and Orri Erling and Konstantinos Karanasos and Scott Schneider and Wes McKinney and Satya R. Valluri and Mohamed Zait and Jacques Nadeau The Composable Data Management System Manifesto . . . . . . . . . . . . . . . 2679--2685
Daniel Schmitt and Daniel Kocher and Nikolaus Augsten and Willi Mann and Alexander Miller A Two-Level Signature Scheme for Stable Set Similarity Joins . . . . . . . . . . 2686--2698 Olivier Rodriguez and Federico Ulliana and Marie-Laure Mugnier Scalable Reasoning on Document Stores via Instance-Aware Query Rewriting . . . 2699--2713 Enhao Zhang and Maureen Daum and Dong He and Brandon Haynes and Ranjay Krishna and Magdalena Balazinska EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions . . . . . . . . . . . 2714--2727 Yuhao Zhang and Arun Kumar Lotan: Bridging the Gap between GNNs and Scalable Graph Analytics Engines . . . . 2728--2741 Peter Kraft and Qian Li and Xinjing Zhou and Peter Bailis and Michael Stonebraker and Matei Zaharia and Xiangyao Yu Epoxy: ACID Transactions across Diverse Data Stores . . . . . . . . . . . . . . 2742--2754 Maximilian Böther and Lawrence Benson and Ana Klimovic and Tilmann Rabl Analyzing Vectorized Hash Tables across CPU Architectures . . . . . . . . . . . 2755--2768 Dominik Durner and Viktor Leis and Thomas Neumann Exploiting Cloud Object Storage for High-Performance Analytics . . . . . . . 2769--2782 Dimitrios Karapiperis and Christos Tjortjis and Vassilios S. Verykios A Randomized Blocking Structure for Streaming Record Linkage . . . . . . . . 2783--2791 Cristian Riveros and Nicolás Van Sint Jan and Domagoj Vrgoc REmatch: a Novel Regex Engine for Finding All Matches . . . . . . . . . . 2792--2804 Junxiong Wang and Immanuel Trummer and Ahmet Kara and Dan Olteanu ADOPT: Adaptively Optimizing Attribute Orders for Worst-Case Optimal Join Algorithms via Reinforcement Learning 2805--2817 Zheng Hu and Weiguo Zheng and Xiang Lian Triangular Stability Maximization by Influence Spread over Social Networks 2818--2831 Haoquan Guan and Ziling Chen and Shaoxu Song CORE-Sketch: On Exact Computation of Median Absolute Deviation with Limited Space . . . . . . . . . . . . . . . . . 2832--2844 Christian Lülf and Denis Mayr Lima Martins and Marcos Antonio Vaz Salles and Yongluan Zhou and Fabian Gieseke Fast Search-by-Classification for Large-Scale Databases Using Index-Aware Decision Trees and Random Forests . . . 2845--2857 Marco Calautti and Mostafa Milani and Andreas Pieris Semi-Oblivious Chase Termination for Linear Existential Rules: an Experimental Studya . . . . . . . . . . 2858--2870 Kukjin Lee and Anshuman Dutt and Vivek Narasayya and Surajit Chaudhuri Analyzing the Impact of Cardinality Estimation on Execution Plans in Microsoft SQL Server . . . . . . . . . . 2871--2883 Jongsung Lee and Donguk Kim and Jae W. Lee WALTZ: Leveraging Zone Append to Tighten the Tail Latency of LSM Tree on ZNS SSD 2884--2896 Matthew Russo and Tatsunori Hashimoto and Daniel Kang and Yi Sun and Matei Zaharia Accelerating Aggregation Queries on Unstructured Streams of Data . . . . . . 2897--2910 Qiushi Bai and Sadeem Alsudais and Chen Li QueryBooster: Improving SQL Performance Using Middleware Services for Human-Centered Query Rewriting . . . . . 2911--2924 Jiongli Zhu and Sainyam Galhotra and Nazanin Sabri and Babak Salimi Consistent Range Approximation for Fair Predictive Modeling . . . . . . . . . . 2925--2938 Haoteng Yin and Muhan Zhang and Jianguo Wang and Pan Li SUREL+: Moving from Walks to Sets for Scalable Subgraph-Based Graph Representation Learning . . . . . . . . 2939--2948 Hanzhi Wang and Zhewei Wei Estimating Single-Node PageRank in $ \tilde {O}(\min d_t, \sqrt {m}) $ Time 2949--2961 Yunjia Zhang and Yannis Chronis and Jignesh M. Patel and Theodoros Rekatsinas Simple Adaptive Query Processing vs. Learned Query Optimizers: Observations and Analysis . . . . . . . . . . . . . . 2962--2975 Helen Xu and Amanda Li and Brian Wheatman and Manoj Marneni and Prashant Pandey BP-Tree: Overcoming the Point-Range Operation Tradeoff for In-Memory B-Trees 2976--2989 Ge Lv and Chen Jason Zhang and Lei Chen HENCE-X: Toward Heterogeneity-Agnostic Multi-Level Explainability for Deep Graph Networks . . . . . . . . . . . . . 2990--3003 Haitao Yuan and Sai Wang and Zhifeng Bao and Shangguang Wang Automatic Road Extraction with Multi-Source Data Revisited: Completeness, Smoothness and Discrimination . . . . . . . . . . . . . 3004--3017 Philipp Fent and Guido Moerkotte and Thomas Neumann Asymptotically Better Query Optimization Using Indexed Algebra . . . . . . . . . 3018--3030 Philipp Skavantzos and Sebastian Link Normalizing Property Graphs . . . . . . 3031--3043 Chunwei Liu and Anna Pavlenko and Matteo Interlandi and Brandon Haynes A Deep Dive into Common Open Formats for Analytical DBMSs . . . . . . . . . . . . 3044--3056 Zezhou Huang and Jiaxiang Liu and Daniel Gbenga Alabi and Raul Castro Fernandez and Eugene Wu Saibot: a Differentially Private Data Search Platform . . . . . . . . . . . . 3057--3070 Zezhou Huang and Rathijit Sen and Jiaxiang Liu and Eugene Wu JoinBoost: Grow Trees over Normalized Data Using Only SQL . . . . . . . . . . 3071--3084
Junichi Tatemura and Tao Zou and Jagan Sankaranarayanan and Yanlai Huang and Jim Chen and Yupu Zhang and Kevin Lai and Hao Zhang and Gokul Nath Babu Manoharan and Goetz Graefe and Divyakant Agrawal and Brad Adelberg and Shilpa Kolhar and Indrajit Roy Progressive Partitioning for Parallelized Query Execution in Google's Napa . . . . . . . . . . . . . . . . . . 3475--3487 Alex Depoutovitch and Chong Chen and Per-Ake Larson and Jack Ng and Shu Lin and Guanzhu Xiong and Paul Lee and Emad Boctor and Samiao Ren and Lengdong Wu and Yuchen Zhang and Calvin Sun Taurus MM: Bringing Multi-Master to the Cloud . . . . . . . . . . . . . . . . . 3488--3500 Yancan Mao and Zhanghao Chen and Yifan Zhang and Meng Wang and Yong Fang and Guanghui Zhang and Rui Shi and Richard T. B. Ma StreamOps: Cloud-Native Runtime Management for Streaming Services in ByteDance . . . . . . . . . . . . . . . 3501--3514 Christoph Anneser and Nesime Tatbul and David Cohen and Zhenggang Xu and Prithviraj Pandian and Nikolay Laptev and Ryan Marcus AutoSteer: Learned Query Optimization for Any SQL Database . . . . . . . . . . 3515--3527 Jianjun Chen and Rui Shi and Heng Chen and Li Zhang and Ruidong Li and Wei Ding and Liya Fan and Hao Wang and Mu Xiong and Yuxiang Chen and Benchao Dong and Kuankuan Guo and Yuanjin Lin and Xiao Liu and Haiyang Shi and Peipei Wang and Zikang Wang and Yemeng Yang and Junda Zhao and Dongyan Zhou and Zhikai Zuo and Yuming Liang Krypton: Real-Time Serving and Analytical SQL Engine at ByteDance . . . 3528--3542 Yuanhang Zou and Zhihao Ding and Jieming Shi and Shuting Guo and Chunchen Su and Yafei Zhang EmbedX: a Versatile, Efficient and Scalable Platform to Embed Both Graphs and High-Dimensional Sparse Data . . . . 3543--3556 Mohit Saxena and Benjamin Sowell and Daiyan Alamgir and Nitin Bahadur and Bijay Bisht and Santosh Chandrachood and Chitti Keswani and G. Krishnamoorthy and Austin Lee and Bohou Li and Zach Mitchell and Vaibhav Porwal and Maheedhar Reddy Chappidi and Brian Ross and Noritaka Sekiyama and Omer Zaki and Linchi Zhang and Mehul A. Shah The Story of AWS Glue . . . . . . . . . 3557--3569 Yang Li and Huaijun Jiang and Yu Shen and Yide Fang and Xiaofeng Yang and Danqing Huang and Xinyi Zhang and Wentao Zhang and Ce Zhang and Peng Chen and Bin Cui Towards General and Efficient Online Tuning for Spark . . . . . . . . . . . . 3570--3583 Jiashu Zhang and Wen Jiang and Bo Tang and Haoxiang Ma and Lixun Cao and Zhongbin Jiang and Yuanyuan Nie and Fan Wang and Lei Zhang and Yuming Liang CDSBen: Benchmarking the Performance of Storage Services in Cloud-Native Database System at ByteDance . . . . . . 3584--3596 Xuanhe Zhou and Cheng Chen and Kunyi Li and Bingsheng He and Mian Lu and Qiaosheng Liu and Wei Huang and Guoliang Li and Zhao Zheng and Yuqiang Chen FEBench: a Benchmark for Real-Time Relational Data Feature Extraction . . . 3597--3609 Fei Xiao and Yuncheng Wu and Meihui Zhang and Gang Chen and Beng Chin Ooi MINT: Detecting Fraudulent Behaviors from Time-Series Relational Data . . . . 3610--3623 Shafi Ahmad and Dillidorai Arumugam and Srdan Bozovic and Elnata Degefa and Sailesh Duvvuri and Steven Gott and Nitish Gupta and Joachim Hammer and Nivedita Kaluskar and Raghav Kaushik and Rakesh Khanduja and Prasad Mujumdar and Gaurav Malhotra and Pankaj Naik and Nikolas Ogg and Krishna Kumar Parthasarthy and Raghu Ramakrishnan and Vlad Rodriguez and Rahul Sharma and Jakub Szymaszek and Andreas Wolter Microsoft Purview: a System for Central Governance of Data . . . . . . . . . . . 3624--3635 Liang Lin and Yuhan Li and Bin Wu and Huijun Mai and Renjie Lou and Jian Tan and Feifei Li Anser: Adaptive Information Sharing Framework of AnalyticDB . . . . . . . . 3636--3648 Christoph Brücke and Philipp Härtling and Rodrigo D Escobar Palacios and Hamesh Patel and Tilmann Rabl TPCx-AI --- An Industry Standard Benchmark for Artificial Intelligence and Machine Learning Systems . . . . . . 3649--3661 Fotis Psallidas and Ashvin Agrawal and Chandru Sugunan and Khaled Ibrahim and Konstantinos Karanasos and Jesús Camacho-Rodr\'ìguez and Avrilia Floratou and Carlo Curino and Raghu Ramakrishnan OneProvenance: Efficient Extraction of Dynamic Coarse-Grained Provenance from Database Query Event Logs . . . . . . . 3662--3675 V. Srinivasan and Andrew Gooding and Sunil Sayyaparaju and Thomas Lopatic and Kevin Porter and Ashish Shinde and B. Narendran Techniques and Efficiencies from Building a Real-Time DBMS . . . . . . . 3676--3688 Jiaqi Wang and Tianyi Li and Anni Wang and Xiaoze Liu and Lu Chen and Jie Chen and Jianye Liu and Junyang Wu and Feifei Li and Yunjun Gao Real-Time Workload Pattern Analysis for Large-Scale Cloud Databases . . . . . . 3689--3701 Jiang Li and Qi Xie and Yan Ma and Jian Ma and Kunshang Ji and Yizhong Zhang and Chaojun Zhang and Yixiu Chen and Gangsheng Wu and Jie Zhang and Kaidi Yang and Xinyi He and Qiuyang Shen and Yanting Tao and Haiwei Zhao and Penghui Jiao and Chengfei Zhu and David Qian and Cheng Xu Big Data Analytic Toolkit: a General-Purpose, Modular, and Heterogeneous Acceleration Toolkit for Data Analytical Engines . . . . . . . . 3702--3714 Chunhui Shen and Qianyu Ouyang and Feibo Li and Zhipeng Liu and Longcheng Zhu and Yujie Zou and Qing Su and Tianhuan Yu and Yi Yi and Jianhong Hu and Cen Zheng and Bo Wen and Hanbang Zheng and Lunfan Xu and Sicheng Pan and Bin Wu and Xiao He and Ye Li and Jian Tan and Sheng Wang and Dan Pei and Wei Zhang and Feifei Li Lindorm TSDB: a Cloud-Native Time-Series Database for Large-Scale Monitoring Systems . . . . . . . . . . . . . . . . 3715--3727 Zhifeng Yang and Quanqing Xu and Shanyan Gao and Chuanhui Yang and Guoping Wang and Yuzhong Zhao and Fanyu Kong and Hao Liu and Wanhong Wang and Jinliang Xiao OceanBase Paetica: a Hybrid Shared-Nothing/Shared-Everything Database for Supporting Single Machine and Distributed Cluster . . . . . . . . 3728--3740 Yuanyuan Yao and Dimeng Li and Hailiang Jie and Hailiang Jie and Tianyi Li and Jie Chen and Jiaqi Wang and Feifei Li and Yunjun Gao SimpleTS: an Efficient and Universal Model Selection Framework for Time Series Forecasting . . . . . . . . . . . 3741--3753 Xinjun Yang and Yingqiang Zhang and Hao Chen and Chuan Sun and Feifei Li and Wenchao Zhou PolarDB-SCC: a Cloud-Native Database Ensuring Low Latency for Strongly Consistent Reads . . . . . . . . . . . . 3754--3767 Hiroyuki Yamada and Toshihiro Suzuki and Yuji Ito and Jun Nemoto ScalarDB: Universal Transaction Manager for Polystores . . . . . . . . . . . . . 3768--3780 Xiaonan Nie and Yi Liu and Fangcheng Fu and Jinbao Xue and Dian Jiao and Xupeng Miao and Yangyu Tao and Bin Cui Angel-PTM: a Scalable and Economical Large-Scale Pre-Training System in Tencent . . . . . . . . . . . . . . . . 3781--3794 Ji You Li and Jiachi Zhang and Wenchao Zhou and Yuhang Liu and Shuai Zhang and Zhuoming Xue and Ding Xu and Hua Fan and Fangyuan Zhou and Feifei Li Eigen: End-to-End Resource Optimization for Large-Scale Databases on the Cloud 3795--3807 Zhicheng Pan and Yihang Wang and Yingying Zhang and Sean Bin Yang and Yunyao Cheng and Peng Chen and Chenjuan Guo and Qingsong Wen and Xiduo Tian and Yunliang Dou and Zhiqiang Zhou and Chengcheng Yang and Aoying Zhou and Bin Yang MagicScaler: Uncertainty-Aware, Predictive Autoscaling . . . . . . . . . 3808--3821 Anna Povzner and Prince Mahajan and Jason Gustafson and Jun Rao and Ismael Juma and Feng Min and Shriram Sridharan and Nikhil Bhatia and Gopi Attaluri and Adithya Chandra and Stanislav Kozlovski and Rajini Sivaram and Lucas Bradstreet and Bob Barrett and Dhruvil Shah and David Jacot and David Arthur and Ron Dagostino and Colin McCabe and Manikumar Reddy Obili and Kowshik Prakasam and Jose Garcia Sancio and Vikas Singh and Alok Nikhil and Kamal Gupta Kora: a Cloud-Native Event Streaming Platform for Kafka . . . . . . . . . . . 3822--3834 Krishna Kantikiran Pasupuleti and Jiakun Li and Hong Su and Mohamed Ziauddin Automatic SQL Error Mitigation in Oracle 3835--3847 Yanli Zhao and Andrew Gu and Rohan Varma and Liang Luo and Chien-Chin Huang and Min Xu and Less Wright and Hamid Shojanazeri and Myle Ott and Sam Shleifer and Alban Desmaison and Can Balioglu and Pritam Damania and Bernard Nguyen and Geeta Chauhan and Yuchen Hao and Ajit Mathews and Shen Li PyTorch FSDP: Experiences on Scaling Fully Sharded Data Parallel . . . . . . 3848--3860
Lina Qiu and Georgios Kellaris and Nikos Mamoulis and Kobbi Nissim and George Kollios Doquet: Differentially Oblivious Range and Join Queries with Private Data Structures . . . . . . . . . . . . . . . 4160--4173 Monica Chiosa and Thomas B. Preußer and Michaela Blott and Gustavo Alonso AMNES: Accelerating the Computation of Data Correlation Using FPGAs . . . . . . 4174--7187 Maureen Daum and Enhao Zhang and Dong He and Stephen Mussmann and Brandon Haynes and Ranjay Krishna and Magdalena Balazinska VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building . . . . . 4188--4201 Pankaj Arora and Surajit Chaudhuri and Sudipto Das and Junfeng Dong and Cyril George and Ajay Kalhan and Arnd Christian König and Willis Lang and Changsong Li and Feng Li and Jiaqi Liu and Lukas M. Maas and Akshay Mata and Ishai Menache and Justin Moeller and Vivek Narasayya and Matthaios Olma and Morgan Oslake and Elnaz Rezai and Yi Shan and Manoj Syamala and Shize Xu and Vasileios Zois Flexible Resource Allocation for Relational Database-as-a-Service . . . . 4202--4215 Rong Gu and Han Li and Haipeng Dai and Wenjie Huang and Jie Xue and Meng Li and Jiaqi Zheng and Haoran Cai and Yihua Huang and Guihai Chen ShadowAQP: Efficient Approximate Group-by and Join Query via Attribute-Oriented Sample Size Allocation and Data Generation . . . . . 4216--4229 Rui Liu and Kwanghyun Park and Fotis Psallidas and Xiaoyong Zhu and Jinghui Mo and Rathijit Sen and Matteo Interlandi and Konstantinos Karanasos and Yuanyuan Tian and Jesús Camacho-Rodríguez Optimizing Data Pipelines for Machine Learning in Feature Stores . . . . . . . 4230--4239 Renzo Angles and Georg Gottlob and Aleksandar Pavlovi\'c and Reinhard Pichler and Emanuel Sallinger SparqLog: a System for Efficient Evaluation of SPARQL 1.1 Queries via Datalog . . . . . . . . . . . . . . . . 4240--4253 Arnd Christian König and Yi Shan and Karan Newatia and Luke Marshall and Vivek Narasayya Solver-In-The-Loop Cluster Resource Management for Database-as-a-Service . . 4254--4267 Justus Henneberg and Felix Schuhknecht RTIndeX: Exploiting Hardware-Accelerated GPU Raytracing for Database Indexing . . 4268--4281 Jinqing Lian and Xinyi Zhang and Yingxia Shao and Zenglin Pu and Qingfeng Xiang and Yawen Li and Bin Cui ContTune: Continuous Tuning by Conservative Bayesian Optimization for Distributed Stream Data Processing Systems . . . . . . . . . . . . . . . . 4282--4295 Dimitrios Melissourgos and Haibo Wang and Shigang Chen and Chaoyi Ma and Shiping Chen Single Update Sketch with Variable Counter Structure . . . . . . . . . . . 4296--4309 Immanuel Trummer Can Large Language Models Predict Data Correlations from Column Names? . . . . 4310--4323 Javad Ghareh Chamani and Ioannis Demertzis and Dimitrios Papadopoulos and Charalampos Papamanthou and Rasool Jalili GraphOS: Towards Oblivious Graph Processing . . . . . . . . . . . . . . . 4324--4338 Kefei Wang and Feng Chen Catalyst: Optimizing Cache Management for Large In-memory Key-value Systems 4339--4352
Bolong Zheng and Yongyong Gao and Jingyi Wan and Lingsen Yan and Long Hu and Bo Liu and Yunjun Gao and Xiaofang Zhou and Christian S. Jensen DecLog: Decentralized Logging in Non-Volatile Memory for Time Series Database Systems . . . . . . . . . . . . 1--14 Fangyuan Zhang and Mengxu Jiang and Sibo Wang Efficient Dynamic Weighted Set Sampling and Its Extension . . . . . . . . . . . 15--27 Yiming Lin and Sharad Mehrotra ZIP: Lazy Imputation during Query Processing . . . . . . . . . . . . . . . 28--40 Xunkai Li and Zhengyu Wu and Wentao Zhang and Yinlin Zhu and Rong-Hua Li and Guoren Wang FedGTA: Topology-Aware Averaging for Federated Graph Learning . . . . . . . . 41--50 Xueqin Chang and Xiangyu Ke and Lu Chen and Congcong Ge and Ziheng Wei and Yunjun Gao Host Profit Maximization: Leveraging Performance Incentives and User Flexibility . . . . . . . . . . . . . . 51--64 Shweta Patwa and Danyu Sun and Amir Gilad and Ashwin Machanavajjhala and Sudeepa Roy DP-PQD: Privately Detecting Per-Query Gaps in Synthetic Data Generated by Black-Box Mechanisms . . . . . . . . . . 65--78
Ruidi Wei and Florian Kerschbaum Cryptographically Secure Private Record Linkage using Locality-Sensitive Hashing 79--91 Simran Arora and Brandon Yang and Sabri Eyuboglu and Avanika Narayan and Andrew Hojel and Immanuel Trummer and Christopher Ré Language Models Enable Simple Systems for Generating Structured Views of Heterogeneous Data Lakes . . . . . . . . 92--105 Jinyang Li and Yuval Moskovitch and Julia Stoyanovich and H. V. Jagadish Query Refinement for Diversity Constraint Satisfaction . . . . . . . . 106--118 Zhaoheng Li and Pranav Gor and Rahul Prabhu and Hui Yu and Yuzhou Mao and Yongjoo Park ElasticNotebook: Enabling Live Migration for Computational Notebooks . . . . . . 119--133 Kecheng Huang and Zhaoyan Shen and Zili Shao and Tong Zhang and Feng Chen Breathing New Life into an Old Tree: Resolving Logging Dilemma of B$^+$-tree on Modern Computational Storage Drives 134--147 Xinyu Zeng and Yulong Hui and Jiahong Shen and Andrew Pavlo and Wes McKinney and Huanchen Zhang An Empirical Evaluation of Columnar Storage Formats . . . . . . . . . . . . 148--161 Yichao Yuan and Haojie Ye and Sanketh Vedula and Wynn Kaza and Nishil Talati Everest: GPU-Accelerated System for Mining Temporal Motifs . . . . . . . . . 162--174 Xueyi Wu and Yuanyuan Xu and Wenjie Zhang and Ying Zhang Billion-Scale Bipartite Graph Embedding: a Global-Local Induced Approach . . . . 175--183 Wangze Ni and Pengze Chen and Lei Chen and Peng Cheng and Chen Jason Zhang and Xuemin Lin Utility-Aware Payment Channel Network Rebalance . . . . . . . . . . . . . . . 184--196 Pengfei Li and Wenqing Wei and Rong Zhu and Bolin Ding and Jingren Zhou and Hua Lu ALECE: an Attention-based Learned Cardinality Estimator for SPJ Queries on Dynamic Workloads . . . . . . . . . . . 197--210 Haojun Xia and Zhen Zheng and Yuchao Li and Donglin Zhuang and Zhongzhu Zhou and Xiafei Qiu and Yong Li and Wei Lin and Shuaiwen Leon Song Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity . . 211--224 Heidi Howard and Fritz Alder and Edward Ashton and Amaury Chamayou and Sylvan Clebsch and Manuel Costa and Antoine Delignat-Lavaud and Cédric Fournet and Andrew Jeffery and Matthew Kerner and Fotios Kounelis and Markus A. Kuppe and Julien Maffre and Mark Russinovich and Christoph M. Wintersteiger Confidential Consortium Framework: Secure Multiparty Applications with Confidentiality, Integrity, and High Availability . . . . . . . . . . . . . . 225--240 Sijing Duan and Feng Lyu and Xin Zhu and Yi Ding and Haotian Wang and Desheng Zhang and Xue Liu and Yaoxue Zhang and Ju Ren VeLP: Vehicle Loading Plan Learning from Human Behavior in Nationwide Logistics System . . . . . . . . . . . . . . . . . 241--249 Aaditya Naik and Aalok Thakkar and Adam Stein and Rajeev Alur and Mayur Naik Relational Query Synthesis $ \bowtie $ Decision Tree Learning . . . . . . . . . 250--263
Feng Yao and Qian Tao and Wenyuan Yu and Yanfeng Zhang and Shufeng Gong and Qiange Wang and Ge Yu and Jingren Zhou RAGraph: a Region-Aware Framework for Geo-Distributed Graph Processing . . . . 264--277 Qiuru Lin and Sai Wu and Junbo Zhao and Jian Dai and Meng Shi and Gang Chen and Feifei Li SmartLite: a DBMS-Based Serving System for DNN Inference in Resource-Constrained Environments . . . 278--291 Shiwen Wu and Qiyu Wu and Honghua Dong and Wen Hua and Xiaofang Zhou Blocker and Matcher Can Mutually Benefit: a Co-Learning Framework for Low-Resource Entity Resolution . . . . . 292--304 Yihao Ang and Qiang Huang and Yifan Bao and Anthony K. H. Tung and Zhiyong Huang TSGBench: Time Series Generation Benchmark . . . . . . . . . . . . . . . 305--318 Wieger R. Punter and Odysseas Papapetrou and Minos Garofalakis OmniSketch: Efficient Multi-Dimensional High-Velocity Stream Analytics with Arbitrary Predicates . . . . . . . . . . 319--331 Kai Hiu Chung and Alexander Zhou and Yue Wang and Lei Chen Maximum Balanced $ (k, \epsilon)$-Bitruss Detection in Signed Bipartite Graph . . . . . . . . . . . . 332--344 Xiao Li and Huan Li and Hua Lu and Christian S. Jensen and Varun Pandey and Volker Markl Missing Value Imputation for Multi-Attribute Sensor Data Streams via Message Propagation . . . . . . . . . . 345--358 Yuhang Chen and Chaoyun Zhang and Minghua Ma and Yudong Liu and Ruomeng Ding and Bowen Li and Shilin He and Saravan Rajmohan and Qingwei Lin and Dongmei Zhang ImDiffusion: Imputed Diffusion Models for Multivariate Time Series Anomaly Detection . . . . . . . . . . . . . . . 359--372 Dajun Sun and Wei Dong and Ke Yi Confidence Intervals for Private Query Processing . . . . . . . . . . . . . . . 373--385 Zhiyu Liang and Jianfeng Zhang and Chen Liang and Hongzhi Wang and Zheng Liang and Lujia Pan A Shapelet-Based Framework for Unsupervised Multivariate Time Series Representation Learning . . . . . . . . 386--399 Letong Wang and Xiangyun Ding and Yan Gu and Yihan Sun Fast and Space-Efficient Parallel Algorithms for Influence Maximization 400--413 Yile Chen and Gao Cong and Cuauhtemoc Anda TERI: an Effective Framework for Trajectory Recovery with Irregular Time Intervals . . . . . . . . . . . . . . . 414--426 Yuhan Chen and Haojie Ye and Sanketh Vedula and Alex Bronstein and Ronald Dreslinski and Trevor Mudge and Nishil Talati Demystifying Graph Sparsification Algorithms in Graph Properties Preservation . . . . . . . . . . . . . . 427--440 Jiashen Cao and Rathijit Sen and Matteo Interlandi and Joy Arulraj and Hyesoon Kim GPU Database Systems Characterization and Optimization . . . . . . . . . . . . 441--454 Chaoyi Chen and Dechao Gao and Yanfeng Zhang and Qiange Wang and Zhenbo Fu and Xuecang Zhang and Junhua Zhu and Yu Gu and Ge Yu NeutronStream: a Dynamic GNN Training Framework with Sliding Window for Graph Streams . . . . . . . . . . . . . . . . 455--468 Joshua Hildred and Michael Abebe and Khuzaima Daudjee Caerus: Low-Latency Distributed Transactions for Geo-Replicated Systems 469--482 Aoqian Zhang and Shuqing Deng and Dongping Cui and Ye Yuan and Guoren Wang An Experimental Evaluation of Anomaly Detection in Time Series . . . . . . . . 483--496 Mukul Singh and José Cambronero and Sumit Gulwani and Vu Le and Carina Negreanu and Elnaz Nouri and Mohammad Raza and Gust Verbruggen FormaT5: Abstention and Examples for Conditional Table Formatting with Natural Language . . . . . . . . . . . . 497--510 Manuel Schönberger and Immanuel Trummer and Wolfgang Mauerer Quantum-Inspired Digital Annealing for Join Ordering . . . . . . . . . . . . . 511--524 Mashaal Musleh and Mohamed F. Mokbel Kamel: a Scalable BERT-Based System for Trajectory Imputation . . . . . . . . . 525--538 Xinyi Zhang and Hong Wu and Yang Li and Zhengju Tang and Jian Tan and Feifei Li and Bin Cui An Efficient Transfer Learning Based Configuration Adviser for Database Tuning . . . . . . . . . . . . . . . . . 539--552 Adrien Petralia and Philippe Charpentier and Themis Palpanas ADF & TransApp: a Transformer-Based Framework for Appliance Detection Using Smart Meter Consumption Series . . . . . 553--562 Sarah Wooders and Xiangxi Mo and Amit Narang and Kevin Lin and Ion Stoica and Joseph M. Hellerstein and Natacha Crooks and Joseph E. Gonzalez RALF: Accuracy-Aware Scheduling for Feature Store Maintenance . . . . . . . 563--576 Kaisong Huang and Tianzheng Wang and Qingqing Zhou and Qingzhong Meng The Art of Latency Hiding in Modern Database Engines . . . . . . . . . . . . 577--590 Mohammad Matin Najafi and Chenhao Ma and Xiaodong Li and Reynold Cheng and Laks V. S. Lakshmanan MOSER: Scalable Network Motif Discovery Using Serial Test . . . . . . . . . . . 591--603 Dongxiang Zhang and Teng Ma and Junnan Hu and Yijun Bei and Kian-Lee Tan and Gang Chen Co-Movement Pattern Mining from Videos 604--616
Qian Ge and Yu Liu and Yinghao Zhao and Yuetian Sun and Lei Zou and Yuxing Chen and Anqun Pan Efficient and Accurate SimRank-Based Similarity Joins: Experiments, Analysis, and Improvement . . . . . . . . . . . . 617--629 Wentao Li and Maolin Cai and Min Gao and Dong Wen and Lu Qin and Wei Wang Expanding Reverse Nearest Neighbors . . 630--642 Fuheng Zhao and Divyakant Agrawal and Amr El Abbadi and Ahmed Metwally and Claire Mathieu and Michel de Rougemont Errata for ``SpaceSaving$ \pm $: an Optimal Algorithm for Frequency Estimation and Frequent Items in the Bounded-Deletion Model'' . . . . . . . . 643 Tarique Siddiqui and Vivek Narasayya and Marius Dumitru and Surajit Chaudhuri Cache-Efficient Top-$k$ Aggregation over High Cardinality Large Datasets . . . . 644--656 Xinwei Cai and Xiangyu Ke and Kai Wang and Lu Chen and Tianming Zhang and Qing Liu and Yunjun Gao Efficient Temporal Butterfly Counting and Enumeration on Temporal Bipartite Graphs . . . . . . . . . . . . . . . . . 657--670 Tianxiong Zhong and Zhiwei Zhang and Guo Lu and Ye Yuan and Yu-Ping Wang and Guoren Wang TVM: a Tile-based Video Management Framework . . . . . . . . . . . . . . . 671--684 Yi Zhang and Jan Deriu and George Katsogiannis-Meimarakis and Catherine Kosten and Georgia Koutrika and Kurt Stockinger ScienceBenchmark: a Complex Real-World Benchmark for Evaluating Natural Language to SQL Systems . . . . . . . . 685--698 Lu Chen and Chengfei Liu and Rui Zhou and Kewen Liao and Jiajie Xu and Jianxin Li Densest Multipartite Subgraph Search in Heterogeneous Information Networks . . . 699--711 Kabir Nagrecha and Arun Kumar Saturn: an Optimized Data System for Multi-Large-Model Deep Learning Workloads . . . . . . . . . . . . . . . 712--725 Miao Cai and Junru Shen and Yifan Yuan and Zhihao Qu and Baoliu Ye BonsaiKV: Towards Fast, Scalable, and Persistent Key--Value Stores with Tiered, Heterogeneous Memory System . . 726--739 Silvan Reiner and Michael Grossniklaus Sample-Efficient Cardinality Estimation Using Geometric Deep Learning . . . . . 740--752 Kai Zhao and Chenjuan Guo and Yunyao Cheng and Peng Han and Miao Zhang and Bin Yang Multiple Time Series Forecasting with Dynamic Graph Modeling . . . . . . . . . 753--765 Yunyao Cheng and Peng Chen and Chenjuan Guo and Kai Zhao and Qingsong Wen and Bin Yang and Christian S. Jensen Weakly Guided Adaptation for Robust Time Series Forecasting . . . . . . . . . . . 766--779 Rui Yang and Evgenios M. Kornaropoulos and Yue Cheng Algorithmic Complexity Attacks on Dynamic Learned Indexes . . . . . . . . 780--793 Jiaqi Zhu and Shaofeng Cai and Fang Deng and Beng Chin Ooi and Wenqiao Zhang METER: a Dynamic Concept Adaptation Framework for Online Anomaly Detection 794--807 Hailin Zhang and Penghao Zhao and Xupeng Miao and Yingxia Shao and Zirui Liu and Tong Yang and Bin Cui Experimental Analysis of Large-Scale Learnable Vector Storage Compression . . 808--822 Yue Zhao and Zhaodonghui Li and Gao Cong A Comparative Study and Component Analysis of Query Plan Representation Techniques in ML4DB Studies . . . . . . 823--835 Zeyang Zhuang and Penghui Li and Pingchuan Ma and Wei Meng and Shuai Wang Testing Graph Database Systems via Graph-Aware Metamorphic Relations . . . 836--848 Tianji Cong and Madelon Hulsebos and Zhenjie Sun and Paul Groth and H. V. Jagadish Observatory: Characterizing Embeddings of Relational Tables . . . . . . . . . . 849--862 Taeyoon Kim and ChanHo Park and Mansur Mukimbekov and Heelim Hong and Minseok Kim and Ze Jin and Changdae Kim and Ji-Yong Shin and Myeongjae Jeon FusionFlow: Accelerating Data Preprocessing for Machine Learning with CPU--GPU Cooperation . . . . . . . . . . 863--876 Hubert Mohr-Daurat and Xuan Sun and Holger Pirk BOSS --- an Architecture for Database Kernel Composition . . . . . . . . . . . 877--890 Xiangyu Zhi and Xiao Yan and Bo Tang and Ziyao Yin and Yanchao Zhu and Minqi Zhou CoroGraph: Bridging Cache Efficiency and Work Efficiency for Graph Algorithm Execution . . . . . . . . . . . . . . . 891--903 Audrey Cheng and Jack Waudby and Hugo Firth and Natacha Crooks and Ion Stoica Mammoths are Slow: The Overlooked Transactions of Graph Data . . . . . . . 904--911 Enyuan Zhou and Song Guo and Zicong Hong and Christian S. Jensen and Yang Xiao and Dalin Zhang and Jinwen Liang and Qingqi Pei VeriDKG: a Verifiable SPARQL Query Engine for Decentralized Knowledge Graphs . . . . . . . . . . . . . . . . . 912--925
Lianggui Weng and Rong Zhu and Di Wu and Bolin Ding and Bolong Zheng and Jingren Zhou Eraser: Eliminating Performance Regression on Learned Query Optimizer 926--938 Chao Zhang and Guoliang Li and Tao Lv HyBench: a New Benchmark for HTAP Databases . . . . . . . . . . . . . . . 939--951 Ki Hyun Tae and Hantian Zhang and Jaeyoung Park and Kexin Rong and Steven Euijong Whang Falcon: Fair Active Learning Using Multi-Armed Bandits . . . . . . . . . . 952--965 Honghu Wu and Xiangrong Zhu and Wei Hu A Blockchain System for Clustered Federated Learning with Peer-to-Peer Knowledge Transfer . . . . . . . . . . . 966--979 Rong Zhu and Lianggui Weng and Wenqing Wei and Di Wu and Jiazhen Peng and Yifan Wang and Bolin Ding and Defu Lian and Bolong Zheng and Jingren Zhou PilotScope: Steering Databases with Machine Learning Drivers . . . . . . . . 980--993 Yishuai Li and Yunfeng Zhu and Chao Shi and Guanhua Zhang and Jianzhong Wang and Xiaolu Zhang Timestamp as a Service, Not an Oracle 994--1006 Junjie Xing and Xinyu Wang and H. V. Jagadish Data-Driven Insight Synthesis for Multi-Dimensional Data . . . . . . . . . 1007--1019 Naili Xing and Shaofeng Cai and Gang Chen and Zhaojing Luo and Beng Chin Ooi and Jian Pei Database Native Model Selection: Harnessing Deep Neural Networks in Database Systems . . . . . . . . . . . . 1020--1033 Kaiyu Chen and Dong Wen and Wenjie Zhang and Ying Zhang and Xiaoyang Wang and Xuemin Lin Querying Structural Diversity in Streaming Graphs . . . . . . . . . . . . 1034--1046 Xiangyang Gou and Xinyi Ye and Lei Zou and Jeffrey Xu Yu LM-SRPQ: Efficiently Answering Regular Path Query in Streaming Graphs . . . . . 1047--1059 Shihong Gao and Yiming Li and Yanyan Shen and Yingxia Shao and Lei Chen ETC: Efficient Training of Temporal Graph Neural Networks over Large-Scale Dynamic Graphs . . . . . . . . . . . . . 1060--1072 Chenyuan Wu and Mohammad Javad Amiri and Haoyun Qin and Bhavana Mehta and Ryan Marcus and Boon Thau Loo Towards Full Stack Adaptivity in Permissioned Blockchains . . . . . . . . 1073--1080 Jindong Han and Weijia Zhang and Hao Liu and Tao Tao and Naiqiang Tan and Hui Xiong BigST: Linear Complexity Spatio-Temporal Graph Neural Network for Traffic Forecasting on Large-Scale Road Networks 1081--1090 Xinhao Min and Kai Lu and Pengyu Liu and Jiguang Wan and Changsheng Xie and Daohui Wang and Ting Yao and Huatao Wu SepHash: a Write-Optimized Hash Index On Disaggregated Memory via Separate Segment Structure . . . . . . . . . . . 1091--1104 Dahai Tang and Jiali Wang and Rong Chen and Lei Wang and Wenyuan Yu and Jingren Zhou and Kenli Li XGNN: Boosting Multi-GPU GNN Training via Global GNN Memory Store . . . . . . 1105--1118 Youming Tao and Cheng-Long Wang and Miao Pan and Dongxiao Yu and Xiuzhen Cheng and Di Wang Communication Efficient and Provable Federated Unlearning . . . . . . . . . . 1119--1131 Dawei Gao and Haibin Wang and Yaliang Li and Xiuyu Sun and Yichen Qian and Bolin Ding and Jingren Zhou Text-to-SQL Empowered by Large Language Models: a Benchmark Evaluation . . . . . 1132--1145 Anh L. Mai and Pengyu Wang and Azza Abouzied and Matteo Brucato and Peter J. Haas and Alexandra Meliou Scaling Package Queries to a Billion Tuples via Hierarchical Partitioning and Customized Optimization . . . . . . . . 1146--1158
Yuhao Deng and Chengliang Chai and Lei Cao and Nan Tang and Jiayi Wang and Ju Fan and Ye Yuan and Guoren Wang MisDetect: Iterative Mislabel Detection using Early Loss . . . . . . . . . . . . 1159--1172 Wenfei Fan and Muyang Liu and Shuhao Liu and Chao Tian Capturing More Associations by Referencing External Graphs . . . . . . 1173--1186 Longlong Lin and Pingpeng Yuan and Rong-Hua Li and Chunxue Zhu and Hongchao Qin and Hai Jin and Tao Jia QTCS: Efficient Query-Centered Temporal Community Search . . . . . . . . . . . . 1187--1199 Jie Fu and Qingqing Ye and Haibo Hu and Zhili Chen and Lulu Wang and Kuncan Wang and Xun Ran DPSUR: Accelerating Differentially Private Stochastic Gradient Descent Using Selective Update and Release . . . 1200--1213 Alexander Erben and Ruben Mayer and Hans-Arno Jacobsen How Can We Train Deep Learning Models Across Clouds and Continents? An Experimental Study . . . . . . . . . . . 1214--1226 Jeongmin Brian Park and Vikram Sharma Mailthody and Zaid Qureshi and Wen-mei Hwu Accelerating Sampling and Aggregation Operations in GNN Frameworks with GPU Initiated Direct Storage Accesses . . . 1227--1240 Hao Yuan and Yajiong Liu and Yanfeng Zhang and Xin Ai and Qiange Wang and Chaoyi Chen and Yu Gu and Ge Yu Comprehensive Evaluation of GNN Training Systems: a Data Management Perspective 1241--1254 Tsz Nam Chan and Rui Zang and Bojian Zhu and Leong Hou U. and Dingming Wu and Jianliang Xu LION: Fast and High-Resolution Network Kernel Density Visualization . . . . . . 1255--1268 Zitao Li and Bolin Ding and Liuyi Yao and Yaliang Li and Xiaokui Xiao and Jingren Zhou Performance-Based Pricing for Federated Learning via Auction . . . . . . . . . . 1269--1282 Yiqun Diao and Yutong Yang and Qinbin Li and Bingsheng He and Mian Lu OEBench: Investigating Open Environment Challenges in Real-World Relational Data Streams . . . . . . . . . . . . . . . . 1283--1296 Jiadong Xie and Zehua Chen and Deming Chu and Fan Zhang and Xuemin Lin and Zhihong Tian Influence Maximization via Vertex Countering . . . . . . . . . . . . . . . 1297--1309 Tingting Wang and Shixun Huang and Zhifeng Bao and J. Shane Culpepper and Volkan Dedeoglu and Reza Arablouei Optimizing Data Acquisition to Enhance Machine Learning Performance . . . . . . 1310--1323 Xin Chen and Jieming Shi and You Peng and Wenqing Lin and Sibo Wang and Wenjie Zhang Minimum Strongly Connected Subgraph Collection in Dynamic Graphs . . . . . . 1324--1336 Junhao Zhu and Yuren Mao and Lu Chen and Congcong Ge and Ziheng Wei and Yunjun Gao FusionQuery: On-demand Fusion Queries over Multi-source Heterogeneous Data . . 1337--1349 David Justen and Daniel Ritter and Campbell Fraser and Andrew Lamb and Allison Lee and Thomas Bodner and Mhd Yamen Haddad and Steffen Zeuch and Volker Markl and Matthias Boehm POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance . . . . . . . . . . . . . . . 1350--1363 Zhiyuan Li and Xun Jian and Yue Wang and Yingxia Shao and Lei Chen DAHA: Accelerating GNN Training with Data and Hardware Aware Execution Planning . . . . . . . . . . . . . . . . 1364--1376 Ziyi Lu and Qiang Cao and Hong Jiang and Yuxing Chen and Jie Yao and Anqun Pan FluidKV: Seamlessly Bridging the Gap between Indexing Performance and Memory-Footprint on Ultra-Fast Storage 1377--1390 Vraj Shah and Thomas Parashos and Arun Kumar How Do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses 1391--1404 Pengjie Cui and Haotian Liu and Bo Tang and Ye Yuan CGgraph: an Ultra-Fast Graph Processing System on Modern Commodity CPU--GPU Co-processor . . . . . . . . . . . . . . 1405--1417 Xinyu Chen and Jiannan Tian and Ian Beaver and Cynthia Freeman and Yan Yan and Jianguo Wang and Dingwen Tao FCBench: Cross-Domain Benchmarking of Lossless Compression for Floating-Point Data . . . . . . . . . . . . . . . . . . 1418--1431 Aaron Hurst and Daniel E. Lucani and Qi Zhang PairwiseHist: Fast, Accurate and Space-Efficient Approximate Query Processing with Data Compression . . . . 1432--1445 Huayi Zhang and Binwei Yan and Lei Cao and Samuel Madden and Elke Rundensteiner MetaStore: Analyzing Deep Learning Meta-Data at Scale . . . . . . . . . . . 1446--1459 Yangming Lv and Kai Zhang and Ziming Wang and Xiaodong Zhang and Rubao Lee and Zhenying He and Yinan Jing and X. Sean Wang RTScan: Efficient Scan with Ray Tracing Cores . . . . . . . . . . . . . . . . . 1460--1472 Kezhao Huang and Haitian Jiang and Minjie Wang and Guangxuan Xiao and David Wipf and Xiang Song and Quan Gan and Zengfeng Huang and Jidong Zhai and Zheng Zhang FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training . . . . . . . . 1473--1486 Ying Zheng and Kian-Lee Tan Sorting on Byte-Addressable Storage: The Resurgence of Tree Structure . . . . . . 1487--1500 Xenofon Chatziliadis and Eleni Tzirita Zacharatou and Alphan Eracar and Steffen Zeuch and Volker Markl Efficient Placement of Decomposable Aggregation Functions for Stream Processing over Large Geo-Distributed Topologies . . . . . . . . . . . . . . . 1501--1514 Jiamin Hou and Zhanhao Zhao and Zhouyu Wang and Wei Lu and Guodong Jin and Dong Wen and Xiaoyong Du AeonG: an Efficient Built-in Temporal Support in Graph Databases . . . . . . . 1515--1527
Tao Yu and Zhaonian Zou and Weihua Sun and Yu Yan Refactoring Index Tuning Process with Benefit Estimation . . . . . . . . . . . 1528--1541 Xunkai Li and Meihao Liao and Zhengyu Wu and Daohan Su and Wentao Zhang and Rong-Hua Li and Guoren Wang LightDiC: a Simple Yet Effective Approach for Large-Scale Digraph Representation Learning . . . . . . . . 1542--1551 Shulei Kuang and Honghui Yang and Zijing Tan and Shuai Ma Efficient Differential Dependency Discovery . . . . . . . . . . . . . . . 1552--1564 Claude Lehmann and Pavel Sulimov and Kurt Stockinger Is Your Learned Query Optimizer Behaving As You Expect? A Machine Learning Perspective . . . . . . . . . . . . . . 1565--1577 Zhuoxing Zhang and Sebastian Link Mixed Covers of Keys and Functional Dependencies for Maintaining the Integrity of Data under Updates . . . . 1578--1590 Yuhao Deng and Yu Wang and Lei Cao and Lianpeng Qiao and Yuping Wang and Jingzhe Xu and Yizhou Yan and Samuel Madden Outlier Summarization via Human Interpretable Rules . . . . . . . . . . 1591--1604 Haitao Yuan and Gao Cong and Guoliang Li Nuhuo: an Effective Estimation Model for Traffic Speed Histogram Imputation on a Road Network . . . . . . . . . . . . . . 1605--1617 Deepak Ravikumar and Alex Yeo and Yiwen Zhu and Aditya Lakra and Harsha Nagulapalli and Santhosh Ravindran and Steve Suh and Niharika Dutta and Andrew Fogarty and Yoonjae Park and Sumeet Khushalani and Arijit Tarafdar and Kunal Parekh and Subru Krishnan Intelligent Pooling: Proactive Resource Provisioning in Large-scale Cloud Service . . . . . . . . . . . . . . . . 1618--1627 Yutong Ye and Xiang Lian and Mingsong Chen Efficient Exact Subgraph Matching via GNN-Based Path Dominance Embedding . . . 1628--1641 Zijia Wang and Haoran Liu and Chen Lin and Zhifeng Bao and Guoliang Li and Tianqing Wang Leveraging Dynamic and Heterogeneous Workload Knowledge to Boost the Performance of Index Advisors . . . . . 1642--1654 Otmar Ertl UltraLogLog: a Practical and More Space-Efficient Alternative to HyperLogLog for Approximate Distinct Counting . . . . . . . . . . . . . . . . 1655--1668 Zengyang Gong and Yuxiang Zeng and Lei Chen Real-Time Insertion Operator for Shared Mobility on Time-Dependent Road Networks 1669--1682 Dayi Fan and Rubao Lee and Xiaodong Zhang X-TED: Massive Parallelization of Tree Edit Distance . . . . . . . . . . . . . 1683--1696 Wonseok Shin and Siwoo Song and Kunsoo Park and Wook-Shin Han Cardinality Estimation of Subgraph Matching: a Filtering-Sampling Approach 1697--1709 Qi Liang and Dian Ouyang and Fan Zhang and Jianye Yang and Xuemin Lin and Zhihong Tian Efficient Regular Simple Path Queries under Transitive Restricted Expressions 1710--1722 Shuhan Zhong and Sizhe Song and Weipeng Zhuo and Guanyao Li and Yang Liu and S.-H. Gary Chan A Multi-Scale Decomposition MLP-Mixer for Time Series Analysis . . . . . . . . 1723--1736 Haocheng Xia and Xiang Li and Junyuan Pang and Jinfei Liu and Kui Ren and Li Xiong P-Shapley: Shapley Values on Probabilistic Classifiers . . . . . . . 1737--1750 Wenjia He and Ibrahim Sabek and Yuze Lou and Michael Cafarella Optimizing Video Selection LIMIT Queries with Commonsense Knowledge . . . . . . . 1751--1764 Nan Huo and Reynold Cheng and Ben Kao and Wentao Ning and Nur Al Hasan Haldar and Xiaodong Li and Jinyang Li and Mohammad Matin Najafi and Tian Li and Ge Qu ZeroEA: a Zero-Training Entity Alignment Framework via Pre-Trained Language Model 1765--1774 Xueli Liu and Bowen Dong and Wenzhi Fu and Nannan Wu and Xin Wang and Wenjun Wang Extending Graph Rules with Oracles . . . 1775--1787
Junyi Mei and Shixuan Sun and Chao Li and Cheng Xu and Cheng Chen and Yibo Liu and Jing Wang and Cheng Zhao and Xiaofeng Hou and Minyi Guo and Bingsheng He and Xiaoliang Cong FlowWalker: a Memory-Efficient and High-Performance GPU-Based Dynamic Graph Random Walk Framework . . . . . . . . . 1788--1801 Minsu Kim and Jinwoo Hwang and Guseul Heo and Seiyeon Cho and Divya Mahajan and Jongse Park Accelerating String-Key Learned Index Structures via Memoization-Based Incremental Training . . . . . . . . . . 1802--1815 Xuankun Liao and Qing Liu and Xin Huang and Jianliang Xu Truss-Based Community Search over Streaming Directed Graphs . . . . . . . 1816--1829 Ricardo Salazar-Díaz and Boris Glavic and Tilmann Rabl InferDB: In-Database Machine Learning Inference Using Indexes . . . . . . . . 1830--1842 Fei Wei and Ergute Bao and Xiaokui Xiao and Yin Yang and Bolin Ding AAA: an Adaptive Mechanism for Locally Differentially Private Mean Estimation 1843--1855 Yangshen Deng and Muxi Yan and Bo Tang Accelerating Merkle Patricia Trie with GPU . . . . . . . . . . . . . . . . . . 1856--1869 Shaowei Wang and Yun Peng and Jin Li and Zikai Wen and Zhipeng Li and Shiyu Yu and Di Wang and Wei Yang Privacy Amplification via Shuffling: Unified, Simplified, and Tightened . . . 1870--1883 Jiansen Song and Wensheng Dou and Yu Gao and Ziyu Cui and Yingying Zheng and Dong Wang and Wei Wang and Jun Wei and Tao Huang Detecting Metadata-Related Logic Bugs in Database Systems via Raw Database Construction . . . . . . . . . . . . . . 1884--1897 Biao Wu and Qiang Huang and Anthony K. H. Tung From Zero to Hero: Detecting Leaked Data through Synthetic Data Injection and Model Querying . . . . . . . . . . . . . 1898--1910 Guanduo Chen and Zhenying He and Meng Li and Siqiang Luo Oasis: an Optimal Disjoint Segmented Learned Range Filter . . . . . . . . . . 1911--1924 Yuhao Deng and Chengliang Chai and Lei Cao and Qin Yuan and Siyuan Chen and Yanrui Yu and Zhaoze Sun and Junyi Wang and Jiajun Li and Ziqi Cao and Kaisen Jin and Chi Zhang and Yuqing Jiang and Yuanfang Zhang and Yuping Wang and Ye Yuan and Guoren Wang and Nan Tang LakeBench: a Benchmark for Discovering Joinable and Unionable Tables in Data Lakes . . . . . . . . . . . . . . . . . 1925--1938 Jiale Lao and Yibo Wang and Yufei Li and Jianping Wang and Yunjia Zhang and Zhiyuan Cheng and Wanghu Chen and Mingjie Tang and Jianguo Wang GPTuner: a Manual-Reading Database Tuning System via GPT-Guided Bayesian Optimization . . . . . . . . . . . . . . 1939--1952 Arik Ermshaus and Patrick Schäfer and Ulf Leser Raising the ClaSS of Streaming Time Series Segmentation . . . . . . . . . . 1953--1966 Qiyan Li and Jeffrey Xu Yu Fast Local Subgraph Counting . . . . . . 1967--1980 Yunjia Zhang and Jordan Henkel and Avrilia Floratou and Joyce Cahoon and Shaleen Deep and Jignesh M. Patel ReAcTable: Enhancing ReAct for Table Question Answering . . . . . . . . . . . 1981--1994 Xin Ai and Qiange Wang and Chunyu Cao and Yanfeng Zhang and Chaoyi Chen and Hao Yuan and Yu Gu and Ge Yu NeutronOrch: Rethinking Sample-Based GNN Training under CPU--GPU Heterogeneous Environments . . . . . . . . . . . . . . 1995--2008 Zifan Liu and Shaleen Deep and Anna Fariha and Fotis Psallidas and Ashish Tiwari and Avrilia Floratou Rapidash: Efficient Detection of Constraint Violations . . . . . . . . . 2009--2021 Shubhankar Mohapatra and Jianqiao Zong and Florian Kerschbaum and Xi He Differentially Private Data Generation with Missing Data . . . . . . . . . . . 2022--2035 Zhaoyuan Su and Ammar Ahmed and Zirui Wang and Ali Anwar and Yue Cheng Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask . . . . . . . . . . . . . . . . . . 2036--2049 Haoyang Li and Shimin Di and Calvin Hong Yi Li and Lei Chen and Xiaofang Zhou Fight Fire with Fire: Towards Robust Graph Neural Networks on Dynamic Graphs via Actively Defense . . . . . . . . . . 2050--2063 Farzaneh Zirak and Farhana Choudhury and Renata Borovica-Gajic SeLeP: Learning Based Semantic Prefetching for Exploratory Database Workloads . . . . . . . . . . . . . . . 2064--2076 Yiwei Chen and Kaiyu Li and Guoliang Li and Yong Wang Contributions Estimation in Federated Learning: a Comprehensive Experimental Evaluation . . . . . . . . . . . . . . . 2077--2090 Stavros Maroulis and Vassilis Stamatopoulos and George Papastefanatos and Manolis Terrovitis Visualization-Aware Time Series Min-Max Caching with Error Bound Guarantees . . 2091--2103 Moe Kayali and Anton Lykov and Ilias Fountalis and Nikolaos Vasiloglou and Dan Olteanu and Dan Suciu Chorus: Foundation Models for Unified Data Discovery and Exploration . . . . . 2104--2114 Viktor Leis and Christian Dietrich Cloud-Native Database Systems and Unikernels: Reimagining OS Abstractions for Modern Hardware . . . . . . . . . . 2115--2122
Haoran Xiong and Hang Zhang and Zeyu Wang and Zhenying He and Peng Wang and X. Sean Wang CIVET: Exploring Compact Index for Variable-Length Subsequence Matching on Time Series . . . . . . . . . . . . . . 2123--2135 Chanwut Kittivorawong and Yongming Ge and Yousef Helal and Alvin Cheung Spatialyze: a Geospatial Video Analytics System with Spatial-Aware Optimizations 2136--2148 Hanyan Yin and Dongxie Wen and Jiajun Li and Zhewei Wei and Xiao Zhang and Zengfeng Huang and Feifei Li Optimal Matrix Sketching over Sliding Windows . . . . . . . . . . . . . . . . 2149--2161 Radim Baca Window Function Expression: Let the Self-Join Enter . . . . . . . . . . . . 2162--2174 Aarati Kakaraparthy and Jignesh M. Patel SplitDF: Splitting Dataframes for Memory-Efficient Data Analysis . . . . . 2175--2184 Majid Daliri and Juliana Freire and Christopher Musco and Aécio Santos and Haoxiang Zhang Sampling Methods for Inner Product Sketching . . . . . . . . . . . . . . . 2185--2197 Han Hu and Jiye Qiu and Hongzhi Wang and Bin Liang and Songling Zou DIDS: Double Indices and Double Summarizations for Fast Similarity Search . . . . . . . . . . . . . . . . . 2198--2211 Qian Xu and Juan Yang and Feng Zhang and Zheng Chen and Jiawei Guan and Kang Chen and Ju Fan and Youren Shen and Ke Yang and Yu Zhang and Xiaoyong Du Improving Graph Compression for Efficient Resource-Constrained Graph Analytics . . . . . . . . . . . . . . . 2212--2226 Jianwei Wang and Kai Wang and Xuemin Lin and Wenjie Zhang and Ying Zhang Efficient Unsupervised Community Search with Pre-Trained Graph Transformer . . . 2227--2240 Jiuqi Wei and Botao Peng and Xiaodong Lee and Themis Palpanas DET-LSH: a Locality-Sensitive Hashing Scheme with Dynamic Encoding Tree for Approximate Nearest Neighbor Search . . 2241--2254 Haoyu Liu and Siqiang Luo BIRD: Efficient Approximation of Bidirectional Hidden Personalized PageRank . . . . . . . . . . . . . . . . 2255--2268 Zihao Yu and Ningyi Liao and Siqiang Luo GENTI: GPU-Powered Walk-Based Subgraph Extraction for Scalable Representation Learning on Dynamic Graphs . . . . . . . 2269--2278 Benjamin Feuer and Yurong Liu and Chinmay Hegde and Juliana Freire ArcheType: a Novel Framework for Open-Source Column Type Annotation Using Large Language Models . . . . . . . . . 2279--2292 Yanchuan Chang and Egemen Tanin and Gao Cong and Christian S. Jensen and Jianzhong Qi Trajectory Similarity Measurement: an Efficiency Perspective . . . . . . . . . 2293--2306 Brian Wheatman and Xiaojun Dong and Zheqi Shen and Laxman Dhulipala and Jakub Lacki and Prashant Pandey and Helen Xu BYO: a Unified Framework for Benchmarking Large-Scale Graph Containers . . . . . . . . . . . . . . . 2307--2320 Yizheng Zhu and Yuncheng Wu and Zhaojing Luo and Beng Chin Ooi and Xiaokui Xiao Secure and Verifiable Data Collaboration with Low-Cost Zero-Knowledge Proofs . . 2321--2334 Heena Nagda and Shubhendra Pal Singhal and Mohammad Javad Amiri and Boon Thau Loo Rashnu: Data-Dependent Order-Fairness 2335--2348 Yuchuan Huang and Mohamed F. Mokbel Sparcle: Boosting the Accuracy of Data Cleaning Systems through Spatial Awareness . . . . . . . . . . . . . . . 2349--2362 Xiangfei Qiu and Jilin Hu and Lekui Zhou and Xingjian Wu and Junyang Du and Buang Zhang and Chenjuan Guo and Aoying Zhou and Christian S. Jensen and Zhenli Sheng and Bin Yang TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods . . . . . . . . . . . . . . . . 2363--2377 Chengjun Liu and Zhuo Peng and Weiguo Zheng and Lei Zou FSM: a Fine-Grained Splitting and Merging Framework for Dual-Balanced Graph Partition . . . . . . . . . . . . 2378--2391 Stefano Marchesin and Gianmaria Silvello Efficient and Reliable Estimation of Knowledge Graph Accuracy . . . . . . . . 2392--2403
Wei Zhou and Chen Lin and Xuanhe Zhou and Guoliang Li Breaking It Down: an In-Depth Study of Index Advisors . . . . . . . . . . . . . 2405--2418 Wen Deng and Weiguo Zheng and Hong Cheng Accelerating Maximal Clique Enumeration via Graph Reduction . . . . . . . . . . 2419--3431 Jiyang Bai and Peixiang Zhao Poligras: Policy-Based Graph Summarization . . . . . . . . . . . . . 2432--2444 Leqian Zheng and Lei Xu and Cong Wang and Sheng Wang and Yuke Hu and Zhan Qin and Feifei Li and Kui Ren SWAT: a System-Wide Approach to Tunable Leakage Mitigation in Encrypted Data Stores . . . . . . . . . . . . . . . . . 2445--2458 Kai Wang and Yuwei Xu and Siqiang Luo TIGER: Training Inductive Graph Neural Network for Large-Scale Knowledge Graph Reasoning . . . . . . . . . . . . . . . 2459--2472 Chao Zhang and Angela Bonifati and M. Tamer Özsu Incremental Sliding Window Connectivity over Streaming Graphs . . . . . . . . . 2473--2486 Qingpeng Cai and Kaiping Zheng and H. V. Jagadish and Beng Chin Ooi and James Yip CohortNet: Empowering Cohort Discovery for Interpretable Healthcare Analytics 2487--2500 Jinghao Wang and Yanping Wu and Xiaoyang Wang and Ying Zhang and Lu Qin and Wenjie Zhang and Xuemin Lin Efficient Influence Minimization via Node Blocking . . . . . . . . . . . . . 2501--2513 Xuanhe Zhou and Guoliang Li and Zhaoyan Sun and Zhiyuan Liu and Weize Chen and Jianming Wu and Jiesi Liu and Ruohang Feng and Guoyang Zeng D-Bot: Database Diagnosis System using Large Language Models . . . . . . . . . 2514--2527 Yiming Qiao and Yihan Gao and Huanchen Zhang Blitzcrank: Fast Semantic Compression for In-Memory Online Transaction Processing . . . . . . . . . . . . . . . 2528--2540 Zhihao Chen and Tianji Yang and Yixiao Zheng and Zhao Zhang and Cheqing Jin and Aoying Zhou Spectrum: Speedy and Strictly-Deterministic Smart Contract Transactions for Blockchain Ledgers . . 2541--2554 Zihao Zhang and Huiqi Hu and Xuan Zhou and Yaofeng Tu and Weining Qian and Aoying Zhou Fast Commitment for Geo-Distributed Transactions via Decentralized Co-Coordinators . . . . . . . . . . . . 2555--2567 Alberto Lerner and Gustavo Alonso CXL and the Return of Scale-Up Database Engines . . . . . . . . . . . . . . . . 2568--2575 Shuheng Fang and Kangfei Zhao and Yu Rong and Zhixun Li and Jeffrey Xu Yu Inductive Attributed Community Search: To Learn Communities Across Graphs . . . 2576--2589 Long Yuan and Xia Li and Zi Chen and Xuemin Lin and Xiang Zhao and Wenjie Zhang I/O Efficient Label-Constrained Reachability Queries in Large Graphs . . 2590--2602 Baotong Lu and Kaisong Huang and Chieh-Jan Mike Liang and Tianzheng Wang and Eric Lo DEX: Scalable Range Indexing on Disaggregated Memory . . . . . . . . . . 2603--2616 Wei Ni and Xiaoye Miao and Xiangyu Zhao and Yangyang Wu and Shuwei Liang and Jianwei Yin Automatic Data Repair: Are We Ready to Deploy? . . . . . . . . . . . . . . . . 2617--2630 Chaokun Chang and Eric Lo and Chunxiao Ye Biathlon: Harnessing Model Resilience for Accelerating ML Inference Pipelines 2631--2640 Yuanyuan Zeng and Chenhao Ma and Yixiang Fang Distributed Shortest Distance Labeling on Large-Scale Graphs . . . . . . . . . 2641--2653 Wensheng Luo and Yixiang Fang and Chunxu Lin and Yingli Zhou Efficient Parallel D-Core Decomposition at Scale . . . . . . . . . . . . . . . . 2654--2667 Leonardo Pellegrina and Fabio Vandin Efficient Discovery of Significant Patterns with Few-Shot Resampling . . . 2668--2680
Qixu Chen and Raymond Chi-Wing Wong Robust Best Point Selection under Unreliable User Feedback . . . . . . . . 2681--2693 Audrey Cheng and Aaron Kabcenell and Jason Chan and Xiao Shi and Peter Bailis and Natacha Crooks and Ion Stoica Towards Optimal Transaction Scheduling 2694--2707 David Campos and Bin Yang and Tung Kieu and Miao Zhang and Chenjuan Guo and Christian S. Jensen QCore: Data-Efficient, On-Device Continual Calibration for Quantized Models . . . . . . . . . . . . . . . . . 2708--2721 Yalong Zhang and Rong-Hua Li and Qi Zhang and Hongchao Qin and Lu Qin and Guoren Wang Efficient Algorithms for Pseudoarboricity Computation in Large Static and Dynamic Graphs . . . . . . . 2722--2734 Meng Chen and Kai Zhang and Zhenying He and Yinan Jing and X. Sean Wang RoarGraph: a Projected Bipartite Graph for Efficient Cross-Modal Approximate Nearest Neighbor Search . . . . . . . . 2735--2749 Ju Fan and Zihui Gu and Songyue Zhang and Yuxin Zhang and Zui Chen and Lei Cao and Guoliang Li and Samuel Madden and Xiaoyong Du and Nan Tang Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL . . . . . . . . . . . . . . . . . 2750--2763 Rustam Guliyev and Aparajita Haldar and Hakan Ferhatosmanoglu D3-GNN: Dynamic Distributed Dataflow for Streaming Graph Neural Networks . . . . 2764--2777 Yunxiang Su and Shaoxu Song and Xiangdong Huang and Chen Wang and Jianmin Wang Distance-Based Outlier Query Optimization in Apache IoTDB . . . . . . 2778--2790 Jianye Yang and Sheng Fang and Zhaoquan Gu and Ziyi Ma and Xuemin Lin and Zhihong Tian TC-Match: Fast Time-Constrained Continuous Subgraph Matching . . . . . . 2791--2804 Michael Wornow and Avanika Narayan and Krista Opsahl-Ong and Quinn McIntyre and Nigam Shah and Christopher Ré Automating the Enterprise with Foundation Models . . . . . . . . . . . 2805--2812 Anxin Tian and Alexander Zhou and Yue Wang and Xun Jian and Lei Chen Efficient Index for Temporal Core Queries over Bipartite Graphs . . . . . 2813--2825 Fumiyuki Kato and Li Xiong and Shun Takagi and Yang Cao and Masatoshi Yoshikawa Uldp-FL: Federated Learning with Across-Silo User-Level Differential Privacy . . . . . . . . . . . . . . . . 2826--2839 Junyong Yang and Ming Zhong and Yuanyuan Zhu and Tieyun Qian and Mengchi Liu and Jeffrey Xu Yu Evolution Forest Index: Towards Optimal Temporal $k$-Core Component Search via Time-Topology Isomorphic Computation . . 2840--2853 Yuxin Ma and Ping Gong and Tianming Wu and Jiawei Yi and Chengru Yang and Cheng Li and Qirong Peng and Guiming Xie and Yongcheng Bao and Haifeng Liu and Yinlong Xu Eliminating Data Processing Bottlenecks in GNN Training over Large Graphs via Two-level Feature Compression . . . . . 2854--2866 Douglas B. Rumbaugh and Dong Xie and Zhuoyue Zhao Towards Systematic Index Dynamization 2867--2879 Dennis Treder-Tschechlov and Manuel Fritz and Holger Schwarz and Bernhard Mitschang Ensemble Clustering Based on Meta-Learning and Hyperparameter Optimization . . . . . . . . . . . . . . 2880--2892 Chenjuan Guo and Ronghui Xu and Bin Yang and Ye Yuan and Tung Kieu and Yan Zhao and Christian S. Jensen Efficient Stochastic Routing in Path-Centric Uncertain Road Networks . . 2893--2905 Angela Bonifati and Filip Murlak and Yann Ramusat Transforming Property Graphs . . . . . . 2906--2918 Yushi Sun and Hao Xin and Kai Sun and Yifan Ethan Xu and Xiao Yang and Xin Luna Dong and Nan Tang and Lei Chen Are Large Language Models a Good Replacement of Taxonomies? . . . . . . . 2919--2932 Yalong Zhang and Rong-Hua Li and Qi Zhang and Hongchao Qin and Guoren Wang Efficient Algorithms for Density Decomposition on Large Static and Dynamic Graphs . . . . . . . . . . . . . 2933--2945 Yingli Zhou and Yixiang Fang and Chenhao Ma and Tianci Hou and Xin Huang Efficient Maximal Motif-Clique Enumeration over Large Heterogeneous Information Networks . . . . . . . . . . 2946--2959 Zeang Sheng and Wentao Zhang and Yangyu Tao and Bin Cui OUTRE: an OUT-of-Core De-REdundancy GNN Training Framework for Massive Graphs within A Single Machine . . . . . . . . 2960--2973 Chenguang Fang and Zijie Chen and Shaoxu Song and Xiangdong Huang and Chen Wang and Jianmin Wang On Reducing Space Amplification with Multi-Column Compaction in Apache IoTDB 2974--2986 Sebastian Schmidl and Felix Naumann and Thorsten Papenbrock AutoTSAD: Unsupervised Holistic Anomaly Detection for Time Series Data . . . . . 2987--3002 Zheng Chen and Feng Zhang and Yang Chen and Xiaokun Fang and Guanyu Feng and Xiaowei Zhu and Wenguang Chen and Xiaoyong Du Enabling Window-Based Monotonic Graph Analytics with Reusable Transitional Results for Pattern-Consistent Queries 3003--3016 Maria Ramos and João Azevedo and Kyle Kingsbury and José Pereira and Tânia Esteves and Ricardo Macedo and João Paulo When Amnesia Strikes: Understanding and Reproducing Data Loss Bugs with Fault Injection . . . . . . . . . . . . . . . 3017--3030 Leixia Wang and Qingqing Ye and Haibo Hu and Xiaofeng Meng PriPL-Tree: Accurate Range Query for Arbitrary Distribution under Local Differential Privacy . . . . . . . . . . 3031--3044 Yu Sun and Jingyu Zhu and Xiao Xu and Xian Xu and Yuyao Sun and Shaoxu Song and Xiang Li and Xiaojie Yuan Win-Win: On Simultaneous Clustering and Imputing over Incomplete Data . . . . . 3045--3057 Shun Takagi and Li Xiong and Fumiyuki Kato and Yang Cao and Masatoshi Yoshikawa HRNet: Differentially Private Hierarchical and Multi-Resolution Network for Human Mobility Data Synthesization . . . . . . . . . . . . . 3058--3071 Sijie Dong and Qitong Wang and Soror Sahri and Themis Palpanas and Divesh Srivastava Efficiently Mitigating the Impact of Data Drift on Machine Learning Pipelines 3072--3081