Paper Digest: KDD 2020 Highlights

August 20, 2020November 10, 2020 admin

Download KDD-2020-Paper-Digests.pdf– highlights of all KDD-2020 papers. Readers can also choose to read this highlight article on our console, which allows users to filter out papers using keywords and find related papers and patents.

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) is one of the top data mining conferences in the world. In 2020, it is to be held virtually due to covid-19 pandemic.

To help the community quickly catch up on the work presented in this conference, Paper Digest Team processed all accepted papers, and generated one highlight sentence (typically the main topic) for each paper. Readers are encouraged to read these machine generated highlights / summaries to quickly get the main idea of each paper.

If you do not want to miss any interesting academic paper, you are welcome to sign up our free daily paper digest service to get updates on new papers published in your area every day. You are also welcome to follow us on Twitter and Linkedin to get updated with new conference digests.

Paper Digest Team
team@paperdigest.org

TABLE 1: Paper Digest: KDD 2020 Highlights

	Title	Authors	Highlight
1	Learning Effective Road Network Representation with Hierarchical Graph Neural Networks	Ning Wu; Xin Wayne Zhao; Jingyuan Wang; Dayan Pan;	In this paper, we propose a novel Hierarchical Road Network Representation model, named HRNR, by constructing a three-level neural architecture, corresponding to "functional zone", "structural regions" and "road segments", respectively.
2	Interpretability is a Kind of Safety: An Interpreter-based Ensemble for Adversary Defense	Jingyuan Wang; Yufan Wu; Mingxuan Li; Xin Lin; Junjie Wu; Chao Li;	In light of this, in this paper, we first reveal a gradient-based correlation between sensitivity analysis-based DNN interpreters and the generation process of adversarial examples, which indicates the Achilles’s heel of adversarial attacks and sheds light on linking together the two long-standing challenges of DNN: fragility and unexplainability. We then propose an interpreter-based ensemble framework called X-Ensemble for robust adversary defense. X-Ensemble adopts a novel detection-rectification process and features in building multiple sub-detectors and a rectifier upon various types of interpretation information toward target classifiers.
3	Higher-order Clustering in Complex Heterogeneous Networks	Aldo G. Carranza; Ryan A. Rossi; Anup Rao; Eunyee Koh;	In this work, we propose a framework for higher-order spectral clustering in heterogeneous networks through the notions of typed graphlets and typed-graphlet conductance.
4	Preserving Dynamic Attention for Long-Term Spatial-Temporal Prediction	Haoxing Lin; Rufan Bai; Weijia Jia; Xinyu Yang; Yongjian You;	To address these issues, we propose a Dynamic Switch-Attention Network (DSAN) with a novel Multi-Space Attention (MSA) mechanism that measures the correlations between inputs and outputs explicitly.
5	Learning to Extract Attribute Value from Product via Question Answering: A Multi-task Approach	Qifan Wang; Li Yang; Bhargav Kanagal; Sumit Sanghai; D. Sivakumar; Bin Shu; Zac Yu; Jon Elsas;	In this work, we propose a novel approach for Attribute Value Extraction via Question Answering (AVEQA) using a multi-task framework.
6	Kernel Assisted Learning for Personalized Dose Finding	Liangyu Zhu; Wenbin Lu; Michael R. Kosorok; Rui Song;	In this article, we propose a kernel assisted learning method for estimating the optimal individualized dose rule.
7	Graph Structure Learning for Robust Graph Neural Networks	Wei Jin; Yao Ma; Xiaorui Liu; Xianfeng Tang; Suhang Wang; Jiliang Tang;	Therefore, in this paper, we explore these properties to defend adversarial attacks on graphs.
8	An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph	Jiarui Jin; Jiarui Qin; Yuchen Fang; Kounianhua Du; Weinan Zhang; Yong Yu; Zheng Zhang; Alexander J. Smola;	In this paper, we propose an end-to-end Neighborhood-based Interaction Model for Recommendation (NIRec) to address above problems.
9	Directional Multivariate Ranking	Nan Wang; Hongning Wang;	In this work, we propose a directional multi-aspect ranking criterion to enable a holistic ranking of items with respect to multiple aspects.
10	Truth Discovery against Strategic Sybil Attack in Crowdsourcing	Yue Wang; Ke Wang; Chunyan Miao;	In this paper, we propose a novel approach, called TDSSA (Truth Discovery against Strategic Sybil Attack), to defend against strategic Sybil attack.
11	Partial Multi-Label Learning via Probabilistic Graph Matching Mechanism	Gengyu Lyu; Songhe Feng; Yidong Li;	In this paper, we interpret such assignments as instance-to-label matchings, and formulate the task of PML as a matching selection problem.
12	Spectrum-Guided Adversarial Disparity Learning	Zhe Liu; Lina Yao; Lei Bai; Xianzhi Wang; Can Wang;	In this work, we propose a novel end-to-end knowledge directed adversarial learning framework, which portrays the class-conditioned intraclass disparity using two competitive encoding distributions and learns the purified latent codes by denoising learned disparity.
13	Attention and Memory-Augmented Networks for Dual-View Sequential Learning	Yong He; Cheng Wang; Nan Li; Zhenyu Zeng;	We develop an AMANet (Attention and Memory-Augmented Networks) architecture by integrating both attention and memory to solve asynchronous multi-view learning problem in general, and we focus on experiments in dual-view sequences in this paper.
14	Semantic Search in Millions of Equations	Lukas Pfahler; Katharina Morik;	Hence, we propose a new approach for retrieval of mathematical expressions based on machine learning. To train our models, we collect a huge dataset with over 29 million mathematical expressions from over 900,000 publications published on arXiv.org.
15	SSumM: Sparse Summarization of Massive Graphs	Kyuhan Lee; Hyeonsoo Jo; Jihoon Ko; Sungsu Lim; Kijung Shin;	In this work, we propose SSumM, a scalable and effective graph-summarization algorithm that yields a sparse summary graph.
16	Rethinking Pruning for Accelerating Deep Inference At the Edge	Dawei Gao; Xiaoxi He; Zimu Zhou; Yongxin Tong; Ke Xu; Lothar Thiele;	To rectify such drawbacks, we propose entropy-based pruning, a new regularizer that can be seamlessly integrated into existing network pruning algorithms.
17	Compositional Embeddings Using Complementary Partitions for Memory-Efficient Recommendation Systems	Hao-Jun Michael Shi; Dheevatsa Mudigere; Maxim Naumov; Jiyan Yang;	We propose a novel approach for reducing the embedding size in an end-to-end fashion by exploiting complementary partitions of the category set to produce a unique embedding vector for each category without explicit definition.
18	Structural Patterns and Generative Models of Real-world Hypergraphs	Manh Tuan Do; Se-eun Yoon; Bryan Hooi; Kijung Shin;	In this work, we empirically study a number of real-world hypergraph datasets across various domains.
19	Efficient Algorithm for the b-Matching Graph	Yasuhiro Fujiwara; Atsutoshi Kumagai; Sekitoshi Kanai; Yasutoshi Ida; Naonori Ueda;	Our proposal, b-dash, can efficiently construct a b-matching graph because of its two key techniques: (1) it prunes unnecessary update messages in determining edges and (2) it incrementally computes edge weights by exploiting the Sherman-Morrison formula.
20	Isolation Distributional Kernel: A New Tool for Kernel based Anomaly Detection	Kai Ming Ting; Bi-Cun Xu; Takashi Washio; Zhi-Hua Zhou;	We introduce Isolation Distributional Kernel as a new way to measure the similarity between two distributions.
21	NodeAug: Semi-Supervised Node Classification with Data Augmentation	Yiwei Wang; Wei Wang; Yuxuan Liang; Yujun Cai; Juncheng Liu; Bryan Hooi;	By using Data Augmentation (DA), we present a new method to enhance Graph Convolutional Networks (GCNs), that are the state-of-the-art models for semi-supervised node classification.
22	An Embarrassingly Simple Approach for Trojan Attack in Deep Neural Networks	Ruixiang Tang; Mengnan Du; Ninghao Liu; Fan Yang; Xia Hu;	In this paper, we investigate a specific security problem called trojan attack, which aims to attack deployed DNN systems relying on the hidden trigger patterns inserted by malicious hackers.
23	Kronecker Attention Networks	Hongyang Gao; Zhengyang Wang; Shuiwang Ji;	In this work, we propose to avoid flattening by assuming the data follow matrix-variate normal distributions.
24	GRACE: Generating Concise and Informative Contrastive Sample to Explain Neural Network Model’s Prediction	Thai Le; Suhang Wang; Dongwon Lee;	To mitigate this limitation, therefore, we borrow two notable ideas (i.e., "explanation by intervention" from causality and "explanation are contrastive" from philosophy) and propose a novel solution, named as GRACE, that better explains neural network models’ predictions for tabular datasets.
25	Hierarchical Attention Propagation for Healthcare Representation Learning	Muhan Zhang; Christopher R. King; Michael Avidan; Yixin Chen;	In this paper, we propose Hierarchical Attention Propagation (HAP), a novel medical ontology embedding model that hierarchically propagate attention across the entire ontology structure, where a medical concept adaptively learns its embedding from all other concepts in the hierarchy instead of only its ancestors.
26	SCE: Scalable Network Embedding from Sparsest Cut	Shengzhong Zhang; Zengfeng Huang; Haicang Zhou; Ziang Zhou;	In this paper, we propose SCE for unsupervised network embedding only using negative samples for training.
27	Local Community Detection in Multiple Networks	Dongsheng Luo; Yuchen Bian; Yaowei Yan; Xiao Liu; Jun Huan; Xiang Zhang;	In this paper, we propose a novel RWM (Random Walk in Multiple networks) model to find relevant local communities in all networks for a given query node set from one network.
28	A Block Decomposition Algorithm for Sparse Optimization	Ganzhao Yuan; Li Shen; Wei-Shi Zheng;	This paper considers a new block decomposition algorithm that combines the effectiveness of combinatorial search methods and the efficiency of coordinate descent methods.
29	Adversarial Infidelity Learning for Model Interpretation	Jian Liang; Bing Bai; Yuren Cao; Kun Bai; Fei Wang;	In this paper, we propose a Model-agnostic Effective Efficient Direct (MEED) IFS framework for model interpretation, mitigating concerns about sanity, combinatorial shortcuts, model identifiability, and information transmission.
30	Grounding Visual Concepts for Zero-Shot Event Detection and Event Captioning	Zhihui Li; Xiaojun Chang; Lina Yao; Shirui Pan; Ge Zongyuan; Huaxiang Zhang;	Accordingly, in this paper, we propose a method of grounding visual concepts for large-scale Multimedia Event Detection (MED) and Multimedia Event Captioning (MEC) in zero-shot setting.
31	How to Count Triangles, without Seeing the Whole Graph	Suman K. Bera; C. Seshadhri;	Despite these challenges, we design a provable and practical algorithm, TETRIS, for triangle counting in this model.
32	Incremental Lossless Graph Summarization	Jihoon Ko; Yunbum Kook; Kijung Shin;	In this work, we propose MoSSo, the first incremental algorithm for lossless summarization of fully dynamic graphs.
33	From Online to Non-i.i.d. Batch Learning	Yufei Tao; Shangqi Lu;	We present a set of techniques to utilize an online algorithm as a black box to perform batch learning in the absence of the i.i.d. assumption.
34	Towards Deeper Graph Neural Networks	Meng Liu; Hongyang Gao; Shuiwang Ji;	In this work, we study this observation systematically and develop new insights towards deeper graph neural networks.
35	Laplacian Change Point Detection for Dynamic Graphs	Shenyang Huang; Yasmeen Hitti; Guillaume Rabusseau; Reihaneh Rabbany;	In this paper, we focus on change point detection in dynamic graphs and address two main challenges associated with this problem: I) how to compare graph snapshots across time, II) how to capture temporal dependencies.
36	Learning Transferrable Parameters for Long-tailed Sequential User Behavior Modeling	Jianwen Yin; Chenghao Liu; Weiqing Wang; Jianling Sun; Steven C.H. Hoi;	In this work, we argue that focusing on tail users could bring more benefits and address the long tails issue by learning transferrable parameters from both optimization and feature perspectives.
37	TranSlider: Transfer Ensemble Learning from Exploitation to Exploration	Kuo Zhong; Ying Wei; Chun Yuan; Haoli Bai; Junzhou Huang;	In this paper, we introduce the concept of transfer ensemble learning, a new direction to tackle the over-fitting of transfer strategies.
38	InFoRM: Individual Fairness on Graph Mining	Jian Kang; Jingrui He; Ross Maciejewski; Hanghang Tong;	This paper presents the first principled study of Individual Fairness on gRaph Mining (InFoRM).
39	Local Motif Clustering on Time-Evolving Graphs	Dongqi Fu; Dawei Zhou; Jingrui He;	To bridge this gap, in this paper, we propose a novel framework, Local Motif Clustering on Time-Evolving Graphs (L-MEGA), which provides the evolution pattern of the local motif cluster in an effective and efficient way.
40	A Data-Driven Graph Generative Model for Temporal Interaction Networks	Dawei Zhou; Lecheng Zheng; Jiawei Han; Jingrui He;	To address these challenges, we propose an end-to-end deep generative framework named TagGen.
41	Recurrent Networks for Guided Multi-Attention Classification	Xin Dai; Xiangnan Kong; Tian Guo; John Boaz Lee; Xinyue Liu; Constance Moore;	In this paper, we study the problem of guided multi-attention classification, the goal of which is to achieve high accuracy under the dual constraints of (1) small sample size, and (2) multiple ROIs for each image.
42	Vulnerability vs. Reliability: Disentangled Adversarial Examples for Cross-Modal Learning	Chao Li; Haoteng Tang; Cheng Deng; Liang Zhan; Wei Liu;	In this paper, we propose novel Disentangled Adversarial examples for Cross-Modal learning, dubbed DACM.
43	XGNN: Towards Model-Level Explanations of Graph Neural Networks	Hao Yuan; Jiliang Tang; Xia Hu; Shuiwang Ji;	In this work, we propose a novel approach, known as XGNN, to interpret GNNs at the model-level.
44	CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data	Xiang Li; Ben Kao; Caihua Shan; Dawei Yin; Martin Ester;	We propose the algorithm CAST that applies trace Lasso to regularize the coefficient matrix.
45	INPREM: An Interpretable and Trustworthy Predictive Model for Healthcare	Xianli Zhang; Buyue Qian; Shilei Cao; Yang Li; Hang Chen; Yefeng Zheng; Ian Davidson;	To address this, in this paper, we propose an interpretable and trustworthy predictive model~(INPREM) for healthcare.
46	Policy-GNN: Aggregation Optimization for Graph Neural Networks	Kwei-Herng Lai; Daochen Zha; Kaixiong Zhou; Xia Hu;	To address the above challenges, we propose Policy-GNN, a meta-policy framework that models the sampling procedure and message passing of GNNs into a combined learning process.
47	Malicious Attacks against Deep Reinforcement Learning Interpretations	Mengdi Huai; Jianhui Sun; Renqin Cai; Liuyi Yao; Aidong Zhang;	Specifically, we introduce the first study of the adversarial attacks against DRL interpretations, and propose an optimization framework based on which the optimal adversarial attack strategy can be derived.
48	Disentangled Self-Supervision in Sequential Recommenders	Jianxin Ma; Chang Zhou; Hongxia Yang; Peng Cui; Xin Wang; Wenwu Zhu;	In this paper, we study the problem of mining extra signals for supervision by looking at the longer-term future.
49	DETERRENT: Knowledge Guided Graph Attention Network for Detecting Healthcare Misinformation	Limeng Cui; Haeseung Seo; Maryam Tabar; Fenglong Ma; Suhang Wang; Dongwon Lee;	In this work, to address these shortcomings, we propose a novel knowledge guided graph attention network for detecting health misinformation better.
50	MultiImport: Inferring Node Importance in a Knowledge Graph from Multiple Input Signals	Namyong Park; Andrey Kan; Xin Luna Dong; Tong Zhao; Christos Faloutsos;	In this paper, we develop an end-to-end model MultiImport, which infers latent node importance from multiple, potentially overlapping, input signals.
51	Geodesic Forests	Meghana Madhyastha; Gongkai Li; Veronika Strnadová-Neeley; James Browne; Joshua T. Vogelstein; Randal Burns; Carey E. Priebe;	We propose an unsupervised random forest approach called geodesic forests (GF) to geodesic distance estimation in linear and nonlinear manifolds with noise.
52	Z-Miner: An Efficient Method for Mining Frequent Arrangements of Event Intervals	Zed Lee; Tony Lindgren; Panagiotis Papapetrou;	In this paper, we propose Z-Miner, a novel algorithm for solving this problem that addresses the deficiencies of existing competitors by employing two novel data structures: Z-Table, a hierarchical hash-based data structure for time-efficient candidate generation and support count, and Z-Arrangement, a data structure for efficient memory consumption.
53	Imputing Various Incomplete Attributes via Distance Likelihood Maximization	Shaoxu Song; Yu Sun;	In this paper, we propose to study the distance models that predict distances between tuples for missing data imputation.
54	WeightGrad: Geo-Distributed Data Analysis Using Quantization for Faster Convergence and Better Accuracy	Syeda Nahida Akter; Muhammad Abdullah Adnan;	Our goal in this work is to design a geo-distributed Deep-Learning system that (1) ensures efficient and faster communication over LAN and WAN and (2) maintain accuracy and convergence for complex DNNs with billions of parameters.
55	Feature-Induced Manifold Disambiguation for Multi-View Partial Multi-label Learning	Jing-Han Wu; Xuan Wu; Qing-Guo Chen; Yao Hu; Min-Ling Zhang;	Accordingly, the problem of multi-view partial multi-label learning (MVPML) is studied in this paper, where each example is assumed to be presented by multiple feature vectors while associated with multiple candidate labels which are only partially valid.
56	MinSearch: An Efficient Algorithm for Similarity Search under Edit Distance	Haoyu Zhang; Qin Zhang;	In this paper we propose a novel algorithm for edit similarity search named MinSearch.
57	Mining Large Quasi-cliques with Quality Guarantees from Vertex Neighborhoods	Aritra Konar; Nicholas D. Sidiropoulos;	In this work, we formally establish that two recurring characteristics of real-world graphs, namely heavy-tailed degree distributions and large clustering coefficients, imply the existence of substantially large vertex neighborhoods with high edge-density.
58	Residual Correlation in Graph Neural Network Regression	Junteng Jia; Austion R. Benson;	Here, we address this problem with an interpretable and efficient framework that can improve any graph neural network architecture simply by exploiting correlation structure in the regression residuals.
59	Towards Fair Truth Discovery from Biased Crowdsourced Answers	Yanying Li; Haipei Sun; Wendy Hui Wang;	To address this challenge, in this paper, first, we define a new fairness notion named θ-disparity for truth discovery. Intuitively, ?-disparity bounds the difference in the probabilities that the truth of both protected and unprotected groups being predicted to be positive. Second, we design three fairness enhancing methods, namely Pre-TD, FairTD, and Post-TD, for truth discovery.
60	AutoShuffleNet: Learning Permutation Matrices via an Exact Lipschitz Continuous Penalty in Deep Convolutional Neural Networks	Jiancheng Lyu; Shuai Zhang; Yingyong Qi; Jack Xin;	In this paper, we propose to automate channel shuffling by learning permutation matrices in network training.
61	MoFlow: An Invertible Flow Model for Generating Molecular Graphs	Chengxi Zang; Fei Wang;	In this paper, we propose MoFlow, a flow-based graph generative model to learn invertible mappings between molecular graphs and their latent representations.
62	Parallel DNN Inference Framework Leveraging a Compact RISC-V ISA-based Multi-core System	Yipeng Zhang; Bo Du; Lefei Zhang; Jia Wu;	Accordingly, this paper proposes a collaborative RISC-V multi-core system for Deep Neural Network (DNN) accelerators.
63	Missing Value Imputation for Mixed Data via Gaussian Copula	Yuxuan Zhao; Madeleine Udell;	This paper proposes a new semiparametric algorithm to impute missing values, with no tuning parameters.
64	HiTANet: Hierarchical Time-Aware Attention Networks for Risk Prediction on Electronic Health Records	Junyu Luo; Muchao Ye; Cao Xiao; Fenglong Ma;	To leverage time information for risk prediction in a more reasonable way, we propose a new hierarchical time-aware attention network, named HiTANet, which imitates the decision making process of doctors inrisk prediction.
65	Personalized PageRank to a Target Node, Revisited	Hanzhi Wang; Zhewei Wei; Junhao Gan; Sibo Wang; Zengfeng Huang;	In this paper, we consider the single-target PPR query, which measures the opposite direction of importance for PPR.
66	Edge-consensus Learning: Deep Learning on P2P Networks with Nonhomogeneous Data	Kenta Niwa; Noboru Harada; Guoqiang Zhang; W. Bastiaan Kleijn;	An effective Deep Neural Network (DNN) optimization algorithm that can use decentralized data sets over a peer-to-peer (P2P) network is proposed.
67	Deep Learning of High-Order Interactions for Protein Interface Prediction	Yi Liu; Hao Yuan; Lei Cai; Shuiwang Ji;	In this work, we propose to formulate the protein interface prediction as a 2D dense prediction problem.
68	MAMO: Memory-Augmented Meta-Optimization for Cold-start Recommendation	Manqing Dong; Feng Yuan; Lina Yao; Xiwei Xu; Liming Zhu;	In this paper, we design two memory matrices that can store task-specific memories and feature-specific memories.
69	Finding Effective Geo-social Group for Impromptu Activities with Diverse Demands	Lu Chen; Chengfei Liu; Rui Zhou; Jiajie Xu; Jeffrey Xu Yu; Jianxin Li;	In this paper, we propose a novel geo-social group model, equipped with elegant keyword constraints, to fill this gap.
70	Representing Temporal Attributes for Schema Matching	Yinan Mei; Shaoxu Song; Yunsu Lee; Jungho Park; Soo-Hyung Kim; Sungmin Yi;	In this paper, we argue to order the values in an attribute A by some time attribute T as a time series.
71	Estimating Properties of Social Networks via Random Walk considering Private Nodes	Kazuki Nakajima; Kazuyuki Shudo;	Here we design random walk-based algorithms to accurately estimate properties without any problems caused by private nodes.
72	ASGN: An Active Semi-supervised Graph Neural Network for Molecular Property Prediction	Zhongkai Hao; Chengqiang Lu; Zhenya Huang; Hao Wang; Zheyuan Hu; Qi Liu; Enhong Chen; Cheekong Lee;	Here we propose a novel framework called Active Semi-supervised Graph Neural Network (ASGN) by incorporating both labeled and unlabeled molecules.
73	Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks	Zonghan Wu; Shirui Pan; Guodong Long; Jing Jiang; Xiaojun Chang; Chengqi Zhang;	In this paper, we propose a general graph neural network framework designed specifically for multivariate time series data.
74	Learning Opinion Dynamics From Social Traces	Corrado Monti; Gianmarco De Francisci Morales; Francesco Bonchi;	In this work we propose an inference mechanism for fitting a generative, agent-like model of opinion dynamics to real-world social traces.
75	Enterprise Cooperation and Competition Analysis with a Sign-Oriented Preference Network	Le Dai; Yu Yin; Chuan Qin; Tong Xu; Xiangnan He; Enhong Chen; Hui Xiong;	To this end, in this paper, we provide a large-scale data driven analysis on the cooperative and competitive relationships among companies in a Sign-oriented Preference Network (SOPN).
76	BLOB: A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals	Otmane Sakhi; Stephen Bonner; David Rohde; Flavian Vasile;	In this paper, we propose Bayesian Latent Organic Bandit model (BLOB), a probabilistic approach to combine the ‘organic’ and ‘bandit’ signals in order to improve the estimation of recommendation quality.
77	AutoST: Efficient Neural Architecture Search for Spatio-Temporal Prediction	Ting Li; Junbo Zhang; Kainan Bao; Yuxuan Liang; Yexin Li; Yu Zheng;	In this paper, we study Neural Architecture Search (NAS) for spatio-temporal prediction and propose an efficient spatio-temporal neural architecture search method, entitled AutoST.
78	COMPOSE: Cross-Modal Pseudo-Siamese Network for Patient Trial Matching	Junyi Gao; Cao Xiao; Lucas M. Glass; Jimeng Sun;	In this paper, we proposed CrOss-Modal PseudO-SiamEse network (COMPOSE) to address these challenges for patient-trial matching.
79	Discovering Succinct Pattern Sets Expressing Co-Occurrence and Mutual Exclusivity	Jonas Fischer; Jilles Vreeken;	As the search space for the optimal model is enormous and unstructured, we propose Mexican, a heuristic algorithm to efficiently discover high quality sets of patterns of co-occurences and mutual exclusivity.
80	TIPRDC: Task-Independent Privacy-Respecting Data Crowdsourcing Framework for Deep Learning with Anonymized Intermediate Representations	Ang Li; Yixiao Duan; Huanrui Yang; Yiran Chen; Jianlei Yang;	To tackle the case where the learning task may be unknown or changing, we present TIPRDC, a task-independent privacy-respecting data crowdsourcing framework with anonymized intermediate representation.
81	AutoGrow: Automatic Layer Growing in Deep Convolutional Networks	Wei Wen; Feng Yan; Yiran Chen; Hai Li;	We propose robust growing and stopping policies to generalize to different network architectures and datasets.
82	Curb-GAN: Conditional Urban Traffic Estimation through Spatio-Temporal Generative Adversarial Networks	Yingxue Zhang; Yanhua Li; Xun Zhou; Xiangnan Kong; Jun Luo;	To tackle these challenges, we propose a novel Conditional Urban Traffic Generative Adversarial Network (Curb-GAN), which provides traffic estimations in consecutive time slots based on different (unprecedented) travel demands, thus enables urban planners to accurately evaluate urban plans before deploying them.
83	Incremental Mobile User Profiling: Reinforcement Learning with Spatial Knowledge Graph for Modeling Event Streams	Pengyang Wang; Kunpeng Liu; Lu Jiang; Xiaolin Li; Yanjie Fu;	We propose to formulate the problem into a reinforcement learning task, where an agent is a next-visit planner, an action is a POI that a user will visit next, and the state of environment is a fused representation of a user and spatial entities (e.g., POIs, activity types, functional zones).
84	Identifying Sepsis Subphenotypes via Time-Aware Multi-Modal Auto-Encoder	Changchang Yin; Ruoqi Liu; Dongdong Zhang; Ping Zhang;	However, most sepsis subtyping studies ignore the temporality of EHR data and suffer from missing values. In this paper, we propose a new sepsis subtyping framework to address the two issues.
85	A Causal Look at Statistical Definitions of Discrimination	Elias Chaibub Neto;	Here, we investigate these fairness criteria from a causality perspective.
86	Targeted Data-driven Regularization for Out-of-Distribution Generalization	Mohammad Mahdi Kamani; Sadegh Farhang; Mehrdad Mahdavi; James Z. Wang;	In this paper, we propose a unified data-driven regularization approach to learn a generalizable model from biased data.
87	Neural Dynamics on Complex Networks	Chengxi Zang; Fei Wang;	To address these challenges, we propose to combine Ordinary Differential Equation Systems (ODEs) and Graph Neural Networks (GNNs) to learn continuous-time dynamics on complex networks in a data-driven manner.
88	Grammatically Recognizing Images with Tree Convolution	Guangrun Wang; Guangcong Wang; Keze Wang; Xiaodan Liang; Liang Lin;	Attempting to tackle this problem, this paper proposes a simple yet effective tree convolution (TreeConv) operation for deep neural networks.
89	Generic Outlier Detection in Multi-Armed Bandit	Yikun Ban; Jingrui He;	In this paper, we study the problem of outlier arm detection in multi-armed bandit settings, which finds plenty of applications in many high-impact domains such as finance, healthcare, and online advertising.
90	Robust Spammer Detection by Nash Reinforcement Learning	Yingtong Dou; Guixiang Ma; Philip S. Yu; Sihong Xie;	To address the challenges, we formulate a minimax game where the spammers and spam detectors compete with each other on their practical goals that are not solely based on detection accuracy.
91	Mining Persistent Activity in Continually Evolving Networks	Caleb Belth; Xinyi Zheng; Danai Koutra;	In this work, we propose the problem of mining activity that persists through time in continually evolving networks-i.e., activity that repeatedly and consistently occurs.
92	Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction	Qingquan Song; Dehua Cheng; Hanning Zhou; Jiyan Yang; Yuandong Tian; Xia Hu;	To address these challenges, we propose an automated interaction architecture discovering framework for CTR prediction named AutoCTR.
93	High-Dimensional Similarity Search with Quantum-Assisted Variational Autoencoder	Nicholas Gao; Max Wilson; Thomas Vandal; Walter Vinci; Ramakrishna Nemani; Eleanor Rieffel;	We show how to construct a space-efficient search index based on the latent space representation of a QVAE.
94	Off-policy Bandits with Deficient Support	Noveen Sachdeva; Yi Su; Thorsten Joachims;	To overcome this gap between theory and applications, we identify three approaches that provide various guarantees for IPS-based learning despite the inherent limitations of support-deficient data: restricting the action space, reward extrapolation, and restricting the policy space.
95	Adaptive Graph Encoder for Attributed Graph Embedding	Ganqu Cui; Jie Zhou; Cheng Yang; Zhiyuan Liu;	To address these issues, we propose Adaptive Graph Encoder (AGE), a novel attributed graph embedding framework.
96	NetTrans: Neural Cross-Network Transformation	Si Zhang; Hanghang Tong; Yinglong Xia; Liang Xiong; Jiejun Xu;	In this paper, we address these limitations and tackle cross-network node associations from a new angle, i.e., cross-network transformation.
97	Redundancy-Free Computation for Graph Neural Networks	Zhihao Jia; Sina Lin; Rex Ying; Jiaxuan You; Jure Leskovec; Alex Aiken;	Here we propose Hierarchically Aggregated computation Graphs(HAGs), a new GNN representation technique that explicitly avoids redundancy by managing intermediate aggregation results hierarchically and eliminates repeated computations and unnecessary data transfers in GNN training and inference.
98	Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion	Kun Zhou; Wayne Xin Zhao; Shuqing Bian; Yuanhang Zhou; Ji-Rong Wen; Jingsong Yu;	To address these issues, we incorporate both word-oriented and entity-oriented knowledge graphs~(KG) to enhance the data representations in CRSs, and adopt Mutual Information Maximization to align the word-level and entity-level semantic spaces.
99	Sliding Sketches: A Framework using Time Zones for Data Stream Processing in Sliding Windows	Xiangyang Gou; Long He; Yinda Zhang; Ke Wang; Xilai Liu; Tong Yang; Yi Wang; Bin Cui;	In this paper, we propose a generic framework, namely Sliding sketches, which can be applied to many existing solutions for the above three queries, and enable them to support queries in sliding windows.
100	STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths	Yue Yu; Yinghao Li; Jiaming Shen; Hao Feng; Jimeng Sun; Chao Zhang;	We propose a self-supervised taxonomy expansion model named STEAM, which leverages natural supervision in the existing taxonomy for expansion.
101	Probabilistic Metric Learning with Adaptive Margin for Top-K Recommendation	Chen Ma; Liheng Ma; Yingxue Zhang; Ruiming Tang; Xue Liu; Mark Coates;	To tackle this, we develop a distance-based recommendation model with several novel aspects: (i) each user and item are parameterized by Gaussian distributions to capture the learning uncertainties; (ii) an adaptive margin generation scheme is proposed to generate the margins regarding different training triplets; (iii) explicit user-user/item-item similarity modeling is incorporated in the objective function.
102	Re-identification Attack to Privacy-Preserving Data Analysis with Noisy Sample-Mean	Du Su; Hieu Tri Huynh; Ziao Chen; Yi Lu; Wenmiao Lu;	This paper studies the hazard of re-identification of entire class caused by revealing a noisy sample mean of the class.
103	BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision	Chen Liang; Yue Yu; Haoming Jiang; Siawpeng Er; Ruijia Wang; Tuo Zhao; Chao Zhang;	To address this challenge, we propose a new computational framework — BOND, which leverages the power of pre-trained language models (e.g., BERT and RoBERTa) to improve the prediction performance of NER models.
104	Graph Structural-topic Neural Network	Qingqing Long; Yilun Jin; Guojie Song; Yi Li; Wei Lin;	Correspondingly, in this paper, we propose Graph Structural topic Neural Network, abbreviated GraphSTONE 1, a GCN model that utilizes topic models of graphs, such that the structural topics capture indicative graph structures broadly from a probabilistic aspect rather than merely a few structures.
105	Correlation Networks for Extreme Multi-label Text Classification	Guangxu Xun; Kishlay Jha; Jianhui Sun; Aidong Zhang;	This paper develops the Correlation Networks (CorNet) architecture for the extreme multi-label text classification (XMTC) task, where the objective is to tag an input text sequence with the most relevant subset of labels from an extremely large label set.
106	Predicting Temporal Sets with Deep Neural Networks	Le Yu; Leilei Sun; Bowen Du; Chuanren Liu; Hui Xiong; Weifeng Lv;	In this paper, we propose an integrated solution based on the deep neural networks for temporal sets prediction.
107	FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents	Bill Yuchen Lin; Ying Sheng; Nguyen Vo; Sandeep Tata;	In this paper, we present a novel two-stage neural approach, named FreeDOM, which overcomes both these limitations.
108	SEAL: Learning Heuristics for Community Detection with Generative Adversarial Networks	Yao Zhang; Yun Xiong; Yun Ye; Tengfei Liu; Weiqiang Wang; Yangyong Zhu; Philip S. Yu;	In this paper, we instead study the semi-supervised community detection problem where we are given several communities in a network as training data and aim to discover more communities.
109	Matrix Profile XXI: A Geometric Approach to Time Series Chains Improves Robustness	Makoto Imamura; Takaaki Nakamura; Eamonn Keogh;	Inspired by observations from dynamical systems theory, this paper introduces two novel quality metrics for time series chains, directionality and graduality, to improve robustness and to enable top-K search.
110	Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks	Surgan Jandial; Ayush Chopra; Mausoom Sarkar; Piyush Gupta; Balaji Krishnamurthy; Vineeth Balasubramanian;	In this work, we introduce a new retrospective loss to improve the training of deep neural network models by utilizing the prior experience available in past model states during training.
111	Average Sensitivity of Spectral Clustering	Pan Peng; Yuichi Yoshida;	To make reliable and efficient decisions based on spectral clustering, we assess the stability of spectral clustering against edge perturbations in the input graph using the notion of average sensitivity, which is the expected size of the symmetric difference of the output clusters before and after we randomly remove edges.
112	Semi-Supervised Multi-Label Learning from Crowds via Deep Sequential Generative Model	Wanli Shi; Victor S. Sheng; Xiang Li; Bin Gu;	In this paper, we propose a deep generative model to describe the label generation process for this semi-supervised multi-label learning problem.
113	GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training	Jiezhong Qiu; Qibin Chen; Yuxiao Dong; Jing Zhang; Hongxia Yang; Ming Ding; Kuansan Wang; Jie Tang;	We design GCC’s pre-training task as subgraph instance discrimination in and across networks and leverage contrastive learning to empower graph neural networks to learn the intrinsic and transferable structural representations.
114	HGCN: A Heterogeneous Graph Convolutional Network-Based Deep Learning Model Toward Collective Classification	Zhihua Zhu; Xinxin Fan; Xiaokai Chu; Jingping Bi;	To address the challenges, in this paper, we propose a novel heterogeneous graph convolutional network-based deep learning model, called HGCN, to collectively categorize the entities in HINs.
115	Handling Information Loss of Graph Neural Networks for Session-based Recommendation	Tianwen Chen; Raymond Chi-Wing Wong;	To solve the first problem, we propose a lossless encoding scheme and an edge-order preserving aggregation layer based on GRU that is dedicatedly designed to process the losslessly encoded graphs.
116	Ultrafast Local Outlier Detection from a Data Stream with Stationary Region Skipping	Susik Yoon; Jae-Gil Lee; Byung Suk Lee;	We propose a new algorithm, abbr. STARE, which identifies local regions in which data distributions hardly change and then skips updating the densities in those regions-a notion called stationary region skipping.
117	LayoutLM: Pre-training of Text and Layout for Document Image Understanding	Yiheng Xu; Minghao Li; Lei Cui; Shaohan Huang; Furu Wei; Ming Zhou;	In this paper, we propose the LayoutLM to jointly model interactions between text and layout information across scanned document images, which is beneficial for a great number of real-world document image understanding tasks such as information extraction from scanned documents.
118	Block Model Guided Unsupervised Feature Selection	Zilong Bai; Hoa Nguyen; Ian Davidson;	Here we take the novel approach of first building a block model on the graph and then using the block model for feature selection.
119	Data Compression as a Comprehensive Framework for Graph Drawing and Representation Learning	Claudia Plant; Sonja Biedermann; Christian Böhm;	Our fundamental idea is to compress the adjacency matrix by predicting the existence of an edge from the Euclidean distance between the corresponding vertices in the embedding, and to use the achieved compression as a quality measure for the embedding.
120	Joint Policy-Value Learning for Recommendation	Olivier Jeunen; David Rohde; Flavian Vasile; Martin Bompaire;	In this work, we conduct the first broad empirical study of counterfactual learning methods for recommendation, in a simulated environment.
121	FedFast: Going Beyond Average for Faster Training of Federated Recommender Systems	Khalil Muhammad; Qinqin Wang; Diarmuid O’Reilly-Morgan; Elias Tragos; Barry Smyth; Neil Hurley; James Geraci; Aonghus Lawlor;	We present a novel technique, FedFast, to accelerate distributed learning which achieves good accuracy for all users very early in the training process.
122	AM-GCN: Adaptive Multi-channel Graph Convolutional Networks	Xiao Wang; Meiqi Zhu; Deyu Bo; Peng Cui; Chuan Shi; Jian Pei;	We tackle the challenge and propose an adaptive multi-channel graph convolutional networks for semi-supervised classification (AM-GCN).
123	Discovering Approximate Functional Dependencies using Smoothed Mutual Information	Frédéric Pennerath; Panagiotis Mandros; Jilles Vreeken;	In this paper, we consider a different correction strategy and counter data sparsity using uniform priors and smoothing techniques, that leads to an efficient and robust estimating process.
124	Competitive Analysis for Points of Interest	Shuangli Li; Jingbo Zhou; Tong Xu; Hao Liu; Xinjiang Lu; Hui Xiong;	To this end, in this paper, we study how to predict the POI competitive relationship.
125	HOPS: Probabilistic Subtree Mining for Small and Large Graphs	Pascal Welke; Florian Seiffarth; Michael Kamp; Stefan Wrobel;	In this paper, we adapt sampling techniques from mathematical combinatorics to the problem of probabilistic subtree mining in arbitrary databases of many small to medium-size graphs or a single large graph.
126	The NodeHopper: Enabling Low Latency Ranking with Constraints via a Fast Dual Solver	Anton Zhernov; Krishnamurthy Dj Dvijotham; Ivan Lobov; Dan A. Calian; Michelle Gong; Natarajan Chandrashekar; Timothy A. Mann;	To address this challenge, we exploit the structure of the dual optimization problem to develop a fast solver.
127	HGMF: Heterogeneous Graph-based Fusion for Multimodal Data with Incompleteness	Jiayi Chen; Aidong Zhang;	We propose a Heterogeneous Graph-based Multimodal Fusion (HGMF) approach to enable multimodal fusion of incomplete data within a heterogeneous graph structure.
128	ST-SiameseNet: Spatio-Temporal Siamese Networks for Human Mobility Signature Identification	Huimin Ren; Menghai Pan; Yanhua Li; Xun Zhou; Jun Luo;	To deal with this challenge, in this work, we make the first attempt to match identities of human agents only from the observed location trajectory data by proposing a novel and efficient framework named Spatio-temporal Siamese Networks (ST-SiameseNet).
129	A Novel Deep Learning Model by Stacking Conditional Restricted Boltzmann Machine and Deep Neural Network	Tianyu Kang; Ping Chen; John Quackenbush; Wei Ding;	Similar to Convolution Neural Network dealing with spatially correlated features and Recurrent Neural Network with temporally correlated features, in this paper we present a novel deep learning model to tackle functionally interactive features by stacking a Conditional Restricted Boltzmann Machine and a Deep Neural Network (CRBM-DNN).
130	InfiniteWalk: Deep Network Embeddings as Laplacian Embeddings with a Nonlinearity	Sudhanshu Chanpuriya; Cameron Musco;	We study the objective in the limit as T goes to infinity, which allows us to simplify the expression of Qiu et al.
131	xGAIL: Explainable Generative Adversarial Imitation Learning for Explainable Human Decision Analysis	Menghai Pan; Weixiao Huang; Yanhua Li; Xun Zhou; Jun Luo;	This paper addresses this research gap by proposing xGAIL, the first explainable generative adversarial imitation learning framework.
132	Catalysis Clustering with GAN by Incorporating Domain Knowledge	Olga Andreeva; Wei Li; Wei Ding; Marieke Kuijjer; John Quackenbush; Ping Chen;	In this work we propose a GAN-based approach called Catalysis Clustering to incorporate domain knowledge into the clustering process.
133	Prediction and Profiling of Audience Competition for Online Television Series	Peng Zhang; Chuanren Liu; Kefeng Ning; Wenxiang Zhu; Yu Zhang;	In this paper, we develop a data-driven framework to model and predict audience competition patterns for popular online television series.
134	Multi-Class Data Description for Out-of-distribution Detection	Dongha Lee; Sehun Yu; Hwanjo Yu;	In this work, we present a deep multi-class data description, termed as Deep-MCDD, which is effective to detect out-of-distribution (OOD) samples as well as classify in-distribution (ID) samples.
135	In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints	Domenico Mandaglio; Andrea Tagarelli; Francesco Gullo;	We study two novel clustering problems in which the pairwise interactions between entities are characterized by probability distributions and conditioned by external factors within the environment where the entities interact.
136	Recurrent Halting Chain for Early Multi-label Classification	Thomas Hartvigsen; Cansu Sen; Xiangnan Kong; Elke Rundensteiner;	We design an effective solution to this open problem, the Recurrent Halting Chain (RHC), that for the first time integrates key innovations in both Early and Multi-label Classification into one multi-objective model.
137	Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks	Weilin Cong; Rana Forsati; Mahmut Kandemir; Mehrdad Mahdavi;	In this paper, we theoretically analyze the variance of sampling methods and show that, due to the composite structure of empirical risk, the variance of any sampling method can be decomposed intoembedding approximation variance in the forward stage andstochastic gradient variance in the backward stage that necessities mitigating both types of variance to obtain faster convergence rate.
138	Discovering Functional Dependencies from Mixed-Type Data	Panagiotis Mandros; David Kaltenpoth; Mario Boley; Jilles Vreeken;	In this paper, we analyze these fundamental questions and derive formal criteria as to when a discretization process applied to a mixed set of random variables leads to consistent estimates of mutual information.
139	Attackability Characterization of Adversarial Evasion Attack on Discrete Data	Yutong Wang; Yufei Han; Hongyan Bao; Yun Shen; Fenglong Ma; Jin Li; Xiangliang Zhang;	Based on our attackability analysis, we propose a computationally efficient orthogonal matching pursuit-guided attack method for evasion attack on discrete data.
140	The Spectral Zoo of Networks: Embedding and Visualizing Networks with Spectral Moments	Shengmin Jin; Reza Zafarani;	We introduce a spectral embedding method for a network, its Spectral Point, which is basically the first few spectral moments of a network.
141	Unsupervised Differentiable Multi-aspect Network Embedding	Chanyoung Park; Carl Yang; Qi Zhu; Donghyun Kim; Hwanjo Yu; Jiawei Han;	In this paper, we propose a novel end-to-end framework for multi-aspect network embedding, called asp2vec, in which the aspects of each node are dynamically assigned based on its local context.
142	AutoML Pipeline Selection: Efficiently Navigating the Combinatorial Space	Chengrun Yang; Jicong Fan; Ziyang Wu; Madeleine Udell;	In this work, we design a new AutoML system TensorOboe to address this challenge: an automated system to design a supervised learning pipeline.
143	Towards Physics-informed Deep Learning for Turbulent Flow Prediction	Rui Wang; Karthik Kashinath; Mustafa Mustafa; Adrian Albert; Rose Yu;	In this paper, we aim to predict turbulent flow by learning its highly nonlinear dynamics from spatiotemporal velocity fields of large-scale fluid flow simulations of relevance to turbulence modeling and climate modeling.
144	Evaluating Fairness Using Permutation Tests	Cyrus DiCiccio; Sriram Vasudevan; Kinjal Basu; Krishnaram Kenthapadi; Deepak Agarwal;	We propose a permutation testing methodology that performs a hypothesis test that a model is fair across two groups with respect to any given metric.
145	Leveraging Model Inherent Variable Importance for Stable Online Feature Selection	Johannes Haug; Martin Pawelczyk; Klaus Broelemann; Gjergji Kasneci;	In this work, we introduce FIRES, a novel framework for online feature selection. By treating model parameters as random variables, we can penalize features with high uncertainty and thus generate more stable feature sets.
146	Multi-level Graph Convolutional Networks for Cross-platform Anchor Link Prediction	Hongxu Chen; Hongzhi YIN; Xiangguo Sun; Tong Chen; Bogdan Gabrys; Katarzyna Musial;	In this paper, to address this problem, we propose a novel framework that considers multi-level graph convolutions on both local network structure and hypergraph structure in a unified manner.
147	Evaluating Conversational Recommender Systems via User Simulation	Shuo Zhang; Krisztian Balog;	As an alternative, we propose automated evaluation by means of simulating users.
148	Measuring Model Complexity of Neural Networks with Curve Activation Functions	Xia Hu; Weiqing Liu; Jiang Bian; Jian Pei;	To tackle the challenge, in this paper, we first propose linear approximation neural network (LANN for short), a piecewise linear framework to approximate a given deep model with curve activation function.
149	Diverse Rule Sets	Guangyi Zhang; Aristides Gionis;	Here we propose a novel approach of inferring diverse rule sets, by optimizing small overlap among decision rules with a 2-approximation guarantee under the framework of Max-Sum diversification.
150	Vamsa: Automated Provenance Tracking in Data Science Scripts	Mohammad Hossein Namaki; Avrilia Floratou; Fotis Psallidas; Subru Krishnan; Ashvin Agrawal; Yinghui Wu; Yiwen Zhu; Markus Weimer;	In this work, we introduce the ML provenance tracking problem: the fundamental idea is to automatically track which columns in a dataset have been used to derive the features/labels of an ML model.
151	Deep State-Space Generative Model For Correlated Time-to-Event Predictions	Yuan Xue; Denny Zhou; Nan Du; Andrew M. Dai; Zhen Xu; Kun Zhang; Claire Cui;	In this work, we propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events (e.g., kidney failure, mortality) by explicitly modeling the temporal dynamics of patients’ latent states.
152	Meta-learning on Heterogeneous Information Networks for Cold-start Recommendation	Yuanfu Lu; Yuan Fang; Chuan Shi;	In MetaHIN, we propose a novel semantic-enhanced tasks constructor and a co-adaptation meta-learner to address the two questions.
153	WavingSketch: An Unbiased and Generic Sketch for Finding Top-k Items in Data Streams	Jizhou Li; Zikun Li; Yifei Xu; Shiqi Jiang; Tong Yang; Bin Cui; Yafei Dai; Gong Zhang;	In this paper, we propose a new sketch, WavingSketch, which is much more accurate than existing unbiased algorithms.
154	Dynamic Knowledge Graph based Multi-Event Forecasting	Songgaojun Deng; Huzefa Rangwala; Yue Ning;	In this paper, we study a temporal graph learning method with heterogeneous data fusion for predicting concurrent events of multiple types and inferring multiple candidate actors simultaneously.
155	A Geometric Approach to Predicting Bounds of Downstream Model Performance	Brian J. Goode; Debanjan Datta;	This paper presents the motivation and methodology for including model application criteria into baseline analysis.
156	Context-to-Session Matching: Utilizing Whole Session for Response Selection in Information-Seeking Dialogue Systems	Zhenxin Fu; Shaobo Cui; Mingyue Shang; Feng Ji; Dongyan Zhao; Haiqing Chen; Rui Yan;	In this paper, we consider the response and its context as a whole session and explore the task of matching the query’s context with the sessions.
157	HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units	Shenda Hong; Yanbo Xu; Alind Khare; Satria Priambada; Kevin Maher; Alaa Aljiffry; Jimeng Sun; Alexey Tumanov;	To address these challenges, we propose HOLMES—an online model ensemble serving framework for healthcare applications.
158	LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values	Kejing Yin; Ardavan Afshar; Joyce C. Ho; William K. Cheung; Chao Zhang; Jimeng Sun;	In this paper, we propose Logistic PARAFAC2 (LogPar) by modeling the binary irregular tensor with Bernoulli distribution parameterized by an underlying real-valued tensor.
159	RECORD: Resource Constrained Semi-Supervised Learning under Distribution Shift	Lan-Zhe Guo; Zhi Zhou; Yu-Feng Li;	This paper presents a systemic solution Record consisting of three sub-steps, that is, distribution tracking, sample selection and model updating.
160	Statistically Significant Pattern Mining with Ordinal Utility	Thien Q. Tran; Kazuto Fukuchi; Youhei Akimoto; Jun Sakuma;	Our study aims to introduce a preference relation into patterns and to discover the most preferred patterns under the constraint of statistical significance, which has never been considered in existing SSPM problems.
161	Certifiable Robustness of Graph Convolutional Networks under Structure Perturbations	Daniel Zügner; Stephan Günnemann;	In this work we close this gap and propose the first method to certify robustness of Graph Convolutional Networks (GCNs) under perturbations of the graph structure.
162	Understanding Negative Sampling in Graph Representation Learning	Zhen Yang; Ming Ding; Chang Zhou; Hongxia Yang; Jingren Zhou; Jie Tang;	To bridge the gap, we systematically analyze the role of negative sampling from the perspectives of both objective and risk, theoretically demonstrating that negative sampling is as important as positive sampling in determining the optimization objective and the resulted variance.
163	Aligning Superhuman AI with Human Behavior: Chess as a Model System	Reid McIlroy-Young; Siddhartha Sen; Jon Kleinberg; Ashton Anderson;	We develop and introduce Maia, a customized version of AlphaZero trained on human chess games, that predicts human moves at a much higher accuracy than existing engines, and can achieve maximum accuracy when predicting decisions made by players at a specific skill level in a tuneable way.
164	Heidegger: Interpretable Temporal Causal Discovery	Mehrdad Mansouri; Ali Arab; Zahra Zohrevand; Martin Ester;	Toward a new horizon, this study introduces the novel problem of Causal Profile Discovery, which is crucial for many applications such as adverse drug reaction and cyber-attack detection.
165	Interpretable Deep Graph Generation with Node-edge Co-disentanglement	Xiaojie Guo; Liang Zhao; Zhao Qin; Lingfei Wu; Amarda Shehu; Yanfang Ye;	To address these challenges, we propose a new disentanglement enhancement framework for deep generative models for attributed graphs.
166	Minimizing Localized Ratio Cut Objectives in Hypergraphs	Nate Veldt; Austin R. Benson; Jon Kleinberg;	Here we present a framework for local hypergraph clustering based on minimizing localized ratio cut objectives.
167	RECIPTOR: An Effective Pretrained Model for Recipe Representation Learning	Diya Li; Mohammed J. Zaki;	In this paper, we provide a joint approach for learning effective pretrained recipe embeddings using both the ingredients and cooking instructions.
168	Hyperbolic Distance Matrices	Puoya Tabaghi; Ivan Dokmanić;	In this paper, we propose a unified framework to compute hyperbolic embeddings from an arbitrary mix of noisy metric and non-metric data.
169	RayS: A Ray Searching Method for Hard-label Adversarial Attack	Jinghui Chen; Quanquan Gu;	In this paper, we present the Ray Searching attack (RayS), which greatly improves the hard-label attack effectiveness as well as efficiency.
170	On Sampled Metrics for Item Recommendation	Walid Krichene; Steffen Rendle;	We show that it is possible to improve the quality of the sampled metrics by applying a correction, obtained by minimizing different criteria such as bias or mean squared error.
171	ALO-NMF: Accelerated Locality-Optimized Non-negative Matrix Factorization	Gordon E. Moon; J. Austin Ellis; Aravind Sukumaran-Rajam; Srinivasan Parthasarathy; P. Sadayappan;	In this paper, we present a novel optimization method for parallel NMF algorithm based on the HALS (Hierarchical Alternating Least Squares) scheme that incorporates algorithmic transformations to enhance data locality.
172	Multi-Source Deep Domain Adaptation with Weak Supervision for Time-Series Sensor Data	Garrett Wilson; Janardhan Rao Doppa; Diane J. Cook;	However, robust techniques have not yet been considered for time series data with varying amounts of data availability. In this paper, we make three main contributions to fill this gap.
173	Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions	James McInerney; Brian Brost; Praveen Chandar; Rishabh Mehrotra; Benjamin Carterette;	We propose a new counterfactual estimator that allows for sequential interactions in the rewards with lower variance in an asymptotically unbiased manner.
174	TAdaNet: Task-Adaptive Network for Graph-Enriched Meta-Learning	Qiuling Suo; Jingyuan Chou; Weida Zhong; Aidong Zhang;	In this paper, we propose a task-adaptive network (TAdaNet) that makes use of a domain-knowledge graph to enrich data representations and provide task-specific customization.
175	Unsupervised Paraphrasing via Deep Reinforcement Learning	A. B. Siddique; Samet Oymak; Vagelis Hristidis;	We propose Progressive Unsupervised Paraphrasing (PUP): a novel unsupervised paraphrase generation method based on deep reinforcement learning (DRL).
176	CICLAD: A Fast and Memory-efficient Closed Itemset Miner for Streams	Tomas Martin; Guy Francoeur; Petko Valtchev;	In a search for a better storage-efficiency trade-off, we designed Ciclad, an intersection-based sliding-window FCI miner.
177	Graph Attention Networks over Edge Content-Based Channels	Lu Lin; Hongning Wang;	In this paper, we propose a channel-aware attention mechanism enabled by edge text content when aggregating information from neighboring nodes; and we realize this mechanism in a graph autoencoder framework.
178	Multimodal Learning with Incomplete Modalities by Knowledge Distillation	Qi Wang; Liang Zhan; Paul Thompson; Jiayu Zhou;	In this paper, we proposed a framework based on knowledge distillation, utilizing the supplementary information from all modalities, and avoiding imputation and noise associated with it.
179	Estimating the Percolation Centrality of Large Networks through Pseudo-dimension Theory	Alane M. de Lima, Murilo V. G. da Silva, André L. Vignatti;	In this work we investigate the problem of estimating the percolation centrality of every vertex in a graph.
180	TinyGNN: Learning Efficient Graph Neural Networks	Bencheng Yan; Chaokun Wang; Gaoyang Guo; Yunkai Lou;	In this paper, we try to learn a small GNN (called TinyGNN), which can achieve high performance and infer the node representation in a short time.
181	GPT-GNN: Generative Pre-Training of Graph Neural Networks	Ziniu Hu; Yuxiao Dong; Kuansan Wang; Kai-Wei Chang; Yizhou Sun;	In this paper, we present the GPT-GNN framework to initialize GNNs by generative pre-training.
182	Parameterized Correlation Clustering in Hypergraphs and Bipartite Graphs	Nate Veldt; Anthony Wirth; David F. Gleich;	Motivated by applications in community detection and dense subgraph discovery, we consider new clustering objectives in hypergraphs and bipartite graphs.
183	Prioritized Restreaming Algorithms for Balanced Graph Partitioning	Amel Awadelkarim; Johan Ugander;	With the help of this modular perspective, we find that a key combination of design decisions leads to a novel family of algorithms with notably better empirical performance than any existing highly-scalable algorithm on a broad range of real-world graphs.
184	A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components	Yuantong Li; Qi Ma; Sujit K. Ghosh;	In this paper, we propose a robust and quick approach based on change-point methods to determine the number of mixture components that works for almost any location-scale families even when the components are heavy tailed (e.g., Cauchy).
185	AdvMind: Inferring Adversary Intent of Black-Box Attacks	Ren Pang; Xinyang Zhang; Shouling Ji; Xiapu Luo; Ting Wang;	In this paper, we present AdvMind, a new class of estimation models that infer the adversary intent of black-box adversarial attacks in a robust and prompt manner.
186	Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding	Yu Meng; Yunyi Zhang; Jiaxin Huang; Yu Zhang; Chao Zhang; Jiawei Han;	To guide the hierarchical topic discovery process with minimal user supervision, we propose a new task, Hierarchical Topic Mining, which takes a category tree described by category names only, and aims to mine a set of representative terms for each category from a text corpus to help a user comprehend his/her interested topics.
187	Combinatorial Black-Box Optimization with Expert Advice	Hamid Dadkhahi; Karthikeyan Shanmugam; Jesus Rios; Payel Das; Samuel C. Hoffman; Troy David Loeffler; Subramanian Sankaranarayanan;	To address this problem, we propose a computationally efficient model learning algorithm based on multilinear polynomials and exponential weight updates.
188	CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring	Jiaxin Huang; Yiqing Xie; Yu Meng; Yunyi Zhang; Jiawei Han;	In this paper, we propose a method for seed-guided topical taxonomy construction, which takes a corpus and a seed taxonomy described by concept names as input, and constructs a more complete taxonomy based on user’s interest, wherein each node is represented by a cluster of coherent terms.
189	Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes	Soorajnath Boominathan; Michael Oberst; Helen Zhou; Sanjat Kanjilal; David Sontag;	We present, compare, and evaluate three approaches for learning individualized treatment policies in this setting: First, we consider two indirect approaches, which use predictive models of treatment response to construct policies optimal for different trade-offs between objectives. Second, we consider a direct approach that constructs such a set of policies without intermediate models of outcomes.
190	List-wise Fairness Criterion for Point Processes	Jin Shang; Mingxuan Sun; Nina S.N. Lam;	In this paper, we propose a novel list-wise fairness criterion for point processes, which can efficiently evaluate the ranking fairness in event prediction.
191	Neural Subgraph Isomorphism Counting	Xin Liu; Haojie Pan; Mutian He; Yangqiu Song; Xin Jiang; Lifeng Shang;	In this paper, we study a new graph learning problem: learning to count subgraph isomorphisms.
192	Hypergraph Clustering Based on PageRank	Yuuki Takai; Atsushi Miyauchi; Masahiro Ikeda; Yuichi Yoshida;	In this study, we develop two clustering algorithms based on personalized PageRank on hypergraphs.
193	DeepSinger: Singing Voice Synthesis with Data Mined From the Web	Yi Ren; Xu Tan; Tao Qin; Jian Luan; Zhou Zhao; Tie-Yan Liu;	In this paper, we develop DeepSinger, a multi-lingual multi-singer singing voice synthesis (SVS) system, which is built from scratch using singing training data mined from music websites.
194	Scaling Choice Models of Relational Social Data	Jan Overgoor; George Pakapol Supaniratisai; Johan Ugander;	Given the importance of negative sampling, in this work we introduce a model simplification technique for mixed logit models that we call "de-mixing”, whereby standard mixture models of network formation—particularly models that mix local and global link formation—are reformulated to operate their modes over disjoint choice sets.
195	Deep Exogenous and Endogenous Influence Combination for Social Chatter Intensity Prediction	Subhabrata Dutta; Sarah Masud; Soumen Chakrabarti; Tanmoy Chakraborty;	To address the three limitations noted above, we propose a novel framework, ChatterNet, which, to our knowledge, is the first that can model and predict user engagement without considering the underlying user network.
196	Geography-Aware Sequential Location Recommendation	Defu Lian; Yongji Wu; Yong Ge; Xing Xie; Enhong Chen;	To this end, we propose a Geography-aware sequential recommender based on the Self-Attention Network (GeoSAN for short) for location recommendation.
197	Dual Channel Hypergraph Collaborative Filtering	Shuyi Ji; Yifan Feng; Rongrong Ji; Xibin Zhao; Wanwan Tang; Yue Gao;	Under such circumstances, we propose a dual channel hypergraph collaborative filtering (DHCF) framework to tackle the above issues.
198	A Framework for Recommending Accurate and Diverse Items Using Bayesian Graph Convolutional Neural Networks	Jianing Sun; Wei Guo; Dengcheng Zhang; Yingxue Zhang; Florence Regol; Yaochen Hu; Huifeng Guo; Ruiming Tang; Han Yuan; Xiuqiang He; Mark Coates;	To alleviate the above issue, in this work, we take a first step to introduce a principled way to model the uncertainty in the user-item interaction graph using the Bayesian Graph Convolutional Neural Network framework.
199	Learning Based Distributed Tracking	Hao WU; Junhao Gan; Rui Zhang;	In this paper, we revisit a fundamental problem called Distributed Tracking (DT) under an assumption that the data follows a certain (known or unknown) distribution, and propose a number Data-dependent algorithms with improved theoretical bounds.
200	Tight Sensitivity Bounds For Smaller Coresets	Alaa Maalouf; Adiel Statman; Dan Feldman;	We provide algorithms that compute provably tight bounds for the sensitivity of each input row. It is based on two ingredients: (i) iterative algorithm that computes the exact sensitivity of each row up to arbitrary small precision for (non-affine) k-subspaces, and (ii) a general reduction for computing a coreset for affine subspaces, given a coreset for (non-affine) subspaces in Rd.
201	GHashing: Semantic Graph Hashing for Approximate Similarity Search in Graph Databases	Zongyue Qin; Yunsheng Bai; Yizhou Sun;	Inspired by the recent success of deep-learning-based semantic hashing in image and document retrieval, we propose a novel graph neural network (GNN) based semantic hashing, i.e. GHashing, for approximate pruning.
202	Interactive Path Reasoning on Graph for Conversational Recommendation	Wenqiang Lei; Gangyi Zhang; Xiangnan He; Yisong Miao; Xiang Wang; Liang Chen; Tat-Seng Chua;	In this paper, we propose Conversational Path Reasoning (CPR), a generic framework that models conversational recommendation as an interactive path reasoning problem on a graph.
203	Algorithmic Aspects of Temporal Betweenness	Sebastian Buß; Hendrik Molter; Rolf Niedermeier; Maciej Rymar;	We provide a systematic study of temporal betweenness variants based on various concepts of optimal temporal paths both on a theoretical and empirical level.
204	Non-Linear Mining of Social Activities in Tensor Streams	Koki Kawabata; Yasuko Matsubara; Takato Honda; Yasushi Sakurai;	In this paper, we propose a streaming method, namely, CubeCast, that is designed to capture basic trends and seasonality in tensor streams and extract temporal and multi-dimensional relationships between such dynamics.
205	DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering	Yuval Heffetz; Roman Vainshtein; Gilad Katz; Lior Rokach;	In this study we present DeepLine, a reinforcement learning-based approach for automatic pipeline generation.
206	On Sampling Top-K Recommendation Evaluation	Dong Li; Ruoming Jin; Jing Gao; Zhi Liu;	In this work, we thoroughly investigate the relationship between the sampling and global top-K Hit-Ratio (HR, or Recall), originally proposed by Koren[2] and extensively used by others.
207	Algorithmic Decision Making with Conditional Fairness	Renzhe Xu; Peng Cui; Kun Kuang; Bo Li; Linjun Zhou; Zheyan Shen; Wei Cui;	We thus define conditional fairness as a more sound fairness metric by conditioning on the fairness variables.
208	Semi-supervised Collaborative Filtering by Text-enhanced Domain Adaptation	Wenhui Yu; Xiao Lin; Junfeng Ge; Wenwu Ou; Zheng Qin;	To solve these difficulties, we regard the problem of recommendation on sparse implicit feedbacks as a semi-supervised learning task, and explore domain adaption to solve it.
209	Rich Information is Affordable: A Systematic Performance Analysis of Second-order Optimization Using K-FAC	Yuichiro Ueno; Kazuki Osawa; Yohei Tsuji; Akira Naruse; Rio Yokota;	In this work, we conduct a step-by-step performance analysis when computing the Fisher information matrix during training of ResNet-50 on ImageNet, and show that the overhead can be reduced to the same amount as the cost of performing a single SGD step.
210	Voronoi Graph Traversal in High Dimensions with Applications to Topological Data Analysis and Piecewise Linear Interpolation	Vladislav Polianskii; Florian T. Pokorny;	We propose a randomized approximation approach that mitigates the prohibitive cost of exact computation of Voronoi diagrams in high dimensions for machine learning applications.
211	MCRapper: Monte-Carlo Rademacher Averages for Poset Families and Approximate Pattern Mining	Leonardo Pellegrina; Cyrus Cousins; Fabio Vandin; Matteo Riondato;	We present MCRapper, an algorithm for efficient computation of Monte-Carlo Empirical Rademacher Averages (MCERA) for families of functions exhibiting poset (e.g., lattice) structure, such as those that arise in many pattern mining tasks.
212	REA: Robust Cross-lingual Entity Alignment Between Knowledge Graphs	Shichao Pei; Lu Yu; Guoxian Yu; Xiangliang Zhang;	Our proposed method named REA (Robust Entity Alignment) consists of two components: noise detection and noise-aware entity alignment.
213	Stable Learning via Differentiated Variable Decorrelation	Zheyan Shen; Peng Cui; Jiashuo Liu; Tong Zhang; Bo Li; Zhitang Chen;	In this paper, we incorporate the unlabled data from multiple environments into the variable decorrelation framework and propose a Differentiated Variable Decorrelation (DVD) algorithm based on the clustering of variables.
214	Learning Stable Graphs from Multiple Environments with Selection Bias	Yue He; Peng Cui; Jianxin Ma; Hao Zou; Xiaowei Wang; Hongxia Yang; Philip S. Yu;	In this paper, we target the problem of learning stable graphs from multiple environments with selection bias.
215	Fast RobustSTL: Efficient and Robust Seasonal-Trend Decomposition for Time Series with Complex Patterns	Qingsong Wen; Zhe Zhang; Yan Li; Liang Sun;	In this paper, we extend RobustSTL to handle multiple seasonality.
216	CurvaNet: Geometric Deep Learning based on Directional Curvature for 3D Shape Analysis	Wenchong He; Zhe Jiang; Chengming Zhang; Arpan Man Sainju;	In contrast, this paper proposes a novel geometric deep learning model called CurvaNet that integrates differential geometry with graph neural networks.
217	Attentional Multi-graph Convolutional Network for Regional Economy Prediction with Open Migration Data	Fengli Xu; Yong Li; Shusheng Xu;	We study the problem of predicting regional economy of U.S. counties with open migration data collected from U.S. Internal Revenue Service (IRS) records.
218	Octet: Online Catalog Taxonomy Enrichment with Self-Supervision	Yuning Mao; Tong Zhao; Andrey Kan; Chenwei Zhang; Xin Luna Dong; Christos Faloutsos; Jiawei Han;	In this paper, we present a self-supervised end-to-end framework, Octet, for Online Catalog Taxonomy EnrichmenT.
219	TIMME: Twitter Ideology-detection via Multi-task Multi-relational Embedding	Zhiping Xiao; Weiping Song; Haoyan Xu; Zhicheng Ren; Yizhou Sun;	We aim at solving the problem of predicting people’s ideology, or political tendency.
220	Knowing your FATE: Friendship, Action and Temporal Explanations for User Engagement Prediction on Social Apps	Xianfeng Tang; Yozen Liu; Neil Shah; Xiaolin Shi; Prasenjit Mitra; Suhang Wang;	In this paper, we study a novel problem of explainable user engagement prediction for social network Apps.
221	Sub-Matrix Factorization for Real-Time Vote Prediction	Alexander Immer; Victor Kristof; Matthias Grossglauser; Patrick Thiran;	We address the problem of predicting aggregate vote outcomes (e.g., national) from partial outcomes (e.g., regional) that are revealed sequentially.
222	Temporal-Contextual Recommendation in Real-Time	Yifei Ma; Balakrishnan (Murali) Narayanaswamy; Haibin Lin; Hao Ding;	To fill this gap, we present a black-box recommender system that can adapt to a diverse set of scenarios without the need for manual tuning.
223	OptMatch: Optimized Matchmaking via Modeling the High-Order Interactions on the Arena	Linxia Gong; Xiaochuan Feng; Dezhi Ye; Hao Li; Runze Wu; Jianrong Tao; Changjie Fan; Peng Cui;	This paper proposes a two-stage data-driven matchmaking framework (namely OptMatch), which is applicable to most of gaming products and has the minimal product knowledge required.
224	PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest	Aditya Pal; Chantat Eksombatchai; Yitong Zhou; Bo Zhao; Charles Rosenberg; Jure Leskovec;	In this work, we introduce PinnerSage, an end-to-end recommender system that represents each user via multi-modal embeddings and leverages this rich representation of users to provides high quality personalized recommendations.
225	Polestar: An Intelligent, Efficient and National-Wide Public Transportation Routing Engine	Hao Liu; Ying Li; Yanjie Fu; Huaibo Mei; Jingbo Zhou; Xu Ma; Hui Xiong;	To this end, in this paper, we present Polestar, a data-driven engine for intelligent and efficient public transportation routing.
226	Context-Aware Attentive Knowledge Tracing	Aritra Ghosh; Neil Heffernan; Andrew S. Lan;	In this paper, we propose attentive knowledge tracing (AKT), which couples flexible attention-based neural network models with a series of novel, interpretable model components inspired by cognitive and psychometric models.
227	Improving Movement Predictions of Traffic Actors in Bird’s-Eye View Models using GANs and Differentiable Trajectory Rasterization	Eason Wang; Henggang Cui; Sai Yalamanchi; Mohana Moorthy; Nemanja Djuric;	In this paper we build upon these two directions and propose a raster-based conditional GAN architecture, powered by a novel differentiable rasterizer module at the input of the conditional discriminator that maps generated trajectories into the raster space in a differentiable manner.
228	M2GRL: A Multi-task Multi-view Graph Representation Learning Framework for Web-scale Recommender Systems	Menghan Wang; Yujie Lin; Guli Lin; Keping Yang; Xiao-ming Wu;	In this paper, we use a multi-view representation alignment approach to address this issue.
229	Attribute-based Propensity for Unbiased Learning in Recommender Systems: Algorithm and Case Studies	Zhen Qin; Suming J. Chen; Donald Metzler; Yongwoo Noh; Jingzheng Qin; Xuanhui Wang;	In this paper, we generalize the traditional position bias model to an attribute-based propensity framework.
230	Predicting Individual Treatment Effects of Large-scale Team Competitions in a Ride-sharing Economy	Teng Ye; Wei Ai; Lingyu Zhang; Ning Luo; Lulu Zhang; Jieping Ye; Qiaozhu Mei;	In this study, we analyze data collected from more than 500 large-scale team competitions organized by a leading ride-sharing platform, building machine learning models to predict individual treatment effects.
231	Cellular Network Radio Propagation Modeling with Deep Convolutional Neural Networks	Xin Zhang; Xiujun Shu; Bingwen Zhang; Jie Ren; Lizhou Zhou; Xin Chen;	In this article we present a novel method to model radio propagation using deep convolutional neural networks and report significantly improved performance compared to conventional models.
232	Neural Input Search for Large Scale Recommendation Models	Manas R. Joglekar; Cong Li; Mei Chen; Taibai Xu; Xiaoming Wang; Jay K. Adams; Pranav Khaitan; Jiahui Liu; Quoc V. Le;	We present Neural Input Search (NIS), a technique for learning the optimal vocabulary sizes and embedding dimensions for categorical features.
233	Easy Perturbation EEG Algorithm for Spectral Importance (easyPEASI): A Simple Method to Identify Important Spectral Features of EEG in Deep Learning Models	David O. Nahmias; Kimberly L. Kontson;	This work proposes and validates a method to investigate frequency bands important to EEG-driven deep learning models.
234	Building Continuous Integration Services for Machine Learning	Bojan Karlaš; Matteo Interlandi; Cedric Renggli; Wentao Wu; Ce Zhang; Deepak Mukunthu Iyappan Babu; Jordan Edwards; Chris Lauren; Andy Xu; Markus Weimer;	We develop the first CI system for ML, to the best of our knowledge, that integrates seamlessly with existing ML development tools.
235	Learning to Cluster Documents into Workspaces Using Large Scale Activity Logs	Weize Kong; Michael Bendersky; Marc Najork; Brandon Vargo; Mike Colagrosso;	We go beyond the textual similarity-based unsupervised clustering paradigm and instead directly learn from users’ activity for document clustering.
236	What is that Building?: An End-to-end System for Building Recognition from Streetside Images	Chiqun Zhang; Dragomir Yankov; Chun-Ting Wu; Simon Shapiro; Jason Hong; Wei Wu;	The paper describes Streetside Building Search-Retrieve System (SBSRS) – a system for recognizing buildings from steetside images. To evaluate the system, we generate a dataset of over 23K unique business buildings from four major US cities.
237	MultiSage: Empowering GCN with Contextualized Multi-Embeddings on Web-Scale Multipartite Networks	Carl Yang; Aditya Pal; Andrew Zhai; Nikil Pancha; Jiawei Han; Charles Rosenberg; Jure Leskovec;	Here, we present a contextualized GCN engine by modeling the multipartite networks of target nodes and their intermediatecontext nodes that specify the contexts of their interactions.
238	HetETA: Heterogeneous Information Network Embedding for Estimating Time of Arrival	Huiting Hong; Yucheng Lin; Xiaoqing Yang; Zang Li; Kung Fu; Zheng Wang; Xiaohu Qie; Jieping Ye;	In this paper, we propose HetETA to leverage heterogeneous information graph in ETA task.
239	Hubble: An Industrial System for Audience Expansion in Mobile Marketing	Chenyi Zhuang; Ziqi Liu; Zhiqiang Zhang; Yize Tan; Zhengwei Wu; Zhining Liu; Jianping Wei; Jinjie Gu; Guannan Zhang; Jun Zhou; Yuan Qi;	Addressing the above challenges, in this paper, we present the Hubble System, an industrial solution for audience expansion in mobile marketing scenario.
240	Scaling Graph Neural Networks with Approximate PageRank	Aleksandar Bojchevski; Johannes Klicpera; Bryan Perozzi; Amol Kapoor; Martin Blais; Benedek Rózemberczki; Michal Lukasik; Stephan Günnemann;	We present the PPRGo model which utilizes an efficient approximation of information diffusion in GNNs resulting in significant speed gains while maintaining state-of-the-art prediction performance.
241	Combo-Attention Network for Baidu Video Advertising	Tan Yu; Yi Yang; Yi Li; Xiaodong Chen; Mingming Sun; Ping Li;	In this paper, we introduce a technique used in Baidu video advertising for feeding relevant video ads according to the user’s query. To testify the effectiveness of the proposed CAN offline, we built a Daily700K dataset collected from HaoKan APP.
242	Federated Doubly Stochastic Kernel Learning for Vertically Partitioned Data	Bin Gu; Zhiyuan Dang; Xiang Li; Heng Huang;	In this paper, we focus on nonlinear learning with kernels,and propose a federated doubly stochastic kernel learning (FDSKL) algorithm for vertically partitioned data.
243	To Tune or Not to Tune?: In Search of Optimal Configurations for Data Analytics	Ayat Fekry; Lucian Carata; Thomas Pasquier; Andrew Rice; Andy Hopper;	We adapt different ML techniques in order to obtain efficient incremental tuning in our problem domain, and propose Tuneful, a configuration tuning framework.
244	Reconstruction and Decomposition of High-Dimensional Landscapes via Unsupervised Learning	Jing Lei; Nasrin Akhter; Wanli Qiao; Amarda Shehu;	In this paper, we present a novel, hybrid method that combines strengths of these methods, allowing both visualization of the landscape and discovery of macrostates.
245	Map Generation from Large Scale Incomplete and Inaccurate Data Labels	Rui Zhang; Conrad Albrecht; Wei Zhang; Xiaodong Cui; Ulrich Finkler; David Kung; Siyuan Lu;	In this paper we present progress in developing an algorithmic pipeline and distributed compute system that automates the process of map creation using high resolution aerial images.
246	Grale: Designing Networks for Graph Learning	Jonathan Halcrow; Alexandru Mosoi; Sam Ruth; Bryan Perozzi;	In this work, we present Grale, a scalable method we have developed to address the problem of graph design for graphs with billions of nodes.
247	Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data	Yaqing Wang; Yifan Ethan Xu; Xian Li; Xin Luna Dong; Jing Gao;	In this paper, we propose to develop an automatic validation approach that verifies the correctness of textual attribute values for products.
248	CLARA: Confidence of Labels and Raters	Viet-An Nguyen; Peibei Shi; Jagdish Ramakrishnan; Udi Weinsberg; Henry C. Lin; Steve Metz; Neil Chandra; Jane Jing; Dimitris Kalimeris;	In this paper, we present CLARA (Confidence of Labels and Raters), a system developed and deployed at Facebook for aggregating reviewer decisions and estimating their uncertainty.
249	Embedding-based Retrieval in Facebook Search	Jui-Ting Huang; Ashish Sharma; Shuying Sun; Li Xia; David Zhang; Philip Pronin; Janani Padmanabhan; Giuseppe Ottaviano; Linjun Yang;	In this paper, we discuss the techniques for applying EBR to a Facebook Search system.
250	Lumos: A Library for Diagnosing Metric Regressions in Web-Scale Applications	Jamie Pool; Ebrahim Beyrami; Vishak Gopal; Ashkan Aazami; Jayant Gupchup; Jeff Rowland; Binlong Li; Pritesh Kanani; Ross Cutler; Johannes Gehrke;	In this work, we open sourceLumos and present our results from applying it to two different components within the RTC group over millions of sessions.
251	Order Fulfillment Cycle Time Estimation for On-Demand Food Delivery	Lin Zhu; Wei Yu; Kairong Zhou; Xing Wang; Wenxing Feng; Pengyu Wang; Ning Chen; Pei Lee;	In this paper, we present the OFCT prediction model that is currently deployed at Ele.me, which is one of the world’s largest OFD platforms and delivers over 10 million meals in more than 200 Chinese cities every day.
252	Calendar Graph Neural Networks for Modeling Time Structures in Spatiotemporal User Behaviors	Daheng Wang; Meng Jiang; Munira Syed; Oliver Conway; Vishal Juneja; Sriram Subramanian; Nitesh V. Chawla;	In this work, we propose a novel model based on graph neural networks for learning user representations from spatiotemporal behavior data.
253	Privileged Features Distillation at Taobao Recommendations	Chen Xu; Quan Li; Junfeng Ge; Jinyang Gao; Xiaoyong Yang; Changhua Pei; Fei Sun; Jian Wu; Hanxiao Sun; Wenwu Ou;	Inspired by the distillation techniques which bridge the gap between training and inference, in this work, we propose privileged features distillation (PFD).
254	Cracking Tabular Presentation Diversity for Automatic Cross-Checking over Numerical Facts	Hongwei Li; Qingping Yang; Yixuan Cao; Jiaquan Yao; Ping Luo;	This paper introduces the key module of such a system, which aims to identify whether a pair of table cells are semantically equivalent, namely referring to the same fact.
255	GrokNet: Unified Computer Vision Model Trunk and Embeddings For Commerce	Sean Bell; Yiqun Liu; Sami Alsheikh; Yina Tang; Edward Pizzi; M. Henning; Karun Singh; Omkar Parkhi; Fedor Borisyuk;	In this paper, we present GrokNet, a deployed image recognition system for commerce applications.
256	Learning Instrument Invariant Characteristics for Generating High-resolution Global Coral Reef Maps	Ata Akbari Asanjan; Kamalika Das; Alan Li; Ved Chirayath; Juan Torres-Perez; Soroosh Sorooshian;	In this work, we develop a deep learning model for extracting domain invariant features from multimodal remote sensing imagery and creating high-resolution global maps of coral reefs by combining various sources of imagery and limited hand-labeled data available for certain regions.
257	Causal Meta-Mediation Analysis: Inferring Dose-Response Function From Summary Statistics of Many Randomized Experiments	Zenan Wang; Xuan Yin; Tianbo Li; Liangjie Hong;	We model the online evaluation metric as a mediator and formalize its causality with the business KPI as dose-response function (DRF).
258	AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction	Bin Liu; Chenxu Zhu; Guilin Li; Weinan Zhang; Jincai Lai; Ruiming Tang; Xiuqiang He; Zhenguo Li; Yong Yu;	In this work, we propose a two-stage algorithm called Automatic Feature Interaction Selection (AutoFIS).
259	City Metro Network Expansion with Reinforcement Learning	Yu Wei; Minjia Mao; Xi Zhao; Jianhua Zou; Ping An;	To address these limitations, we propose a reinforcement learning based method for the city metro network expansion problem.
260	Game Action Modeling for Fine Grained Analyses of Player Behavior in Multi-player Card Games (Rummy as Case Study)	Sharanya Eswaran; Mridul Sachdeva; Vikram Vimal; Deepanshi Seth; Suhaas Kalpam; Sanjay Agarwal; Tridib Mukherjee; Samrat Dattagupta;	We present a deep learning framework for game action modeling, which enables fine-grained analyses of player behavior.
261	Cascade-LSTM: A Tree-Structured Neural Classifier for Detecting Misinformation Cascades	Francesco Ducci; Mathias Kraus; Stefan Feuerriegel;	As a remedy, we propose a novel tree-structured neural network named Cascade-LSTM.
262	Personalized Prefix Embedding for POI Auto-Completion in the Search Engine of Baidu Maps	Jizhou Huang; Haifeng Wang; Miao Fan; An Zhuo; Ying Li;	In this paper, we present an end-to-end neural-based framework for POI-AC, which has been recently deployed in the search engine of Baidu Maps, one of the largest Web mapping applications with hundreds of millions monthly active users worldwide.
263	Category-Specific CNN for Visual-aware CTR Prediction at JD.com	Hu Liu; Jing Lu; Hao Yang; Xiwei Zhao; Sulong Xu; Hao Peng; Zehua Zhang; Wenjie Niu; Xiaokun Zhu; Yongjun Bao; Weipeng Yan;	To overcome the two challenges, we propose Category-specific CNN (CSCNN) specially for CTR prediction.
264	ConSTGAT: Contextual Spatial-Temporal Graph Attention Network for Travel Time Estimation at Baidu Maps	Xiaomin Fang; Jizhou Huang; Fan Wang; Lingke Zeng; Haijin Liang; Haifeng Wang;	In this paper, we propose an end-to-end neural framework named ConSTGAT, which integrates traffic prediction and contextual information to address these two problems.
265	Faster Secure Data Mining via Distributed Homomorphic Encryption	Junyi Li; Heng Huang;	In this paper, we propose a novel general distributed HE-based data mining framework towards one step of solving the scaling problem.
266	Contagious Chain Risk Rating for Networked-guarantee Loans	Dawei Cheng; Zhibin Niu; Yiyi Zhang;	To this end, we propose a novel approach to rate the risk of contagion chains in the bank industry with the deep neural network.
267	AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types	Xin Luna Dong; Xiang He; Andrey Kan; Xian Li; Yan Liang; Jun Ma; Yifan Ethan Xu; Chenwei Zhang; Tong Zhao; Gabriel Blanco Saldana; Saurabh Deshpande; Alexandre Michetti Manduca; Jay Ren; Surender Pal Singh; Fan Xiao; Haw-Shiuan Chang; Giannis Karamanolakis; Yuning Mao; Yaqing Wang; Christos Faloutsos; Andrew McCallum; Jiawei Han;	We describe AutoKnow, our automatic (self-driving) system that addresses these challenges.
268	Personalized Image Retrieval with Sparse Graph Representation Learning	Xiaowei Jia; Handong Zhao; Zhe Lin; Ajinkya Kale; Vipin Kumar;	In this paper, we develop a novel method CA-GCN for personalized image retrieval in the Adobe Stock image system.
269	Comprehensive Information Integration Modeling Framework for Video Titling	Shengyu Zhang; Ziqi Tan; Zhou Zhao; Jin Yu; Kun Kuang; Tan Jiang; Jingren Zhou; Hongxia Yang; Fei Wu;	To bridge this gap, we integrate comprehensive sources of information, including the content of consumer-generated videos, the narrative comment sentences supplied by consumers, and the product attributes, in an end-to-end modeling framework.
270	Acoustic Measures for Real-Time Voice Coaching	Ying Li; Abraham Miller; Arthur Liu; Kyle Coburn; Luis J. Salazar;	This paper presents methodologies for computing a set of physical properties from sound waves of a speaker’s voice directly, referred to as acoustic measures.
271	Geodemographic Influence Maximization	Kaichen Zhang; Jingbo Zhou; Donglai Tao; Panagiotis Karras; Qing Li; Hui Xiong;	In this paper, we address the natural problem that arises such data: given a distribution of population and point-to-point movement statistics over a network, find a set of locations within a budget that achieves maximum expected reach.
272	A Self-Evolving Mutually-Operative Recurrent Network-based Model for Online Tool Condition Monitoring in Delay Scenario	Monidipa Das; Mahardhika Pratama; Tegoeh Tjahjowidodo;	In order to tackle these issues, we propose SERMON as a novel learning model based on a pair of self-evolving mutually-operative recurrent neural networks.
273	Maximizing Cumulative User Engagement in Sequential Recommendation: An Online Optimization Perspective	Yifei Zhao; Yu-Hang Zhou; Mingdong Ou; Huan Xu; Nan Li;	In this paper, we study this problem from an online optimization perspective, and propose a flexible and practical framework to explicitly tradeoff longer user browsing length and high immediate user engagement.
274	Domain Specific Knowledge Graphs as a Service to the Public: Powering Social-Impact Funding in the US	Ying Li; Vitalii Zakhozhyi; Daniel Zhu; Luis J. Salazar;	This paper explores the practical applications of Domain Specific Knowledge Graphs that allow for the extraction of information from trusted published and unpublished sources, to map the extracted information to an ontology defined in collaboration with sector experts, and to enable the public to go from single queries into ongoing conversations meeting their knowledge needs reliably.
275	LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition	Jin Xu; Xu Tan; Yi Ren; Tao Qin; Jian Li; Sheng Zhao; Tie-Yan Liu;	In this paper, we develop LRSpeech, a TTS and ASR system under the extremely low-resource setting, which can support rare languages with low data cost.
276	Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories	Sijie Ruan; Zi Xiong; Cheng Long; Yiheng Chen; Jie Bao; Tianfu He; Ruiyuan Li; Shengnan Wu; Zhongyuan Jiang; Yu Zheng;	To this end, we propose Delivery Time Inference (DTInf), to automatically infer the delivery time of waybills based on couriers’ trajectories.
277	Improving Deep Learning for Airbnb Search	Malay Haldar; Prashant Ramanathan; Tyler Sax; Mustafa Abdool; Lanbo Zhang; Aamir Mansawala; Shulin Yang; Bradley Turnbull; Junshuo Liao;	In this paper we describe the journey beyond, discussing what we refer to as the ABCs of improving search: A for architecture, B for bias and C for cold start.
278	General-Purpose User Embeddings based on Mobile App Usage	Junqi Zhang; Bing Bai; Ye Lin; Jian Liang; Kun Bai; Fei Wang;	In this paper, we report our recent practice at Tencent for user modeling based on mobile app usage.
279	Unsupervised Translation via Hierarchical Anchoring: Functional Mapping of Places across Cities	Takahiro Yabe; Kota Tsubouchi; Toru Shimizu; Yoshihide Sekimoto; Satish V. Ukkusuri;	We address this problem by proposing an unsupervised translation method that translates embeddings by exploiting common hierarchical structures that exist across imbalanced domains.
280	Debiasing Grid-based Product Search in E-commerce	Ruocheng Guo; Xiaoting Zhao; Adam Henderson; Liangjie Hong; Huan Liu;	In this work, we extend unbiased learning to rank to the world of e-commerce search via considering a grid-based product search scenario.
281	Forecasting the Evolution of Hydropower Generation	Fan Zhou; Liang Li; Kunpeng Zhang; Goce Trajcevski; Fuming Yao; Ying Huang; Ting Zhong; Jiahao Wang; Qiao Liu;	In this paper, we present DeepHydro, a novel stochastic method for modeling multivariate time series (e.g., water inflow/outflow and temperature) and forecasting power generation of hydropower stations.
282	Salience and Market-aware Skill Extraction for Job Targeting	Baoxu Shi; Jaewon Yang; Feng Guo; Qi He;	In this work, we show that the commonly used text-based salience and market-agnostic skill extraction approach is sub-optimal because it only considers skill mention and ignores the salient level of a skill and its market dynamics, i.e., the market supply and demand influence on the importance of skills.
283	DATE: Dual Attentive Tree-aware Embedding for Customs Fraud Detection	Sundong Kim; Yu-Che Tsai; Karandeep Singh; Yeonsoo Choi; Etim Ibok; Cheng-Te Li; Meeyoung Cha;	This paper proposes DATE, a model of Dual-task Attentive Tree-aware Embedding, to classify and rank illegal trade flows that contribute the most to the overall customs revenue when caught.
284	User Sentiment as a Success Metric: Persistent Biases Under Full Randomization	Ercan Yildiz; Joshua Safyan; Marc Harper;	We study user sentiment (reported via optional surveys) as a metric for fully randomized A/B tests.
285	Improving Recommendation Quality in Google Drive	Suming J. Chen; Zhen Qin; Zac Wilson; Brian Calaci; Michael Rose; Ryan Evans; Sean Abraham; Donald Metzler; Sandeep Tata; Michael Colagrosso;	In this paper, we discuss both the challenges of iteratively improving the quality of a personal recommendation system as well as the variety of approaches that we took in order to improve this feature.
286	Large-Scale Training System for 100-Million Classification at Alibaba	Liuyihan Song; Pan Pan; Kang Zhao; Hao Yang; Yiming Chen; Yingya Zhang; Yinghui Xu; Rong Jin;	In this paper, we propose a large-scale training system to address these challenges.
287	Mining Implicit Relevance Feedback from User Behavior for Web Question Answering	Linjun Shou; Shining Bo; Feixiang Cheng; Ming Gong; Jian Pei; Daxin Jiang;	In this paper, we make the first study to explore the correlation between user behavior and passage relevance, and propose a novel approach for mining training data for Web QA.
288	Controllable Multi-Interest Framework for Recommendation	Yukuo Cen; Jianwei Zhang; Xu Zou; Chang Zhou; Hongxia Yang; Jie Tang;	In this paper, we propose a novel controllable multi-interest framework for the sequential recommendation, called ComiRec.
289	Managing Diversity in Airbnb Search	Mustafa Abdool; Malay Haldar; Prashant Ramanathan; Tyler Sax; Lanbo Zhang; Aamir Manaswala; Lynn Yang; Bradley Turnbull; Qing Zhang; Thomas Legrand;	In this paper, we describe our journey in tackling the problem of diversity for Airbnb search, starting from heuristic based approaches and concluding with a novel deep learning solution that produces an embedding of the entire query context by leveraging Recurrent Neural Networks (RNNs).
290	Molecular Inverse-Design Platform for Material Industries	Seiji Takeda; Toshiyuki Hama; Hsiang-Han Hsu; Victoria A. Piunova; Dmitry Zubarev; Daniel P. Sanders; Jed W. Pitera; Makoto Kogoh; Takumi Hongo; Yenwei Cheng; Wolf Bocanett; Hideaki Nakashika; Akihiro Fujita; Yuta Tsuchiya; Katsuhiko Hino; Kentaro Yano; Shuichi Hirose; Hiroki Toda; Yasumitsu Orii; Daiju Nakano;	In this paper, we present a material industry-oriented web platform of an AI-driven molecular inverse-design system, which automatically designs brand new molecular structures rapidly and diversely.
291	Learning to Score Economic Development from Satellite Imagery	Sungwon Han; Donghyun Ahn; Sungwon Park; Jeasurk Yang; Susang Lee; Jihee Kim; Hyunjoo Yang; Sangyoon Park; Meeyoung Cha;	In this paper, we introduce a novel approach for measuring economic development from high-resolution satellite images in the absence of ground truth statistics.
292	A Request-level Guaranteed Delivery Advertising Planning: Forecasting and Allocation	Hong Zhang; Lan Zhang; Lan Xu; Xiaoyang Ma; Zhengtao Wu; Cong Tang; Wei Xu; Yiguo Yang;	Facing the challenges, we present a holistic design of a request-level guaranteed delivery advertising planning system with careful optimization for all three critical components including impression forecasting, selling and serving.
293	Two Sides of the Same Coin: White-box and Black-box Attacks for Transfer Learning	Yinghua Zhang; Yangqiu Song; Jian Liang; Kun Bai; Qiang Yang;	To figure out this problem, we conduct extensive empirical evaluations to show that fine-tuning effectively enhances model robustness under white-box FGSM attacks. We also propose a black-box attack method for transfer learning models which attacks the target model with the adversarial examples produced by its source model.
294	Learning to Generate Personalized Query Auto-Completions via a Multi-View Multi-Task Attentive Approach	Di Yin; Jiwei Tan; Zhe Zhang; Hongbo Deng; Shujian Huang; Jiajun Chen;	In this paper, we study the task of Query Auto-Completion (QAC), which is a very significant feature of modern search engines.
295	A Sleeping, Recovering Bandit Algorithm for Optimizing Recurring Notifications	Kevin P. Yancey; Burr Settles;	In this paper, we introduce the Recovering Difference Softmax Algorithm to address the particular challenges of this problem domain, and use it to successfully optimize millions of daily reminders for the online language-learning app Duolingo.
296	Multi-objective Optimization for Guaranteed Delivery in Video Service Platform	Hang Lei; Yin Zhao; Longjun Cai;	In this paper, we study the problem of how to maximize certain gains, such as video view (VV) or fairness of different contents (CTR variations between contents) under the GD constraints.
297	Delivery Scope: A New Way of Restaurant Retrieval for On-demand Food Delivery Service	Xuetao Ding; Runfeng Zhang; Zhen Mao; Ke Xing; Fangxiao Du; Xingyu Liu; Guoxing Wei; Feifan Yin; Renqing He; Zhizhao Sun;	In order to draw suitable delivery scopes for millions of restaurant partners, we propose a pioneering delivery scope generation framework.
298	Fraud Transactions Detection via Behavior Tree with Local Intention Calibration	Can Liu; Qiwei Zhong; Xiang Ao; Li Sun; Wangli Lin; Jinghua Feng; Qing He; Jiayu Tang;	In this paper, we devise a tree-like structure named behavior tree to reorganize the user behavioral data, in which a group of successive sequential actions denoting a specific user intention are represented as a branch on the tree.
299	Balanced Order Batching with Task-Oriented Graph Clustering	Lu Duan; Haoyuan Hu; Zili Wu; Guozheng Li; Xinhang Zhang; Yu Gong; Yinghui Xu;	In this paper, rather than designing heuristics, we propose an end-to-end learning and optimization framework named Balanced Task-orientated Graph Clustering Network (BTOGCN) to solve the BOBP by reducing it to balanced graph clustering optimization problem.
300	Efficiently Solving the Practical Vehicle Routing Problem: A Novel Joint Learning Approach	Lu Duan; Yang Zhan; Haoyuan Hu; Yu Gong; Jiangwen Wei; Xiaodong Zhang; Yinghui Xu;	We propose a strategy that combines the reinforcement learning manner with the supervised learning manner to train the model.
301	Meta-Learning for Query Conceptualization at Web Scale	Fred X. Han; Di Niu; Haolan Chen; Weidong Guo; Shengli Yan; Bowei Long;	In this paper, we study the problem of query conceptualization, which is to find the most appropriate matching concepts for any given search query from a large pool of pre-defined concepts.
302	Hybrid Spatio-Temporal Graph Convolutional Network: Improving Traffic Prediction with Navigation Data	Rui Dai; Shenkun Xu; Qian Gu; Chenguang Ji; Kaikui Liu;	Specifically, we propose an algorithm to acquire the upcoming traffic volume from an online navigation engine.
303	Multitask Mixture of Sequential Experts for User Activity Streams	Zhen Qin; Yicheng Cheng; Zhe Zhao; Zhe Chen; Donald Metzler; Jingzheng Qin;	In this work, we study the challenging problem of how to model sequential user behavior in the neural multi-task learning settings.
304	Identifying Homeless Youth At-Risk of Substance Use Disorder: Data-Driven Insights for Policymakers	Maryam Tabar; Heesoo Park; Stephanie Winkler; Dongwon Lee; Anamika Barman-Adhikari; Amulya Yadav;	This work aims to fill this gap by making the following three contributions: (i) we use a real-world dataset collected from ~1,400 homeless youth (across six American states) to build accurate Machine Learning (ML) models for predicting the susceptibility of homeless youth to SUD; (ii) we find a representative set of factors associated with SUD among this population by analyzing feature importance values associated with our ML models; and (iii) we investigate the effect of geographical heterogeneity on the factors associated with SUD.
305	Interleaved Sequence RNNs for Fraud Detection	Bernardo Branco; Pedro Abreu; Ana Sofia Gomes; Mariana S. C. Almeida; João Tiago Ascensão; Pedro Bizarro;	We present a complete RNN framework to detect fraud in real-time, proposing an efficient ML pipeline from preprocessing to deployment.
306	Attention based Multi-Modal New Product Sales Time-series Forecasting	Vijay Ekambaram; Kushagra Manglik; Sumanta Mukherjee; Surya Shravan Kumar Sajja; Satyam Dwivedi; Vikas Raykar;	In this paper, we propose and empirically evaluate several novel attention-based multi-modal encoder-decoder models to forecast the sales for a new product purely based on product images, any available product attributes and also external factors like holidays, events, weather, and discount.
307	Pest Management In Cotton Farms: An AI-System Case Study from the Global South	Aman Dalmia; Jerome White; Ankit Chaurasia; Vishal Agarwal; Rajesh Jain; Dhruvin Vora; Balasaheb Dhame; Raghu Dharmaraju; Rahul Panicker;	We address this problem by presenting a new solution for pesticide management that uses deep learning, smartphone cameras, inexpensive pest traps, existing digital pipelines, and agricultural extension-worker programs.
308	TIES: Temporal Interaction Embeddings for Enhancing Social Media Integrity at Facebook	Nima Noorshams; Saurabh Verma; Aude Hofleitner;	In this paper, we present our efforts to protect various social media entities at Facebook from people who try to abuse our platform.
309	Price Investment using Prescriptive Analytics and Optimization in Retail	Prakhar Mehrotra; Linsey Pang; Karthick Gopalswamy; Avinash Thangali; Timothy Winters; Ketki Gupte; Dnyanesh Kulkarni; Sunil Potnuru; Supreeth Shastry; Harshada Vuyyuri;	In this paper, we apply Machine Learning (ML) algorithms and Operations Research techniques for forecasting and optimization to build a new price recommendation system, which improves our ability to generate price recommendations accurately and automatically.
310	Climate Downscaling Using YNet: A Deep Convolutional Network with Skip Connections and Fusion	Yumin Liu; Auroop R. Ganguly; Jennifer Dy;	In this paper, we proposed YNet, a novel deep convolutional neural network (CNN) with skip connections and fusion capabilities to perform downscaling for climate variables, on multiple GCMs directly rather than on reanalysis data.
311	Cracking the Black Box: Distilling Deep Sports Analytics	Xiangyu Sun; Jack Davis; Oliver Schulte; Guiliang Liu;	We propose and compare several scalable model tree learning heuristics to address the computational challenge from datasets with millions of data points.
312	Taming Pretrained Transformers for Extreme Multi-label Text Classification	Wei-Cheng Chang; Hsiang-Fu Yu; Kai Zhong; Yiming Yang; Inderjit S. Dhillon;	In this paper, we propose X-Transformer, the first scalable approach to fine-tuning deep transformer models for the XMC problem.
313	Prediction of Hourly Earnings and Completion Time on a Crowdsourcing Platform	Anna Lioznova; Alexey Drutsa; Vladimir Kukushkin; Anastasia Bezzubtseva;	We study the problem of predicting future hourly earnings and task completion time for a crowdsourcing platform user who sees the list of available tasks and wants to select one of them to execute.
314	SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter	Venu Satuluri; Yao Wu; Xun Zheng; Yilei Qian; Brian Wichers; Qieyun Dai; Gui Ming Tang; Jerry Jiang; Jimmy Lin;	In this paper, we present SimClusters, a general-purpose representation layer based on overlapping communities into which users as well as heterogeneous content can be captured as sparse, interpretable vectors to support a multitude of recommendation tasks.
315	Time-Aware User Embeddings as a Service	Martin Pavlovski; Jelena Gligorijevic; Ivan Stojkovic; Shubham Agrawal; Shabhareesh Komirishetty; Djordje Gligorijevic; Narayan Bhamidipati; Zoran Obradovic;	To that end, we address the limitations of the current state-of-the-art self-supervised methods for task-independent (unsupervised) sequence embedding, and propose a novel Time-Aware Sequential Autoencoder (TASA) that accounts for the temporal aspects of sequences of activities.
316	Shop The Look: Building a Large Scale Visual Shopping System at Pinterest	Raymond Shiau; Hao-Yu Wu; Eric Kim; Yue Li Du; Anqi Guo; Zhiyuan Zhang; Eileen Li; Kunlong Gu; Charles Rosenberg; Andrew Zhai;	In this work, we provide a holistic view of how we built Shop The Look, a shopping oriented visual search system, along with lessons learned from addressing shopping needs.
317	Dynamic Heterogeneous Graph Neural Network for Real-time Event Prediction	Wenjuan Luo; Han Zhang; Xiaodi Yang; Lin Bo; Xiaoqing Yang; Zang Li; Xiaohu Qie; Jieping Ye;	In this paper, we propose to use dynamically constructed heterogeneous graph for each ongoing event to encode the attributes of the event and its surroundings.
318	Bandit based Optimization of Multiple Objectives on a Music Streaming Platform	Rishabh Mehrotra; Niannan Xue; Mounia Lalmas;	This paper aims at extending contextual bandits to multi-objective setting so as to power recommendations in a multi-stakeholder platforms.
319	Multimodal Deep Learning Based Crop Classification Using Multispectral and Multitemporal Satellite Imagery	Krishna Karthik Gadiraju; Bharathkumar Ramachandra; Zexi Chen; Ranga Raju Vatsavai;	In this paper, we present a multimodal deep learning solution that jointly exploits spatial-spectral and phenological properties to identify major crop types.
320	BusTr: Predicting Bus Travel Times from Real-Time Traffic	Richard Barnes; Senaka Buthpitiya; James Cook; Alex Fabrikant; Andrew Tomkins; Fangzhou Xu;	We present BusTr, a machine-learned model for translating road traffic forecasts into predictions of bus delays, used by Google Maps to serve the majority of the world’s public transit systems where no official real-time bus tracking is provided.
321	Characterizing and Learning Representation on Customer Contact Journeys in Cellular Services	Shuai Zhao; Wen-Ling Hsu; George Ma; Tan Xu; Guy Jacobson; Raif Rustamov;	We propose to learn journey embeddings using a sequence-to-sequence framework that converts each customer journey into a fixed-length latent embedding.
322	CrowdQuake: A Networked System of Low-Cost Sensors for Earthquake Detection via Deep Learning	Xin Huang; Jangsoo Lee; Young-Woo Kwon; Chul-Ho Lee;	In this paper, we present CrowdQuake, a networked system based on low-cost acceleration sensors, which monitors ground motions and detects earthquakes, by developing a convolutional-recurrent neural network model.
323	An Empirical Analysis of Backward Compatibility in Machine Learning Systems	Megha Srivastava; Besmira Nushi; Ece Kamar; Shital Shah; Eric Horvitz;	We consider how updates, intended to improve ML models, can introduce new errors that can significantly affect downstream systems and users.
324	DeepTriage: Automated Transfer Assistance for Incidents in Cloud Services	Phuong Pham; Vivek Jain; Lukas Dauterman; Justin Ormont; Navendu Jain;	To address these challenges, we introduce DeepTriage, an intelligent incident transfer service combining multiple machine learning techniques – gradient boosted classifiers, clustering methods, and deep neural networks – in an ensemble to recommend the responsible team to triage an incident.
325	An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images	Zekun Li; Yao-Yi Chiang; Sasan Tavakkol; Basel Shbita; Johannes H. Uhl; Stefan Leyk; Craig A. Knoblock;	This paper presents an end-to-end approach to address the real-world problem of finding and indexing historical map images.
326	Bootstrapping Complete The Look at Pinterest	Eileen Li; Eric Kim; Andrew Zhai; Josh Beal; Kunlong Gu;	In this paper, we will describe how we bootstrapped the Complete The Look (CTL) system at Pinterest.
327	Explainable Classification of Brain Networks via Contrast Subgraphs	Tommaso Lanciano; Francesco Bonchi; Aristides Gionis;	In this paper we introduce a novel approach for classifying brain networks based on extracting contrast subgraphs, i.e., a set of vertices whose induced subgraphs are dense in one class of graphs and sparse in the other.
328	Jointly Learning to Recommend and Advertise	Xiangyu Zhao; Xudong Zheng; Xiwang Yang; Xiaobing Liu; Jiliang Tang;	To this end, in this paper, we propose a novel two-level reinforcement learning framework to jointly optimize the recommending and advertising strategies, where the first level generates a list of recommendations to optimize user experience in the long run; then the second level inserts ads into the recommendation list that can balance the immediate advertising revenue from advertisers and the negative influence of ads on long-term user experience.
329	Fitbit for Chickens?: Time Series Data Mining Can Increase the Productivity of Poultry Farms	Alireza Abdoli; Sara Alaee; Shima Imani; Amy Murillo; Alec Gerry; Leslie Hickle; Eamonn Keogh;	In this work, we propose a general-purpose framework to robustly learn and classify from datasets exhibiting these issues.
330	CompactETA: A Fast Inference System for Travel Time Prediction	Kun Fu; Fanlin Meng; Jieping Ye; Zheng Wang;	In this paper, we develop a novel ETA learning system named as CompactETA, which provides an accurate online travel time inference within 100 microseconds.
331	Intelligent Exploration for User Interface Modules of Mobile App with Collective Learning	Jingbo Zhou; Zhenwei Tang; Min Zhao; Xiang Ge; Fuzheng Zhuang; Meng Zhou; Liming Zou; Chenglei Yang; Hui Xiong;	To this end, we introduce FEELER, a framework to fast and intelligently explore design solutions of user interface modules with a collective machine learning approach.
332	Gemini: A Novel and Universal Heterogeneous Graph Information Fusing Framework for Online Recommendations	Jixing Xu; Zhenlong Zhu; Jianxin Zhao; Xuanye Liu; Minghui Shan; Jiecheng Guo;	To solve the above problems, we propose a universal and effective framework named Gemini, which only relies on the common interaction logs, avoiding the dependence on auxiliary information and ensuring a better generality.
333	Hypergraph Convolutional Recurrent Neural Network	Jaehyuk Yi; Jinkyoo Park;	In this study, we present a hypergraph convolutional recurrent neural network (HGC-RNN), which is a prediction model for structured time-series sensor network data.
334	Towards Building an Intelligent Chatbot for Customer Service: Learning to Respond at the Appropriate Time	Che Liu; Junfeng Jiang; Chao Xiong; Yi Yang; Jieping Ye;	In this paper, we propose a multi-turn response triggering model (MRTM) to address this problem.
335	Ads Allocation in Feed via Constrained Optimization	Jinyun Yan; Zhiyuan Xu; Birjodh Tiwana; Shaunak Chatterjee;	The paper describes how large-scale recommender system like feed ranking works, and why it is useful to consider ads allocation as a post-operation once the ranking of organic items and (separately) the ranking of ads are done.
336	USAD: UnSupervised Anomaly Detection on Multivariate Time Series	Julien Audibert; Pietro Michiardi; Frédéric Guyard; Sébastien Marti; Maria A. Zuluaga;	In this paper, we propose a fast and stable method called UnSupervised Anomaly Detection for multivariate time series (USAD) based on adversely trained autoencoders.
337	A Dual Heterogeneous Graph Attention Network to Improve Long-Tail Performance for Shop Search in E-Commerce	Xichuan Niu; Bofang Li; Chenliang Li; Rong Xiao; Haochuan Sun; Hongbo Deng; Zhenzhong Chen;	Specifically, we propose a dual heterogeneous graph attention network (DHGAT) integrated with the two-tower architecture, using the user interaction data from both shop search and product search.
338	Learning with Limited Labels via Momentum Damped & Differentially Weighted Optimization	Rishabh Mehrotra; Ashish Gupta;	In this paper, we consider the task of learning from limited labeled data, wherein we aim at jointly leveraging strong supervision data (e.g. explicit judgments) along with weak supervision data (e.g. implicit feedback or labels from the related task) to train neural models.
339	Learning to Simulate Human Mobility	Jie Feng; Zeyu Yang; Fengli Xu; Haisu Yu; Mudan Wang; Yong Li;	To solve this problem, we propose a model-free generative adversarial framework, which effectively integrates the domain knowledge of human mobility regularity utilized in the model-based methods.
340	Data-driven Simulation and Optimization for Covid-19 Exit Strategies	Salah Ghamizi, Renaud Rwemalika, Maxime Cordy, Lisa Veiber, Tegawendé F. Bissyandé, Mike Papadakis, Jacques Klein, Yves Le Traon;	In this paper, we propose to augment epidemiological forecasting with actual data-driven models that will learn to fine-tune predictions for different contexts (e.g., per country).
341	Understanding the Impact of the COVID-19 Pandemic on Transportation-related Behaviors with Human Mobility Data	Jizhou Huang; Haifeng Wang; Miao Fan; An Zhuo; Yibo Sun; Ying Li;	To be specific, we conduct data-driven analysis on transportation-related behaviors during the pandemic from the perspectives of 1) means of transportation, 2) type of visited venues, 3) check-in time of venues, 4) preference on "origin-destination” distance, and 5) "origin-transportation-destination” patterns.
342	Simulating the Impact of Hospital Capacity and Social Isolation to Minimize the Propagation of Infectious Diseases	Shaon Bhatta Shuvo; Bonaventure C. Molokwu; Ziad Kobti;	In this paper, we used artificial agent-based simulation modeling to identify the importance of social distancing and hospitals’ capacity in terms of the number of beds to shorten the length of an outbreak and reduce the total number of infections and deaths during an epidemic.
343	Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs	Clara H. McCreery; Namit Katariya; Anitha Kannan; Manish Chablani; Xavier Amatriain;	In this paper, we show how a double fine-tuning approach of pretraining a neural network on medical question-answer pairs followed by fine-tuning on medical question-question pairs is a particularly useful intermediate task for the ultimate goal of determining medical question similarity.
344	Hi-COVIDNet: Deep Learning Approach to Predict Inbound COVID-19 Patients and Case Study in South Korea	Minseok Kim; Junhyeok Kang; Doyoung Kim; Hwanjun Song; Hyangsuk Min; Youngeun Nam; Dongmin Park; Jae-Gil Lee;	In this paper, to aid in such allocation by predicting the number of inbound COVID-19 cases, we propose Hi-COVIDNet, which takes advantage of the geographic hierarchy.
345	Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data	Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Jing Han, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Cecilia Mascolo;	In this paper we describe our data analysis over a large-scale crowdsourced dataset of respiratory sounds collected to aid diagnosis of COVID-19.
346	Understanding the Urban Pandemic Spreading of COVID-19 with Real World Mobility Data	Qianyue Hao; Lin Chen; Fengli Xu; Yong Li;	To address these challenges, we build a data-driven epidemic simulator with COVID-19 specific features, which incorporates real-world mobility data capturing the heterogeneity in urban environments.