Paper Digest: KDD 2020 Highlights
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) is one of the top data mining conferences in the world. In 2020, it is to be held virtually due to covid-19 pandemic.
To help the community quickly catch up on the work presented in this conference, Paper Digest Team processed all accepted papers, and generated one highlight sentence (typically the main topic) for each paper. Readers are encouraged to read these machine generated highlights / summaries to quickly get the main idea of each paper.
If you do not want to miss any interesting academic paper, you are welcome to sign up our free daily paper digest service to get updates on new papers published in your area every day. You are also welcome to follow us on Twitter and Linkedin to get updated with new conference digests.
Paper Digest Team
team@paperdigest.org
TABLE 1: Paper Digest: KDD 2020 Highlights
Title | Authors | Highlight | |
---|---|---|---|
1 | Learning Effective Road Network Representation with Hierarchical Graph Neural Networks | Ning Wu; Xin Wayne Zhao; Jingyuan Wang; Dayan Pan; | In this paper, we propose a novel Hierarchical Road Network Representation model, named HRNR, by constructing a three-level neural architecture, corresponding to "functional zone", "structural regions" and "road segments", respectively. |
2 | Interpretability is a Kind of Safety: An Interpreter-based Ensemble for Adversary Defense | Jingyuan Wang; Yufan Wu; Mingxuan Li; Xin Lin; Junjie Wu; Chao Li; | In light of this, in this paper, we first reveal a gradient-based correlation between sensitivity analysis-based DNN interpreters and the generation process of adversarial examples, which indicates the Achilles’s heel of adversarial attacks and sheds light on linking together the two long-standing challenges of DNN: fragility and unexplainability. We then propose an interpreter-based ensemble framework called X-Ensemble for robust adversary defense. X-Ensemble adopts a novel detection-rectification process and features in building multiple sub-detectors and a rectifier upon various types of interpretation information toward target classifiers. |
3 | Higher-order Clustering in Complex Heterogeneous Networks | Aldo G. Carranza; Ryan A. Rossi; Anup Rao; Eunyee Koh; | In this work, we propose a framework for higher-order spectral clustering in heterogeneous networks through the notions of typed graphlets and typed-graphlet conductance. |
4 | Preserving Dynamic Attention for Long-Term Spatial-Temporal Prediction | Haoxing Lin; Rufan Bai; Weijia Jia; Xinyu Yang; Yongjian You; | To address these issues, we propose a Dynamic Switch-Attention Network (DSAN) with a novel Multi-Space Attention (MSA) mechanism that measures the correlations between inputs and outputs explicitly. |
5 | Learning to Extract Attribute Value from Product via Question Answering: A Multi-task Approach | Qifan Wang; Li Yang; Bhargav Kanagal; Sumit Sanghai; D. Sivakumar; Bin Shu; Zac Yu; Jon Elsas; | In this work, we propose a novel approach for Attribute Value Extraction via Question Answering (AVEQA) using a multi-task framework. |
6 | Kernel Assisted Learning for Personalized Dose Finding | Liangyu Zhu; Wenbin Lu; Michael R. Kosorok; Rui Song; | In this article, we propose a kernel assisted learning method for estimating the optimal individualized dose rule. |
7 | Graph Structure Learning for Robust Graph Neural Networks | Wei Jin; Yao Ma; Xiaorui Liu; Xianfeng Tang; Suhang Wang; Jiliang Tang; | Therefore, in this paper, we explore these properties to defend adversarial attacks on graphs. |
8 | An Efficient Neighborhood-based Interaction Model for Recommendation on Heterogeneous Graph | Jiarui Jin; Jiarui Qin; Yuchen Fang; Kounianhua Du; Weinan Zhang; Yong Yu; Zheng Zhang; Alexander J. Smola; | In this paper, we propose an end-to-end Neighborhood-based Interaction Model for Recommendation (NIRec) to address above problems. |
9 | Directional Multivariate Ranking | Nan Wang; Hongning Wang; | In this work, we propose a directional multi-aspect ranking criterion to enable a holistic ranking of items with respect to multiple aspects. |
10 | Truth Discovery against Strategic Sybil Attack in Crowdsourcing | Yue Wang; Ke Wang; Chunyan Miao; | In this paper, we propose a novel approach, called TDSSA (Truth Discovery against Strategic Sybil Attack), to defend against strategic Sybil attack. |
11 | Partial Multi-Label Learning via Probabilistic Graph Matching Mechanism | Gengyu Lyu; Songhe Feng; Yidong Li; | In this paper, we interpret such assignments as instance-to-label matchings, and formulate the task of PML as a matching selection problem. |
12 | Spectrum-Guided Adversarial Disparity Learning | Zhe Liu; Lina Yao; Lei Bai; Xianzhi Wang; Can Wang; | In this work, we propose a novel end-to-end knowledge directed adversarial learning framework, which portrays the class-conditioned intraclass disparity using two competitive encoding distributions and learns the purified latent codes by denoising learned disparity. |
13 | Attention and Memory-Augmented Networks for Dual-View Sequential Learning | Yong He; Cheng Wang; Nan Li; Zhenyu Zeng; | We develop an AMANet (Attention and Memory-Augmented Networks) architecture by integrating both attention and memory to solve asynchronous multi-view learning problem in general, and we focus on experiments in dual-view sequences in this paper. |
14 | Semantic Search in Millions of Equations | Lukas Pfahler; Katharina Morik; | Hence, we propose a new approach for retrieval of mathematical expressions based on machine learning. To train our models, we collect a huge dataset with over 29 million mathematical expressions from over 900,000 publications published on arXiv.org. |
15 | SSumM: Sparse Summarization of Massive Graphs | Kyuhan Lee; Hyeonsoo Jo; Jihoon Ko; Sungsu Lim; Kijung Shin; | In this work, we propose SSumM, a scalable and effective graph-summarization algorithm that yields a sparse summary graph. |
16 | Rethinking Pruning for Accelerating Deep Inference At the Edge | Dawei Gao; Xiaoxi He; Zimu Zhou; Yongxin Tong; Ke Xu; Lothar Thiele; | To rectify such drawbacks, we propose entropy-based pruning, a new regularizer that can be seamlessly integrated into existing network pruning algorithms. |
17 | Compositional Embeddings Using Complementary Partitions for Memory-Efficient Recommendation Systems | Hao-Jun Michael Shi; Dheevatsa Mudigere; Maxim Naumov; Jiyan Yang; | We propose a novel approach for reducing the embedding size in an end-to-end fashion by exploiting complementary partitions of the category set to produce a unique embedding vector for each category without explicit definition. |
18 | Structural Patterns and Generative Models of Real-world Hypergraphs | Manh Tuan Do; Se-eun Yoon; Bryan Hooi; Kijung Shin; | In this work, we empirically study a number of real-world hypergraph datasets across various domains. |
19 | Efficient Algorithm for the b-Matching Graph | Yasuhiro Fujiwara; Atsutoshi Kumagai; Sekitoshi Kanai; Yasutoshi Ida; Naonori Ueda; | Our proposal, b-dash, can efficiently construct a b-matching graph because of its two key techniques: (1) it prunes unnecessary update messages in determining edges and (2) it incrementally computes edge weights by exploiting the Sherman-Morrison formula. |
20 | Isolation Distributional Kernel: A New Tool for Kernel based Anomaly Detection | Kai Ming Ting; Bi-Cun Xu; Takashi Washio; Zhi-Hua Zhou; | We introduce Isolation Distributional Kernel as a new way to measure the similarity between two distributions. |
21 | NodeAug: Semi-Supervised Node Classification with Data Augmentation | Yiwei Wang; Wei Wang; Yuxuan Liang; Yujun Cai; Juncheng Liu; Bryan Hooi; | By using Data Augmentation (DA), we present a new method to enhance Graph Convolutional Networks (GCNs), that are the state-of-the-art models for semi-supervised node classification. |
22 | An Embarrassingly Simple Approach for Trojan Attack in Deep Neural Networks | Ruixiang Tang; Mengnan Du; Ninghao Liu; Fan Yang; Xia Hu; | In this paper, we investigate a specific security problem called trojan attack, which aims to attack deployed DNN systems relying on the hidden trigger patterns inserted by malicious hackers. |
23 | Kronecker Attention Networks | Hongyang Gao; Zhengyang Wang; Shuiwang Ji; | In this work, we propose to avoid flattening by assuming the data follow matrix-variate normal distributions. |
24 | GRACE: Generating Concise and Informative Contrastive Sample to Explain Neural Network Model’s Prediction | Thai Le; Suhang Wang; Dongwon Lee; | To mitigate this limitation, therefore, we borrow two notable ideas (i.e., "explanation by intervention" from causality and "explanation are contrastive" from philosophy) and propose a novel solution, named as GRACE, that better explains neural network models’ predictions for tabular datasets. |
25 | Hierarchical Attention Propagation for Healthcare Representation Learning | Muhan Zhang; Christopher R. King; Michael Avidan; Yixin Chen; | In this paper, we propose Hierarchical Attention Propagation (HAP), a novel medical ontology embedding model that hierarchically propagate attention across the entire ontology structure, where a medical concept adaptively learns its embedding from all other concepts in the hierarchy instead of only its ancestors. |
26 | SCE: Scalable Network Embedding from Sparsest Cut | Shengzhong Zhang; Zengfeng Huang; Haicang Zhou; Ziang Zhou; | In this paper, we propose SCE for unsupervised network embedding only using negative samples for training. |
27 | Local Community Detection in Multiple Networks | Dongsheng Luo; Yuchen Bian; Yaowei Yan; Xiao Liu; Jun Huan; Xiang Zhang; | In this paper, we propose a novel RWM (Random Walk in Multiple networks) model to find relevant local communities in all networks for a given query node set from one network. |
28 | A Block Decomposition Algorithm for Sparse Optimization | Ganzhao Yuan; Li Shen; Wei-Shi Zheng; | This paper considers a new block decomposition algorithm that combines the effectiveness of combinatorial search methods and the efficiency of coordinate descent methods. |
29 | Adversarial Infidelity Learning for Model Interpretation | Jian Liang; Bing Bai; Yuren Cao; Kun Bai; Fei Wang; | In this paper, we propose a Model-agnostic Effective Efficient Direct (MEED) IFS framework for model interpretation, mitigating concerns about sanity, combinatorial shortcuts, model identifiability, and information transmission. |
30 | Grounding Visual Concepts for Zero-Shot Event Detection and Event Captioning | Zhihui Li; Xiaojun Chang; Lina Yao; Shirui Pan; Ge Zongyuan; Huaxiang Zhang; | Accordingly, in this paper, we propose a method of grounding visual concepts for large-scale Multimedia Event Detection (MED) and Multimedia Event Captioning (MEC) in zero-shot setting. |
31 | How to Count Triangles, without Seeing the Whole Graph | Suman K. Bera; C. Seshadhri; | Despite these challenges, we design a provable and practical algorithm, TETRIS, for triangle counting in this model. |
32 | Incremental Lossless Graph Summarization | Jihoon Ko; Yunbum Kook; Kijung Shin; | In this work, we propose MoSSo, the first incremental algorithm for lossless summarization of fully dynamic graphs. |
33 | From Online to Non-i.i.d. Batch Learning | Yufei Tao; Shangqi Lu; | We present a set of techniques to utilize an online algorithm as a black box to perform batch learning in the absence of the i.i.d. assumption. |
34 | Towards Deeper Graph Neural Networks | Meng Liu; Hongyang Gao; Shuiwang Ji; | In this work, we study this observation systematically and develop new insights towards deeper graph neural networks. |
35 | Laplacian Change Point Detection for Dynamic Graphs | Shenyang Huang; Yasmeen Hitti; Guillaume Rabusseau; Reihaneh Rabbany; | In this paper, we focus on change point detection in dynamic graphs and address two main challenges associated with this problem: I) how to compare graph snapshots across time, II) how to capture temporal dependencies. |
36 | Learning Transferrable Parameters for Long-tailed Sequential User Behavior Modeling | Jianwen Yin; Chenghao Liu; Weiqing Wang; Jianling Sun; Steven C.H. Hoi; | In this work, we argue that focusing on tail users could bring more benefits and address the long tails issue by learning transferrable parameters from both optimization and feature perspectives. |
37 | TranSlider: Transfer Ensemble Learning from Exploitation to Exploration | Kuo Zhong; Ying Wei; Chun Yuan; Haoli Bai; Junzhou Huang; | In this paper, we introduce the concept of transfer ensemble learning, a new direction to tackle the over-fitting of transfer strategies. |
38 | InFoRM: Individual Fairness on Graph Mining | Jian Kang; Jingrui He; Ross Maciejewski; Hanghang Tong; | This paper presents the first principled study of Individual Fairness on gRaph Mining (InFoRM). |
39 | Local Motif Clustering on Time-Evolving Graphs | Dongqi Fu; Dawei Zhou; Jingrui He; | To bridge this gap, in this paper, we propose a novel framework, Local Motif Clustering on Time-Evolving Graphs (L-MEGA), which provides the evolution pattern of the local motif cluster in an effective and efficient way. |
40 | A Data-Driven Graph Generative Model for Temporal Interaction Networks | Dawei Zhou; Lecheng Zheng; Jiawei Han; Jingrui He; | To address these challenges, we propose an end-to-end deep generative framework named TagGen. |
41 | Recurrent Networks for Guided Multi-Attention Classification | Xin Dai; Xiangnan Kong; Tian Guo; John Boaz Lee; Xinyue Liu; Constance Moore; | In this paper, we study the problem of guided multi-attention classification, the goal of which is to achieve high accuracy under the dual constraints of (1) small sample size, and (2) multiple ROIs for each image. |
42 | Vulnerability vs. Reliability: Disentangled Adversarial Examples for Cross-Modal Learning | Chao Li; Haoteng Tang; Cheng Deng; Liang Zhan; Wei Liu; | In this paper, we propose novel Disentangled Adversarial examples for Cross-Modal learning, dubbed DACM. |
43 | XGNN: Towards Model-Level Explanations of Graph Neural Networks | Hao Yuan; Jiliang Tang; Xia Hu; Shuiwang Ji; | In this work, we propose a novel approach, known as XGNN, to interpret GNNs at the model-level. |
44 | CAST: A Correlation-based Adaptive Spectral Clustering Algorithm on Multi-scale Data | Xiang Li; Ben Kao; Caihua Shan; Dawei Yin; Martin Ester; | We propose the algorithm CAST that applies trace Lasso to regularize the coefficient matrix. |
45 | INPREM: An Interpretable and Trustworthy Predictive Model for Healthcare | Xianli Zhang; Buyue Qian; Shilei Cao; Yang Li; Hang Chen; Yefeng Zheng; Ian Davidson; | To address this, in this paper, we propose an interpretable and trustworthy predictive model~(INPREM) for healthcare. |
46 | Policy-GNN: Aggregation Optimization for Graph Neural Networks | Kwei-Herng Lai; Daochen Zha; Kaixiong Zhou; Xia Hu; | To address the above challenges, we propose Policy-GNN, a meta-policy framework that models the sampling procedure and message passing of GNNs into a combined learning process. |
47 | Malicious Attacks against Deep Reinforcement Learning Interpretations | Mengdi Huai; Jianhui Sun; Renqin Cai; Liuyi Yao; Aidong Zhang; | Specifically, we introduce the first study of the adversarial attacks against DRL interpretations, and propose an optimization framework based on which the optimal adversarial attack strategy can be derived. |
48 | Disentangled Self-Supervision in Sequential Recommenders | Jianxin Ma; Chang Zhou; Hongxia Yang; Peng Cui; Xin Wang; Wenwu Zhu; | In this paper, we study the problem of mining extra signals for supervision by looking at the longer-term future. |
49 | DETERRENT: Knowledge Guided Graph Attention Network for Detecting Healthcare Misinformation | Limeng Cui; Haeseung Seo; Maryam Tabar; Fenglong Ma; Suhang Wang; Dongwon Lee; | In this work, to address these shortcomings, we propose a novel knowledge guided graph attention network for detecting health misinformation better. |
50 | MultiImport: Inferring Node Importance in a Knowledge Graph from Multiple Input Signals | Namyong Park; Andrey Kan; Xin Luna Dong; Tong Zhao; Christos Faloutsos; | In this paper, we develop an end-to-end model MultiImport, which infers latent node importance from multiple, potentially overlapping, input signals. |
51 | Geodesic Forests | Meghana Madhyastha; Gongkai Li; Veronika Strnadová-Neeley; James Browne; Joshua T. Vogelstein; Randal Burns; Carey E. Priebe; | We propose an unsupervised random forest approach called geodesic forests (GF) to geodesic distance estimation in linear and nonlinear manifolds with noise. |
52 | Z-Miner: An Efficient Method for Mining Frequent Arrangements of Event Intervals | Zed Lee; Tony Lindgren; Panagiotis Papapetrou; | In this paper, we propose Z-Miner, a novel algorithm for solving this problem that addresses the deficiencies of existing competitors by employing two novel data structures: Z-Table, a hierarchical hash-based data structure for time-efficient candidate generation and support count, and Z-Arrangement, a data structure for efficient memory consumption. |
53 | Imputing Various Incomplete Attributes via Distance Likelihood Maximization | Shaoxu Song; Yu Sun; | In this paper, we propose to study the distance models that predict distances between tuples for missing data imputation. |
54 | WeightGrad: Geo-Distributed Data Analysis Using Quantization for Faster Convergence and Better Accuracy | Syeda Nahida Akter; Muhammad Abdullah Adnan; | Our goal in this work is to design a geo-distributed Deep-Learning system that (1) ensures efficient and faster communication over LAN and WAN and (2) maintain accuracy and convergence for complex DNNs with billions of parameters. |
55 | Feature-Induced Manifold Disambiguation for Multi-View Partial Multi-label Learning | Jing-Han Wu; Xuan Wu; Qing-Guo Chen; Yao Hu; Min-Ling Zhang; | Accordingly, the problem of multi-view partial multi-label learning (MVPML) is studied in this paper, where each example is assumed to be presented by multiple feature vectors while associated with multiple candidate labels which are only partially valid. |
56 | MinSearch: An Efficient Algorithm for Similarity Search under Edit Distance | Haoyu Zhang; Qin Zhang; | In this paper we propose a novel algorithm for edit similarity search named MinSearch. |
57 | Mining Large Quasi-cliques with Quality Guarantees from Vertex Neighborhoods | Aritra Konar; Nicholas D. Sidiropoulos; | In this work, we formally establish that two recurring characteristics of real-world graphs, namely heavy-tailed degree distributions and large clustering coefficients, imply the existence of substantially large vertex neighborhoods with high edge-density. |
58 | Residual Correlation in Graph Neural Network Regression | Junteng Jia; Austion R. Benson; | Here, we address this problem with an interpretable and efficient framework that can improve any graph neural network architecture simply by exploiting correlation structure in the regression residuals. |
59 | Towards Fair Truth Discovery from Biased Crowdsourced Answers | Yanying Li; Haipei Sun; Wendy Hui Wang; | To address this challenge, in this paper, first, we define a new fairness notion named θ-disparity for truth discovery. Intuitively, ?-disparity bounds the difference in the probabilities that the truth of both protected and unprotected groups being predicted to be positive. Second, we design three fairness enhancing methods, namely Pre-TD, FairTD, and Post-TD, for truth discovery. |
60 | AutoShuffleNet: Learning Permutation Matrices via an Exact Lipschitz Continuous Penalty in Deep Convolutional Neural Networks | Jiancheng Lyu; Shuai Zhang; Yingyong Qi; Jack Xin; | In this paper, we propose to automate channel shuffling by learning permutation matrices in network training. |
61 | MoFlow: An Invertible Flow Model for Generating Molecular Graphs | Chengxi Zang; Fei Wang; | In this paper, we propose MoFlow, a flow-based graph generative model to learn invertible mappings between molecular graphs and their latent representations. |
62 | Parallel DNN Inference Framework Leveraging a Compact RISC-V ISA-based Multi-core System | Yipeng Zhang; Bo Du; Lefei Zhang; Jia Wu; | Accordingly, this paper proposes a collaborative RISC-V multi-core system for Deep Neural Network (DNN) accelerators. |
63 | Missing Value Imputation for Mixed Data via Gaussian Copula | Yuxuan Zhao; Madeleine Udell; | This paper proposes a new semiparametric algorithm to impute missing values, with no tuning parameters. |
64 | HiTANet: Hierarchical Time-Aware Attention Networks for Risk Prediction on Electronic Health Records | Junyu Luo; Muchao Ye; Cao Xiao; Fenglong Ma; | To leverage time information for risk prediction in a more reasonable way, we propose a new hierarchical time-aware attention network, named HiTANet, which imitates the decision making process of doctors inrisk prediction. |
65 | Personalized PageRank to a Target Node, Revisited | Hanzhi Wang; Zhewei Wei; Junhao Gan; Sibo Wang; Zengfeng Huang; | In this paper, we consider the single-target PPR query, which measures the opposite direction of importance for PPR. |
66 | Edge-consensus Learning: Deep Learning on P2P Networks with Nonhomogeneous Data | Kenta Niwa; Noboru Harada; Guoqiang Zhang; W. Bastiaan Kleijn; | An effective Deep Neural Network (DNN) optimization algorithm that can use decentralized data sets over a peer-to-peer (P2P) network is proposed. |
67 | Deep Learning of High-Order Interactions for Protein Interface Prediction | Yi Liu; Hao Yuan; Lei Cai; Shuiwang Ji; | In this work, we propose to formulate the protein interface prediction as a 2D dense prediction problem. |
68 | MAMO: Memory-Augmented Meta-Optimization for Cold-start Recommendation | Manqing Dong; Feng Yuan; Lina Yao; Xiwei Xu; Liming Zhu; | In this paper, we design two memory matrices that can store task-specific memories and feature-specific memories. |
69 | Finding Effective Geo-social Group for Impromptu Activities with Diverse Demands | Lu Chen; Chengfei Liu; Rui Zhou; Jiajie Xu; Jeffrey Xu Yu; Jianxin Li; | In this paper, we propose a novel geo-social group model, equipped with elegant keyword constraints, to fill this gap. |
70 | Representing Temporal Attributes for Schema Matching | Yinan Mei; Shaoxu Song; Yunsu Lee; Jungho Park; Soo-Hyung Kim; Sungmin Yi; | In this paper, we argue to order the values in an attribute A by some time attribute T as a time series. |
71 | Estimating Properties of Social Networks via Random Walk considering Private Nodes | Kazuki Nakajima; Kazuyuki Shudo; | Here we design random walk-based algorithms to accurately estimate properties without any problems caused by private nodes. |
72 | ASGN: An Active Semi-supervised Graph Neural Network for Molecular Property Prediction | Zhongkai Hao; Chengqiang Lu; Zhenya Huang; Hao Wang; Zheyuan Hu; Qi Liu; Enhong Chen; Cheekong Lee; | Here we propose a novel framework called Active Semi-supervised Graph Neural Network (ASGN) by incorporating both labeled and unlabeled molecules. |
73 | Connecting the Dots: Multivariate Time Series Forecasting with Graph Neural Networks | Zonghan Wu; Shirui Pan; Guodong Long; Jing Jiang; Xiaojun Chang; Chengqi Zhang; | In this paper, we propose a general graph neural network framework designed specifically for multivariate time series data. |
74 | Learning Opinion Dynamics From Social Traces | Corrado Monti; Gianmarco De Francisci Morales; Francesco Bonchi; | In this work we propose an inference mechanism for fitting a generative, agent-like model of opinion dynamics to real-world social traces. |
75 | Enterprise Cooperation and Competition Analysis with a Sign-Oriented Preference Network | Le Dai; Yu Yin; Chuan Qin; Tong Xu; Xiangnan He; Enhong Chen; Hui Xiong; | To this end, in this paper, we provide a large-scale data driven analysis on the cooperative and competitive relationships among companies in a Sign-oriented Preference Network (SOPN). |
76 | BLOB: A Probabilistic Model for Recommendation that Combines Organic and Bandit Signals | Otmane Sakhi; Stephen Bonner; David Rohde; Flavian Vasile; | In this paper, we propose Bayesian Latent Organic Bandit model (BLOB), a probabilistic approach to combine the ‘organic’ and ‘bandit’ signals in order to improve the estimation of recommendation quality. |
77 | AutoST: Efficient Neural Architecture Search for Spatio-Temporal Prediction | Ting Li; Junbo Zhang; Kainan Bao; Yuxuan Liang; Yexin Li; Yu Zheng; | In this paper, we study Neural Architecture Search (NAS) for spatio-temporal prediction and propose an efficient spatio-temporal neural architecture search method, entitled AutoST. |
78 | COMPOSE: Cross-Modal Pseudo-Siamese Network for Patient Trial Matching | Junyi Gao; Cao Xiao; Lucas M. Glass; Jimeng Sun; | In this paper, we proposed CrOss-Modal PseudO-SiamEse network (COMPOSE) to address these challenges for patient-trial matching. |
79 | Discovering Succinct Pattern Sets Expressing Co-Occurrence and Mutual Exclusivity | Jonas Fischer; Jilles Vreeken; | As the search space for the optimal model is enormous and unstructured, we propose Mexican, a heuristic algorithm to efficiently discover high quality sets of patterns of co-occurences and mutual exclusivity. |
80 | TIPRDC: Task-Independent Privacy-Respecting Data Crowdsourcing Framework for Deep Learning with Anonymized Intermediate Representations | Ang Li; Yixiao Duan; Huanrui Yang; Yiran Chen; Jianlei Yang; | To tackle the case where the learning task may be unknown or changing, we present TIPRDC, a task-independent privacy-respecting data crowdsourcing framework with anonymized intermediate representation. |
81 | AutoGrow: Automatic Layer Growing in Deep Convolutional Networks | Wei Wen; Feng Yan; Yiran Chen; Hai Li; | We propose robust growing and stopping policies to generalize to different network architectures and datasets. |
82 | Curb-GAN: Conditional Urban Traffic Estimation through Spatio-Temporal Generative Adversarial Networks | Yingxue Zhang; Yanhua Li; Xun Zhou; Xiangnan Kong; Jun Luo; | To tackle these challenges, we propose a novel Conditional Urban Traffic Generative Adversarial Network (Curb-GAN), which provides traffic estimations in consecutive time slots based on different (unprecedented) travel demands, thus enables urban planners to accurately evaluate urban plans before deploying them. |
83 | Incremental Mobile User Profiling: Reinforcement Learning with Spatial Knowledge Graph for Modeling Event Streams | Pengyang Wang; Kunpeng Liu; Lu Jiang; Xiaolin Li; Yanjie Fu; | We propose to formulate the problem into a reinforcement learning task, where an agent is a next-visit planner, an action is a POI that a user will visit next, and the state of environment is a fused representation of a user and spatial entities (e.g., POIs, activity types, functional zones). |
84 | Identifying Sepsis Subphenotypes via Time-Aware Multi-Modal Auto-Encoder | Changchang Yin; Ruoqi Liu; Dongdong Zhang; Ping Zhang; | However, most sepsis subtyping studies ignore the temporality of EHR data and suffer from missing values. In this paper, we propose a new sepsis subtyping framework to address the two issues. |
85 | A Causal Look at Statistical Definitions of Discrimination | Elias Chaibub Neto; | Here, we investigate these fairness criteria from a causality perspective. |
86 | Targeted Data-driven Regularization for Out-of-Distribution Generalization | Mohammad Mahdi Kamani; Sadegh Farhang; Mehrdad Mahdavi; James Z. Wang; | In this paper, we propose a unified data-driven regularization approach to learn a generalizable model from biased data. |
87 | Neural Dynamics on Complex Networks | Chengxi Zang; Fei Wang; | To address these challenges, we propose to combine Ordinary Differential Equation Systems (ODEs) and Graph Neural Networks (GNNs) to learn continuous-time dynamics on complex networks in a data-driven manner. |
88 | Grammatically Recognizing Images with Tree Convolution | Guangrun Wang; Guangcong Wang; Keze Wang; Xiaodan Liang; Liang Lin; | Attempting to tackle this problem, this paper proposes a simple yet effective tree convolution (TreeConv) operation for deep neural networks. |
89 | Generic Outlier Detection in Multi-Armed Bandit | Yikun Ban; Jingrui He; | In this paper, we study the problem of outlier arm detection in multi-armed bandit settings, which finds plenty of applications in many high-impact domains such as finance, healthcare, and online advertising. |
90 | Robust Spammer Detection by Nash Reinforcement Learning | Yingtong Dou; Guixiang Ma; Philip S. Yu; Sihong Xie; | To address the challenges, we formulate a minimax game where the spammers and spam detectors compete with each other on their practical goals that are not solely based on detection accuracy. |
91 | Mining Persistent Activity in Continually Evolving Networks | Caleb Belth; Xinyi Zheng; Danai Koutra; | In this work, we propose the problem of mining activity that persists through time in continually evolving networks-i.e., activity that repeatedly and consistently occurs. |
92 | Towards Automated Neural Interaction Discovery for Click-Through Rate Prediction | Qingquan Song; Dehua Cheng; Hanning Zhou; Jiyan Yang; Yuandong Tian; Xia Hu; | To address these challenges, we propose an automated interaction architecture discovering framework for CTR prediction named AutoCTR. |
93 | High-Dimensional Similarity Search with Quantum-Assisted Variational Autoencoder | Nicholas Gao; Max Wilson; Thomas Vandal; Walter Vinci; Ramakrishna Nemani; Eleanor Rieffel; | We show how to construct a space-efficient search index based on the latent space representation of a QVAE. |
94 | Off-policy Bandits with Deficient Support | Noveen Sachdeva; Yi Su; Thorsten Joachims; | To overcome this gap between theory and applications, we identify three approaches that provide various guarantees for IPS-based learning despite the inherent limitations of support-deficient data: restricting the action space, reward extrapolation, and restricting the policy space. |
95 | Adaptive Graph Encoder for Attributed Graph Embedding | Ganqu Cui; Jie Zhou; Cheng Yang; Zhiyuan Liu; | To address these issues, we propose Adaptive Graph Encoder (AGE), a novel attributed graph embedding framework. |
96 | NetTrans: Neural Cross-Network Transformation | Si Zhang; Hanghang Tong; Yinglong Xia; Liang Xiong; Jiejun Xu; | In this paper, we address these limitations and tackle cross-network node associations from a new angle, i.e., cross-network transformation. |
97 | Redundancy-Free Computation for Graph Neural Networks | Zhihao Jia; Sina Lin; Rex Ying; Jiaxuan You; Jure Leskovec; Alex Aiken; | Here we propose Hierarchically Aggregated computation Graphs(HAGs), a new GNN representation technique that explicitly avoids redundancy by managing intermediate aggregation results hierarchically and eliminates repeated computations and unnecessary data transfers in GNN training and inference. |
98 | Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion | Kun Zhou; Wayne Xin Zhao; Shuqing Bian; Yuanhang Zhou; Ji-Rong Wen; Jingsong Yu; | To address these issues, we incorporate both word-oriented and entity-oriented knowledge graphs~(KG) to enhance the data representations in CRSs, and adopt Mutual Information Maximization to align the word-level and entity-level semantic spaces. |
99 | Sliding Sketches: A Framework using Time Zones for Data Stream Processing in Sliding Windows | Xiangyang Gou; Long He; Yinda Zhang; Ke Wang; Xilai Liu; Tong Yang; Yi Wang; Bin Cui; | In this paper, we propose a generic framework, namely Sliding sketches, which can be applied to many existing solutions for the above three queries, and enable them to support queries in sliding windows. |
100 | STEAM: Self-Supervised Taxonomy Expansion with Mini-Paths | Yue Yu; Yinghao Li; Jiaming Shen; Hao Feng; Jimeng Sun; Chao Zhang; | We propose a self-supervised taxonomy expansion model named STEAM, which leverages natural supervision in the existing taxonomy for expansion. |
101 | Probabilistic Metric Learning with Adaptive Margin for Top-K Recommendation | Chen Ma; Liheng Ma; Yingxue Zhang; Ruiming Tang; Xue Liu; Mark Coates; | To tackle this, we develop a distance-based recommendation model with several novel aspects: (i) each user and item are parameterized by Gaussian distributions to capture the learning uncertainties; (ii) an adaptive margin generation scheme is proposed to generate the margins regarding different training triplets; (iii) explicit user-user/item-item similarity modeling is incorporated in the objective function. |
102 | Re-identification Attack to Privacy-Preserving Data Analysis with Noisy Sample-Mean | Du Su; Hieu Tri Huynh; Ziao Chen; Yi Lu; Wenmiao Lu; | This paper studies the hazard of re-identification of entire class caused by revealing a noisy sample mean of the class. |
103 | BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision | Chen Liang; Yue Yu; Haoming Jiang; Siawpeng Er; Ruijia Wang; Tuo Zhao; Chao Zhang; | To address this challenge, we propose a new computational framework — BOND, which leverages the power of pre-trained language models (e.g., BERT and RoBERTa) to improve the prediction performance of NER models. |
104 | Graph Structural-topic Neural Network | Qingqing Long; Yilun Jin; Guojie Song; Yi Li; Wei Lin; | Correspondingly, in this paper, we propose Graph Structural topic Neural Network, abbreviated GraphSTONE 1, a GCN model that utilizes topic models of graphs, such that the structural topics capture indicative graph structures broadly from a probabilistic aspect rather than merely a few structures. |
105 | Correlation Networks for Extreme Multi-label Text Classification | Guangxu Xun; Kishlay Jha; Jianhui Sun; Aidong Zhang; | This paper develops the Correlation Networks (CorNet) architecture for the extreme multi-label text classification (XMTC) task, where the objective is to tag an input text sequence with the most relevant subset of labels from an extremely large label set. |
106 | Predicting Temporal Sets with Deep Neural Networks | Le Yu; Leilei Sun; Bowen Du; Chuanren Liu; Hui Xiong; Weifeng Lv; | In this paper, we propose an integrated solution based on the deep neural networks for temporal sets prediction. |
107 | FreeDOM: A Transferable Neural Architecture for Structured Information Extraction on Web Documents | Bill Yuchen Lin; Ying Sheng; Nguyen Vo; Sandeep Tata; | In this paper, we present a novel two-stage neural approach, named FreeDOM, which overcomes both these limitations. |
108 | SEAL: Learning Heuristics for Community Detection with Generative Adversarial Networks | Yao Zhang; Yun Xiong; Yun Ye; Tengfei Liu; Weiqiang Wang; Yangyong Zhu; Philip S. Yu; | In this paper, we instead study the semi-supervised community detection problem where we are given several communities in a network as training data and aim to discover more communities. |
109 | Matrix Profile XXI: A Geometric Approach to Time Series Chains Improves Robustness | Makoto Imamura; Takaaki Nakamura; Eamonn Keogh; | Inspired by observations from dynamical systems theory, this paper introduces two novel quality metrics for time series chains, directionality and graduality, to improve robustness and to enable top-K search. |
110 | Retrospective Loss: Looking Back to Improve Training of Deep Neural Networks | Surgan Jandial; Ayush Chopra; Mausoom Sarkar; Piyush Gupta; Balaji Krishnamurthy; Vineeth Balasubramanian; | In this work, we introduce a new retrospective loss to improve the training of deep neural network models by utilizing the prior experience available in past model states during training. |
111 | Average Sensitivity of Spectral Clustering | Pan Peng; Yuichi Yoshida; | To make reliable and efficient decisions based on spectral clustering, we assess the stability of spectral clustering against edge perturbations in the input graph using the notion of average sensitivity, which is the expected size of the symmetric difference of the output clusters before and after we randomly remove edges. |
112 | Semi-Supervised Multi-Label Learning from Crowds via Deep Sequential Generative Model | Wanli Shi; Victor S. Sheng; Xiang Li; Bin Gu; | In this paper, we propose a deep generative model to describe the label generation process for this semi-supervised multi-label learning problem. |
113 | GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training | Jiezhong Qiu; Qibin Chen; Yuxiao Dong; Jing Zhang; Hongxia Yang; Ming Ding; Kuansan Wang; Jie Tang; | We design GCC’s pre-training task as subgraph instance discrimination in and across networks and leverage contrastive learning to empower graph neural networks to learn the intrinsic and transferable structural representations. |
114 | HGCN: A Heterogeneous Graph Convolutional Network-Based Deep Learning Model Toward Collective Classification | Zhihua Zhu; Xinxin Fan; Xiaokai Chu; Jingping Bi; | To address the challenges, in this paper, we propose a novel heterogeneous graph convolutional network-based deep learning model, called HGCN, to collectively categorize the entities in HINs. |
115 | Handling Information Loss of Graph Neural Networks for Session-based Recommendation | Tianwen Chen; Raymond Chi-Wing Wong; | To solve the first problem, we propose a lossless encoding scheme and an edge-order preserving aggregation layer based on GRU that is dedicatedly designed to process the losslessly encoded graphs. |
116 | Ultrafast Local Outlier Detection from a Data Stream with Stationary Region Skipping | Susik Yoon; Jae-Gil Lee; Byung Suk Lee; | We propose a new algorithm, abbr. STARE, which identifies local regions in which data distributions hardly change and then skips updating the densities in those regions-a notion called stationary region skipping. |
117 | LayoutLM: Pre-training of Text and Layout for Document Image Understanding | Yiheng Xu; Minghao Li; Lei Cui; Shaohan Huang; Furu Wei; Ming Zhou; | In this paper, we propose the LayoutLM to jointly model interactions between text and layout information across scanned document images, which is beneficial for a great number of real-world document image understanding tasks such as information extraction from scanned documents. |
118 | Block Model Guided Unsupervised Feature Selection | Zilong Bai; Hoa Nguyen; Ian Davidson; | Here we take the novel approach of first building a block model on the graph and then using the block model for feature selection. |
119 | Data Compression as a Comprehensive Framework for Graph Drawing and Representation Learning | Claudia Plant; Sonja Biedermann; Christian Böhm; | Our fundamental idea is to compress the adjacency matrix by predicting the existence of an edge from the Euclidean distance between the corresponding vertices in the embedding, and to use the achieved compression as a quality measure for the embedding. |
120 | Joint Policy-Value Learning for Recommendation | Olivier Jeunen; David Rohde; Flavian Vasile; Martin Bompaire; | In this work, we conduct the first broad empirical study of counterfactual learning methods for recommendation, in a simulated environment. |
121 | FedFast: Going Beyond Average for Faster Training of Federated Recommender Systems | Khalil Muhammad; Qinqin Wang; Diarmuid O’Reilly-Morgan; Elias Tragos; Barry Smyth; Neil Hurley; James Geraci; Aonghus Lawlor; | We present a novel technique, FedFast, to accelerate distributed learning which achieves good accuracy for all users very early in the training process. |
122 | AM-GCN: Adaptive Multi-channel Graph Convolutional Networks | Xiao Wang; Meiqi Zhu; Deyu Bo; Peng Cui; Chuan Shi; Jian Pei; | We tackle the challenge and propose an adaptive multi-channel graph convolutional networks for semi-supervised classification (AM-GCN). |
123 | Discovering Approximate Functional Dependencies using Smoothed Mutual Information | Frédéric Pennerath; Panagiotis Mandros; Jilles Vreeken; | In this paper, we consider a different correction strategy and counter data sparsity using uniform priors and smoothing techniques, that leads to an efficient and robust estimating process. |
124 | Competitive Analysis for Points of Interest | Shuangli Li; Jingbo Zhou; Tong Xu; Hao Liu; Xinjiang Lu; Hui Xiong; | To this end, in this paper, we study how to predict the POI competitive relationship. |
125 | HOPS: Probabilistic Subtree Mining for Small and Large Graphs | Pascal Welke; Florian Seiffarth; Michael Kamp; Stefan Wrobel; | In this paper, we adapt sampling techniques from mathematical combinatorics to the problem of probabilistic subtree mining in arbitrary databases of many small to medium-size graphs or a single large graph. |
126 | The NodeHopper: Enabling Low Latency Ranking with Constraints via a Fast Dual Solver | Anton Zhernov; Krishnamurthy Dj Dvijotham; Ivan Lobov; Dan A. Calian; Michelle Gong; Natarajan Chandrashekar; Timothy A. Mann; | To address this challenge, we exploit the structure of the dual optimization problem to develop a fast solver. |
127 | HGMF: Heterogeneous Graph-based Fusion for Multimodal Data with Incompleteness | Jiayi Chen; Aidong Zhang; | We propose a Heterogeneous Graph-based Multimodal Fusion (HGMF) approach to enable multimodal fusion of incomplete data within a heterogeneous graph structure. |
128 | ST-SiameseNet: Spatio-Temporal Siamese Networks for Human Mobility Signature Identification | Huimin Ren; Menghai Pan; Yanhua Li; Xun Zhou; Jun Luo; | To deal with this challenge, in this work, we make the first attempt to match identities of human agents only from the observed location trajectory data by proposing a novel and efficient framework named Spatio-temporal Siamese Networks (ST-SiameseNet). |
129 | A Novel Deep Learning Model by Stacking Conditional Restricted Boltzmann Machine and Deep Neural Network | Tianyu Kang; Ping Chen; John Quackenbush; Wei Ding; | Similar to Convolution Neural Network dealing with spatially correlated features and Recurrent Neural Network with temporally correlated features, in this paper we present a novel deep learning model to tackle functionally interactive features by stacking a Conditional Restricted Boltzmann Machine and a Deep Neural Network (CRBM-DNN). |
130 | InfiniteWalk: Deep Network Embeddings as Laplacian Embeddings with a Nonlinearity | Sudhanshu Chanpuriya; Cameron Musco; | We study the objective in the limit as T goes to infinity, which allows us to simplify the expression of Qiu et al. |
131 | xGAIL: Explainable Generative Adversarial Imitation Learning for Explainable Human Decision Analysis | Menghai Pan; Weixiao Huang; Yanhua Li; Xun Zhou; Jun Luo; | This paper addresses this research gap by proposing xGAIL, the first explainable generative adversarial imitation learning framework. |
132 | Catalysis Clustering with GAN by Incorporating Domain Knowledge | Olga Andreeva; Wei Li; Wei Ding; Marieke Kuijjer; John Quackenbush; Ping Chen; | In this work we propose a GAN-based approach called Catalysis Clustering to incorporate domain knowledge into the clustering process. |
133 | Prediction and Profiling of Audience Competition for Online Television Series | Peng Zhang; Chuanren Liu; Kefeng Ning; Wenxiang Zhu; Yu Zhang; | In this paper, we develop a data-driven framework to model and predict audience competition patterns for popular online television series. |
134 | Multi-Class Data Description for Out-of-distribution Detection | Dongha Lee; Sehun Yu; Hwanjo Yu; | In this work, we present a deep multi-class data description, termed as Deep-MCDD, which is effective to detect out-of-distribution (OOD) samples as well as classify in-distribution (ID) samples. |
135 | In and Out: Optimizing Overall Interaction in Probabilistic Graphs under Clustering Constraints | Domenico Mandaglio; Andrea Tagarelli; Francesco Gullo; | We study two novel clustering problems in which the pairwise interactions between entities are characterized by probability distributions and conditioned by external factors within the environment where the entities interact. |
136 | Recurrent Halting Chain for Early Multi-label Classification | Thomas Hartvigsen; Cansu Sen; Xiangnan Kong; Elke Rundensteiner; | We design an effective solution to this open problem, the Recurrent Halting Chain (RHC), that for the first time integrates key innovations in both Early and Multi-label Classification into one multi-objective model. |
137 | Minimal Variance Sampling with Provable Guarantees for Fast Training of Graph Neural Networks | Weilin Cong; Rana Forsati; Mahmut Kandemir; Mehrdad Mahdavi; | In this paper, we theoretically analyze the variance of sampling methods and show that, due to the composite structure of empirical risk, the variance of any sampling method can be decomposed intoembedding approximation variance in the forward stage andstochastic gradient variance in the backward stage that necessities mitigating both types of variance to obtain faster convergence rate. |
138 | Discovering Functional Dependencies from Mixed-Type Data | Panagiotis Mandros; David Kaltenpoth; Mario Boley; Jilles Vreeken; | In this paper, we analyze these fundamental questions and derive formal criteria as to when a discretization process applied to a mixed set of random variables leads to consistent estimates of mutual information. |
139 | Attackability Characterization of Adversarial Evasion Attack on Discrete Data | Yutong Wang; Yufei Han; Hongyan Bao; Yun Shen; Fenglong Ma; Jin Li; Xiangliang Zhang; | Based on our attackability analysis, we propose a computationally efficient orthogonal matching pursuit-guided attack method for evasion attack on discrete data. |
140 | The Spectral Zoo of Networks: Embedding and Visualizing Networks with Spectral Moments | Shengmin Jin; Reza Zafarani; | We introduce a spectral embedding method for a network, its Spectral Point, which is basically the first few spectral moments of a network. |
141 | Unsupervised Differentiable Multi-aspect Network Embedding | Chanyoung Park; Carl Yang; Qi Zhu; Donghyun Kim; Hwanjo Yu; Jiawei Han; | In this paper, we propose a novel end-to-end framework for multi-aspect network embedding, called asp2vec, in which the aspects of each node are dynamically assigned based on its local context. |
142 | AutoML Pipeline Selection: Efficiently Navigating the Combinatorial Space | Chengrun Yang; Jicong Fan; Ziyang Wu; Madeleine Udell; | In this work, we design a new AutoML system TensorOboe to address this challenge: an automated system to design a supervised learning pipeline. |
143 | Towards Physics-informed Deep Learning for Turbulent Flow Prediction | Rui Wang; Karthik Kashinath; Mustafa Mustafa; Adrian Albert; Rose Yu; | In this paper, we aim to predict turbulent flow by learning its highly nonlinear dynamics from spatiotemporal velocity fields of large-scale fluid flow simulations of relevance to turbulence modeling and climate modeling. |
144 | Evaluating Fairness Using Permutation Tests | Cyrus DiCiccio; Sriram Vasudevan; Kinjal Basu; Krishnaram Kenthapadi; Deepak Agarwal; | We propose a permutation testing methodology that performs a hypothesis test that a model is fair across two groups with respect to any given metric. |
145 | Leveraging Model Inherent Variable Importance for Stable Online Feature Selection | Johannes Haug; Martin Pawelczyk; Klaus Broelemann; Gjergji Kasneci; | In this work, we introduce FIRES, a novel framework for online feature selection. By treating model parameters as random variables, we can penalize features with high uncertainty and thus generate more stable feature sets. |
146 | Multi-level Graph Convolutional Networks for Cross-platform Anchor Link Prediction | Hongxu Chen; Hongzhi YIN; Xiangguo Sun; Tong Chen; Bogdan Gabrys; Katarzyna Musial; | In this paper, to address this problem, we propose a novel framework that considers multi-level graph convolutions on both local network structure and hypergraph structure in a unified manner. |
147 | Evaluating Conversational Recommender Systems via User Simulation | Shuo Zhang; Krisztian Balog; | As an alternative, we propose automated evaluation by means of simulating users. |
148 | Measuring Model Complexity of Neural Networks with Curve Activation Functions | Xia Hu; Weiqing Liu; Jiang Bian; Jian Pei; | To tackle the challenge, in this paper, we first propose linear approximation neural network (LANN for short), a piecewise linear framework to approximate a given deep model with curve activation function. |
149 | Diverse Rule Sets | Guangyi Zhang; Aristides Gionis; | Here we propose a novel approach of inferring diverse rule sets, by optimizing small overlap among decision rules with a 2-approximation guarantee under the framework of Max-Sum diversification. |
150 | Vamsa: Automated Provenance Tracking in Data Science Scripts | Mohammad Hossein Namaki; Avrilia Floratou; Fotis Psallidas; Subru Krishnan; Ashvin Agrawal; Yinghui Wu; Yiwen Zhu; Markus Weimer; | In this work, we introduce the ML provenance tracking problem: the fundamental idea is to automatically track which columns in a dataset have been used to derive the features/labels of an ML model. |
151 | Deep State-Space Generative Model For Correlated Time-to-Event Predictions | Yuan Xue; Denny Zhou; Nan Du; Andrew M. Dai; Zhen Xu; Kun Zhang; Claire Cui; | In this work, we propose a deep latent state-space generative model to capture the interactions among different types of correlated clinical events (e.g., kidney failure, mortality) by explicitly modeling the temporal dynamics of patients’ latent states. |
152 | Meta-learning on Heterogeneous Information Networks for Cold-start Recommendation | Yuanfu Lu; Yuan Fang; Chuan Shi; | In MetaHIN, we propose a novel semantic-enhanced tasks constructor and a co-adaptation meta-learner to address the two questions. |
153 | WavingSketch: An Unbiased and Generic Sketch for Finding Top-k Items in Data Streams | Jizhou Li; Zikun Li; Yifei Xu; Shiqi Jiang; Tong Yang; Bin Cui; Yafei Dai; Gong Zhang; | In this paper, we propose a new sketch, WavingSketch, which is much more accurate than existing unbiased algorithms. |
154 | Dynamic Knowledge Graph based Multi-Event Forecasting | Songgaojun Deng; Huzefa Rangwala; Yue Ning; | In this paper, we study a temporal graph learning method with heterogeneous data fusion for predicting concurrent events of multiple types and inferring multiple candidate actors simultaneously. |
155 | A Geometric Approach to Predicting Bounds of Downstream Model Performance | Brian J. Goode; Debanjan Datta; | This paper presents the motivation and methodology for including model application criteria into baseline analysis. |
156 | Context-to-Session Matching: Utilizing Whole Session for Response Selection in Information-Seeking Dialogue Systems | Zhenxin Fu; Shaobo Cui; Mingyue Shang; Feng Ji; Dongyan Zhao; Haiqing Chen; Rui Yan; | In this paper, we consider the response and its context as a whole session and explore the task of matching the query’s context with the sessions. |
157 | HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units | Shenda Hong; Yanbo Xu; Alind Khare; Satria Priambada; Kevin Maher; Alaa Aljiffry; Jimeng Sun; Alexey Tumanov; | To address these challenges, we propose HOLMES—an online model ensemble serving framework for healthcare applications. |
158 | LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values | Kejing Yin; Ardavan Afshar; Joyce C. Ho; William K. Cheung; Chao Zhang; Jimeng Sun; | In this paper, we propose Logistic PARAFAC2 (LogPar) by modeling the binary irregular tensor with Bernoulli distribution parameterized by an underlying real-valued tensor. |
159 | RECORD: Resource Constrained Semi-Supervised Learning under Distribution Shift | Lan-Zhe Guo; Zhi Zhou; Yu-Feng Li; | This paper presents a systemic solution Record consisting of three sub-steps, that is, distribution tracking, sample selection and model updating. |
160 | Statistically Significant Pattern Mining with Ordinal Utility | Thien Q. Tran; Kazuto Fukuchi; Youhei Akimoto; Jun Sakuma; | Our study aims to introduce a preference relation into patterns and to discover the most preferred patterns under the constraint of statistical significance, which has never been considered in existing SSPM problems. |
161 | Certifiable Robustness of Graph Convolutional Networks under Structure Perturbations | Daniel Zügner; Stephan Günnemann; | In this work we close this gap and propose the first method to certify robustness of Graph Convolutional Networks (GCNs) under perturbations of the graph structure. |
162 | Understanding Negative Sampling in Graph Representation Learning | Zhen Yang; Ming Ding; Chang Zhou; Hongxia Yang; Jingren Zhou; Jie Tang; | To bridge the gap, we systematically analyze the role of negative sampling from the perspectives of both objective and risk, theoretically demonstrating that negative sampling is as important as positive sampling in determining the optimization objective and the resulted variance. |
163 | Aligning Superhuman AI with Human Behavior: Chess as a Model System | Reid McIlroy-Young; Siddhartha Sen; Jon Kleinberg; Ashton Anderson; | We develop and introduce Maia, a customized version of AlphaZero trained on human chess games, that predicts human moves at a much higher accuracy than existing engines, and can achieve maximum accuracy when predicting decisions made by players at a specific skill level in a tuneable way. |
164 | Heidegger: Interpretable Temporal Causal Discovery | Mehrdad Mansouri; Ali Arab; Zahra Zohrevand; Martin Ester; | Toward a new horizon, this study introduces the novel problem of Causal Profile Discovery, which is crucial for many applications such as adverse drug reaction and cyber-attack detection. |
165 | Interpretable Deep Graph Generation with Node-edge Co-disentanglement | Xiaojie Guo; Liang Zhao; Zhao Qin; Lingfei Wu; Amarda Shehu; Yanfang Ye; | To address these challenges, we propose a new disentanglement enhancement framework for deep generative models for attributed graphs. |
166 | Minimizing Localized Ratio Cut Objectives in Hypergraphs | Nate Veldt; Austin R. Benson; Jon Kleinberg; | Here we present a framework for local hypergraph clustering based on minimizing localized ratio cut objectives. |
167 | RECIPTOR: An Effective Pretrained Model for Recipe Representation Learning | Diya Li; Mohammed J. Zaki; | In this paper, we provide a joint approach for learning effective pretrained recipe embeddings using both the ingredients and cooking instructions. |
168 | Hyperbolic Distance Matrices | Puoya Tabaghi; Ivan Dokmanić; | In this paper, we propose a unified framework to compute hyperbolic embeddings from an arbitrary mix of noisy metric and non-metric data. |
169 | RayS: A Ray Searching Method for Hard-label Adversarial Attack | Jinghui Chen; Quanquan Gu; | In this paper, we present the Ray Searching attack (RayS), which greatly improves the hard-label attack effectiveness as well as efficiency. |
170 | On Sampled Metrics for Item Recommendation | Walid Krichene; Steffen Rendle; | We show that it is possible to improve the quality of the sampled metrics by applying a correction, obtained by minimizing different criteria such as bias or mean squared error. |
171 | ALO-NMF: Accelerated Locality-Optimized Non-negative Matrix Factorization | Gordon E. Moon; J. Austin Ellis; Aravind Sukumaran-Rajam; Srinivasan Parthasarathy; P. Sadayappan; | In this paper, we present a novel optimization method for parallel NMF algorithm based on the HALS (Hierarchical Alternating Least Squares) scheme that incorporates algorithmic transformations to enhance data locality. |
172 | Multi-Source Deep Domain Adaptation with Weak Supervision for Time-Series Sensor Data | Garrett Wilson; Janardhan Rao Doppa; Diane J. Cook; | However, robust techniques have not yet been considered for time series data with varying amounts of data availability. In this paper, we make three main contributions to fill this gap. |
173 | Counterfactual Evaluation of Slate Recommendations with Sequential Reward Interactions | James McInerney; Brian Brost; Praveen Chandar; Rishabh Mehrotra; Benjamin Carterette; | We propose a new counterfactual estimator that allows for sequential interactions in the rewards with lower variance in an asymptotically unbiased manner. |
174 | TAdaNet: Task-Adaptive Network for Graph-Enriched Meta-Learning | Qiuling Suo; Jingyuan Chou; Weida Zhong; Aidong Zhang; | In this paper, we propose a task-adaptive network (TAdaNet) that makes use of a domain-knowledge graph to enrich data representations and provide task-specific customization. |
175 | Unsupervised Paraphrasing via Deep Reinforcement Learning | A. B. Siddique; Samet Oymak; Vagelis Hristidis; | We propose Progressive Unsupervised Paraphrasing (PUP): a novel unsupervised paraphrase generation method based on deep reinforcement learning (DRL). |
176 | CICLAD: A Fast and Memory-efficient Closed Itemset Miner for Streams | Tomas Martin; Guy Francoeur; Petko Valtchev; | In a search for a better storage-efficiency trade-off, we designed Ciclad, an intersection-based sliding-window FCI miner. |
177 | Graph Attention Networks over Edge Content-Based Channels | Lu Lin; Hongning Wang; | In this paper, we propose a channel-aware attention mechanism enabled by edge text content when aggregating information from neighboring nodes; and we realize this mechanism in a graph autoencoder framework. |
178 | Multimodal Learning with Incomplete Modalities by Knowledge Distillation | Qi Wang; Liang Zhan; Paul Thompson; Jiayu Zhou; | In this paper, we proposed a framework based on knowledge distillation, utilizing the supplementary information from all modalities, and avoiding imputation and noise associated with it. |
179 | Estimating the Percolation Centrality of Large Networks through Pseudo-dimension Theory | Alane M. de Lima, Murilo V. G. da Silva, André L. Vignatti; | In this work we investigate the problem of estimating the percolation centrality of every vertex in a graph. |
180 | TinyGNN: Learning Efficient Graph Neural Networks | Bencheng Yan; Chaokun Wang; Gaoyang Guo; Yunkai Lou; | In this paper, we try to learn a small GNN (called TinyGNN), which can achieve high performance and infer the node representation in a short time. |
181 | GPT-GNN: Generative Pre-Training of Graph Neural Networks | Ziniu Hu; Yuxiao Dong; Kuansan Wang; Kai-Wei Chang; Yizhou Sun; | In this paper, we present the GPT-GNN framework to initialize GNNs by generative pre-training. |
182 | Parameterized Correlation Clustering in Hypergraphs and Bipartite Graphs | Nate Veldt; Anthony Wirth; David F. Gleich; | Motivated by applications in community detection and dense subgraph discovery, we consider new clustering objectives in hypergraphs and bipartite graphs. |
183 | Prioritized Restreaming Algorithms for Balanced Graph Partitioning | Amel Awadelkarim; Johan Ugander; | With the help of this modular perspective, we find that a key combination of design decisions leads to a novel family of algorithms with notably better empirical performance than any existing highly-scalable algorithm on a broad range of real-world graphs. |
184 | A Non-Iterative Quantile Change Detection Method in Mixture Model with Heavy-Tailed Components | Yuantong Li; Qi Ma; Sujit K. Ghosh; | In this paper, we propose a robust and quick approach based on change-point methods to determine the number of mixture components that works for almost any location-scale families even when the components are heavy tailed (e.g., Cauchy). |
185 | AdvMind: Inferring Adversary Intent of Black-Box Attacks | Ren Pang; Xinyang Zhang; Shouling Ji; Xiapu Luo; Ting Wang; | In this paper, we present AdvMind, a new class of estimation models that infer the adversary intent of black-box adversarial attacks in a robust and prompt manner. |
186 | Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding | Yu Meng; Yunyi Zhang; Jiaxin Huang; Yu Zhang; Chao Zhang; Jiawei Han; | To guide the hierarchical topic discovery process with minimal user supervision, we propose a new task, Hierarchical Topic Mining, which takes a category tree described by category names only, and aims to mine a set of representative terms for each category from a text corpus to help a user comprehend his/her interested topics. |
187 | Combinatorial Black-Box Optimization with Expert Advice | Hamid Dadkhahi; Karthikeyan Shanmugam; Jesus Rios; Payel Das; Samuel C. Hoffman; Troy David Loeffler; Subramanian Sankaranarayanan; | To address this problem, we propose a computationally efficient model learning algorithm based on multilinear polynomials and exponential weight updates. |
188 | CoRel: Seed-Guided Topical Taxonomy Construction by Concept Learning and Relation Transferring | Jiaxin Huang; Yiqing Xie; Yu Meng; Yunyi Zhang; Jiawei Han; | In this paper, we propose a method for seed-guided topical taxonomy construction, which takes a corpus and a seed taxonomy described by concept names as input, and constructs a more complete taxonomy based on user’s interest, wherein each node is represented by a cluster of coherent terms. |
189 | Treatment Policy Learning in Multiobjective Settings with Fully Observed Outcomes | Soorajnath Boominathan; Michael Oberst; Helen Zhou; Sanjat Kanjilal; David Sontag; | We present, compare, and evaluate three approaches for learning individualized treatment policies in this setting: First, we consider two indirect approaches, which use predictive models of treatment response to construct policies optimal for different trade-offs between objectives. Second, we consider a direct approach that constructs such a set of policies without intermediate models of outcomes. |
190 | List-wise Fairness Criterion for Point Processes | Jin Shang; Mingxuan Sun; Nina S.N. Lam; | In this paper, we propose a novel list-wise fairness criterion for point processes, which can efficiently evaluate the ranking fairness in event prediction. |
191 | Neural Subgraph Isomorphism Counting | Xin Liu; Haojie Pan; Mutian He; Yangqiu Song; Xin Jiang; Lifeng Shang; | In this paper, we study a new graph learning problem: learning to count subgraph isomorphisms. |
192 | Hypergraph Clustering Based on PageRank | Yuuki Takai; Atsushi Miyauchi; Masahiro Ikeda; Yuichi Yoshida; | In this study, we develop two clustering algorithms based on personalized PageRank on hypergraphs. |
193 | DeepSinger: Singing Voice Synthesis with Data Mined From the Web | Yi Ren; Xu Tan; Tao Qin; Jian Luan; Zhou Zhao; Tie-Yan Liu; | In this paper, we develop DeepSinger, a multi-lingual multi-singer singing voice synthesis (SVS) system, which is built from scratch using singing training data mined from music websites. |
194 | Scaling Choice Models of Relational Social Data | Jan Overgoor; George Pakapol Supaniratisai; Johan Ugander; | Given the importance of negative sampling, in this work we introduce a model simplification technique for mixed logit models that we call "de-mixing”, whereby standard mixture models of network formation—particularly models that mix local and global link formation—are reformulated to operate their modes over disjoint choice sets. |
195 | Deep Exogenous and Endogenous Influence Combination for Social Chatter Intensity Prediction | Subhabrata Dutta; Sarah Masud; Soumen Chakrabarti; Tanmoy Chakraborty; | To address the three limitations noted above, we propose a novel framework, ChatterNet, which, to our knowledge, is the first that can model and predict user engagement without considering the underlying user network. |
196 | Geography-Aware Sequential Location Recommendation | Defu Lian; Yongji Wu; Yong Ge; Xing Xie; Enhong Chen; | To this end, we propose a Geography-aware sequential recommender based on the Self-Attention Network (GeoSAN for short) for location recommendation. |
197 | Dual Channel Hypergraph Collaborative Filtering | Shuyi Ji; Yifan Feng; Rongrong Ji; Xibin Zhao; Wanwan Tang; Yue Gao; | Under such circumstances, we propose a dual channel hypergraph collaborative filtering (DHCF) framework to tackle the above issues. |
198 | A Framework for Recommending Accurate and Diverse Items Using Bayesian Graph Convolutional Neural Networks | Jianing Sun; Wei Guo; Dengcheng Zhang; Yingxue Zhang; Florence Regol; Yaochen Hu; Huifeng Guo; Ruiming Tang; Han Yuan; Xiuqiang He; Mark Coates; | To alleviate the above issue, in this work, we take a first step to introduce a principled way to model the uncertainty in the user-item interaction graph using the Bayesian Graph Convolutional Neural Network framework. |
199 | Learning Based Distributed Tracking | Hao WU; Junhao Gan; Rui Zhang; | In this paper, we revisit a fundamental problem called Distributed Tracking (DT) under an assumption that the data follows a certain (known or unknown) distribution, and propose a number Data-dependent algorithms with improved theoretical bounds. |
200 | Tight Sensitivity Bounds For Smaller Coresets | Alaa Maalouf; Adiel Statman; Dan Feldman; | We provide algorithms that compute provably tight bounds for the sensitivity of each input row. It is based on two ingredients: (i) iterative algorithm that computes the exact sensitivity of each row up to arbitrary small precision for (non-affine) k-subspaces, and (ii) a general reduction for computing a coreset for affine subspaces, given a coreset for (non-affine) subspaces in Rd. |
201 | GHashing: Semantic Graph Hashing for Approximate Similarity Search in Graph Databases | Zongyue Qin; Yunsheng Bai; Yizhou Sun; | Inspired by the recent success of deep-learning-based semantic hashing in image and document retrieval, we propose a novel graph neural network (GNN) based semantic hashing, i.e. GHashing, for approximate pruning. |
202 | Interactive Path Reasoning on Graph for Conversational Recommendation | Wenqiang Lei; Gangyi Zhang; Xiangnan He; Yisong Miao; Xiang Wang; Liang Chen; Tat-Seng Chua; | In this paper, we propose Conversational Path Reasoning (CPR), a generic framework that models conversational recommendation as an interactive path reasoning problem on a graph. |
203 | Algorithmic Aspects of Temporal Betweenness | Sebastian Buß; Hendrik Molter; Rolf Niedermeier; Maciej Rymar; | We provide a systematic study of temporal betweenness variants based on various concepts of optimal temporal paths both on a theoretical and empirical level. |
204 | Non-Linear Mining of Social Activities in Tensor Streams | Koki Kawabata; Yasuko Matsubara; Takato Honda; Yasushi Sakurai; | In this paper, we propose a streaming method, namely, CubeCast, that is designed to capture basic trends and seasonality in tensor streams and extract temporal and multi-dimensional relationships between such dynamics. |
205 | DeepLine: AutoML Tool for Pipelines Generation using Deep Reinforcement Learning and Hierarchical Actions Filtering | Yuval Heffetz; Roman Vainshtein; Gilad Katz; Lior Rokach; | In this study we present DeepLine, a reinforcement learning-based approach for automatic pipeline generation. |
206 | On Sampling Top-K Recommendation Evaluation | Dong Li; Ruoming Jin; Jing Gao; Zhi Liu; | In this work, we thoroughly investigate the relationship between the sampling and global top-K Hit-Ratio (HR, or Recall), originally proposed by Koren[2] and extensively used by others. |
207 | Algorithmic Decision Making with Conditional Fairness | Renzhe Xu; Peng Cui; Kun Kuang; Bo Li; Linjun Zhou; Zheyan Shen; Wei Cui; | We thus define conditional fairness as a more sound fairness metric by conditioning on the fairness variables. |
208 | Semi-supervised Collaborative Filtering by Text-enhanced Domain Adaptation | Wenhui Yu; Xiao Lin; Junfeng Ge; Wenwu Ou; Zheng Qin; | To solve these difficulties, we regard the problem of recommendation on sparse implicit feedbacks as a semi-supervised learning task, and explore domain adaption to solve it. |
209 | Rich Information is Affordable: A Systematic Performance Analysis of Second-order Optimization Using K-FAC | Yuichiro Ueno; Kazuki Osawa; Yohei Tsuji; Akira Naruse; Rio Yokota; | In this work, we conduct a step-by-step performance analysis when computing the Fisher information matrix during training of ResNet-50 on ImageNet, and show that the overhead can be reduced to the same amount as the cost of performing a single SGD step. |
210 | Voronoi Graph Traversal in High Dimensions with Applications to Topological Data Analysis and Piecewise Linear Interpolation | Vladislav Polianskii; Florian T. Pokorny; | We propose a randomized approximation approach that mitigates the prohibitive cost of exact computation of Voronoi diagrams in high dimensions for machine learning applications. |
211 | MCRapper: Monte-Carlo Rademacher Averages for Poset Families and Approximate Pattern Mining | Leonardo Pellegrina; Cyrus Cousins; Fabio Vandin; Matteo Riondato; | We present MCRapper, an algorithm for efficient computation of Monte-Carlo Empirical Rademacher Averages (MCERA) for families of functions exhibiting poset (e.g., lattice) structure, such as those that arise in many pattern mining tasks. |
212 | REA: Robust Cross-lingual Entity Alignment Between Knowledge Graphs | Shichao Pei; Lu Yu; Guoxian Yu; Xiangliang Zhang; | Our proposed method named REA (Robust Entity Alignment) consists of two components: noise detection and noise-aware entity alignment. |
213 | Stable Learning via Differentiated Variable Decorrelation | Zheyan Shen; Peng Cui; Jiashuo Liu; Tong Zhang; Bo Li; Zhitang Chen; | In this paper, we incorporate the unlabled data from multiple environments into the variable decorrelation framework and propose a Differentiated Variable Decorrelation (DVD) algorithm based on the clustering of variables. |
214 | Learning Stable Graphs from Multiple Environments with Selection Bias | Yue He; Peng Cui; Jianxin Ma; Hao Zou; Xiaowei Wang; Hongxia Yang; Philip S. Yu; | In this paper, we target the problem of learning stable graphs from multiple environments with selection bias. |
215 | Fast RobustSTL: Efficient and Robust Seasonal-Trend Decomposition for Time Series with Complex Patterns | Qingsong Wen; Zhe Zhang; Yan Li; Liang Sun; | In this paper, we extend RobustSTL to handle multiple seasonality. |
216 | CurvaNet: Geometric Deep Learning based on Directional Curvature for 3D Shape Analysis | Wenchong He; Zhe Jiang; Chengming Zhang; Arpan Man Sainju; | In contrast, this paper proposes a novel geometric deep learning model called CurvaNet that integrates differential geometry with graph neural networks. |
217 | Attentional Multi-graph Convolutional Network for Regional Economy Prediction with Open Migration Data | Fengli Xu; Yong Li; Shusheng Xu; | We study the problem of predicting regional economy of U.S. counties with open migration data collected from U.S. Internal Revenue Service (IRS) records. |
218 | Octet: Online Catalog Taxonomy Enrichment with Self-Supervision | Yuning Mao; Tong Zhao; Andrey Kan; Chenwei Zhang; Xin Luna Dong; Christos Faloutsos; Jiawei Han; | In this paper, we present a self-supervised end-to-end framework, Octet, for Online Catalog Taxonomy EnrichmenT. |
219 | TIMME: Twitter Ideology-detection via Multi-task Multi-relational Embedding | Zhiping Xiao; Weiping Song; Haoyan Xu; Zhicheng Ren; Yizhou Sun; | We aim at solving the problem of predicting people’s ideology, or political tendency. |
220 | Knowing your FATE: Friendship, Action and Temporal Explanations for User Engagement Prediction on Social Apps | Xianfeng Tang; Yozen Liu; Neil Shah; Xiaolin Shi; Prasenjit Mitra; Suhang Wang; | In this paper, we study a novel problem of explainable user engagement prediction for social network Apps. |
221 | Sub-Matrix Factorization for Real-Time Vote Prediction | Alexander Immer; Victor Kristof; Matthias Grossglauser; Patrick Thiran; | We address the problem of predicting aggregate vote outcomes (e.g., national) from partial outcomes (e.g., regional) that are revealed sequentially. |
222 | Temporal-Contextual Recommendation in Real-Time | Yifei Ma; Balakrishnan (Murali) Narayanaswamy; Haibin Lin; Hao Ding; | To fill this gap, we present a black-box recommender system that can adapt to a diverse set of scenarios without the need for manual tuning. |
223 | OptMatch: Optimized Matchmaking via Modeling the High-Order Interactions on the Arena | Linxia Gong; Xiaochuan Feng; Dezhi Ye; Hao Li; Runze Wu; Jianrong Tao; Changjie Fan; Peng Cui; | This paper proposes a two-stage data-driven matchmaking framework (namely OptMatch), which is applicable to most of gaming products and has the minimal product knowledge required. |
224 | PinnerSage: Multi-Modal User Embedding Framework for Recommendations at Pinterest | Aditya Pal; Chantat Eksombatchai; Yitong Zhou; Bo Zhao; Charles Rosenberg; Jure Leskovec; | In this work, we introduce PinnerSage, an end-to-end recommender system that represents each user via multi-modal embeddings and leverages this rich representation of users to provides high quality personalized recommendations. |
225 | Polestar: An Intelligent, Efficient and National-Wide Public Transportation Routing Engine | Hao Liu; Ying Li; Yanjie Fu; Huaibo Mei; Jingbo Zhou; Xu Ma; Hui Xiong; | To this end, in this paper, we present Polestar, a data-driven engine for intelligent and efficient public transportation routing. |
226 | Context-Aware Attentive Knowledge Tracing | Aritra Ghosh; Neil Heffernan; Andrew S. Lan; | In this paper, we propose attentive knowledge tracing (AKT), which couples flexible attention-based neural network models with a series of novel, interpretable model components inspired by cognitive and psychometric models. |
227 | Improving Movement Predictions of Traffic Actors in Bird’s-Eye View Models using GANs and Differentiable Trajectory Rasterization | Eason Wang; Henggang Cui; Sai Yalamanchi; Mohana Moorthy; Nemanja Djuric; | In this paper we build upon these two directions and propose a raster-based conditional GAN architecture, powered by a novel differentiable rasterizer module at the input of the conditional discriminator that maps generated trajectories into the raster space in a differentiable manner. |
228 | M2GRL: A Multi-task Multi-view Graph Representation Learning Framework for Web-scale Recommender Systems | Menghan Wang; Yujie Lin; Guli Lin; Keping Yang; Xiao-ming Wu; | In this paper, we use a multi-view representation alignment approach to address this issue. |
229 | Attribute-based Propensity for Unbiased Learning in Recommender Systems: Algorithm and Case Studies | Zhen Qin; Suming J. Chen; Donald Metzler; Yongwoo Noh; Jingzheng Qin; Xuanhui Wang; | In this paper, we generalize the traditional position bias model to an attribute-based propensity framework. |
230 | Predicting Individual Treatment Effects of Large-scale Team Competitions in a Ride-sharing Economy | Teng Ye; Wei Ai; Lingyu Zhang; Ning Luo; Lulu Zhang; Jieping Ye; Qiaozhu Mei; | In this study, we analyze data collected from more than 500 large-scale team competitions organized by a leading ride-sharing platform, building machine learning models to predict individual treatment effects. |
231 | Cellular Network Radio Propagation Modeling with Deep Convolutional Neural Networks | Xin Zhang; Xiujun Shu; Bingwen Zhang; Jie Ren; Lizhou Zhou; Xin Chen; | In this article we present a novel method to model radio propagation using deep convolutional neural networks and report significantly improved performance compared to conventional models. |
232 | Neural Input Search for Large Scale Recommendation Models | Manas R. Joglekar; Cong Li; Mei Chen; Taibai Xu; Xiaoming Wang; Jay K. Adams; Pranav Khaitan; Jiahui Liu; Quoc V. Le; | We present Neural Input Search (NIS), a technique for learning the optimal vocabulary sizes and embedding dimensions for categorical features. |
233 | Easy Perturbation EEG Algorithm for Spectral Importance (easyPEASI): A Simple Method to Identify Important Spectral Features of EEG in Deep Learning Models | David O. Nahmias; Kimberly L. Kontson; | This work proposes and validates a method to investigate frequency bands important to EEG-driven deep learning models. |
234 | Building Continuous Integration Services for Machine Learning | Bojan Karlaš; Matteo Interlandi; Cedric Renggli; Wentao Wu; Ce Zhang; Deepak Mukunthu Iyappan Babu; Jordan Edwards; Chris Lauren; Andy Xu; Markus Weimer; | We develop the first CI system for ML, to the best of our knowledge, that integrates seamlessly with existing ML development tools. |
235 | Learning to Cluster Documents into Workspaces Using Large Scale Activity Logs | Weize Kong; Michael Bendersky; Marc Najork; Brandon Vargo; Mike Colagrosso; | We go beyond the textual similarity-based unsupervised clustering paradigm and instead directly learn from users’ activity for document clustering. |
236 | What is that Building?: An End-to-end System for Building Recognition from Streetside Images | Chiqun Zhang; Dragomir Yankov; Chun-Ting Wu; Simon Shapiro; Jason Hong; Wei Wu; | The paper describes Streetside Building Search-Retrieve System (SBSRS) – a system for recognizing buildings from steetside images. To evaluate the system, we generate a dataset of over 23K unique business buildings from four major US cities. |
237 | MultiSage: Empowering GCN with Contextualized Multi-Embeddings on Web-Scale Multipartite Networks | Carl Yang; Aditya Pal; Andrew Zhai; Nikil Pancha; Jiawei Han; Charles Rosenberg; Jure Leskovec; | Here, we present a contextualized GCN engine by modeling the multipartite networks of target nodes and their intermediatecontext nodes that specify the contexts of their interactions. |
238 | HetETA: Heterogeneous Information Network Embedding for Estimating Time of Arrival | Huiting Hong; Yucheng Lin; Xiaoqing Yang; Zang Li; Kung Fu; Zheng Wang; Xiaohu Qie; Jieping Ye; | In this paper, we propose HetETA to leverage heterogeneous information graph in ETA task. |
239 | Hubble: An Industrial System for Audience Expansion in Mobile Marketing | Chenyi Zhuang; Ziqi Liu; Zhiqiang Zhang; Yize Tan; Zhengwei Wu; Zhining Liu; Jianping Wei; Jinjie Gu; Guannan Zhang; Jun Zhou; Yuan Qi; | Addressing the above challenges, in this paper, we present the Hubble System, an industrial solution for audience expansion in mobile marketing scenario. |
240 | Scaling Graph Neural Networks with Approximate PageRank | Aleksandar Bojchevski; Johannes Klicpera; Bryan Perozzi; Amol Kapoor; Martin Blais; Benedek Rózemberczki; Michal Lukasik; Stephan Günnemann; | We present the PPRGo model which utilizes an efficient approximation of information diffusion in GNNs resulting in significant speed gains while maintaining state-of-the-art prediction performance. |
241 | Combo-Attention Network for Baidu Video Advertising | Tan Yu; Yi Yang; Yi Li; Xiaodong Chen; Mingming Sun; Ping Li; | In this paper, we introduce a technique used in Baidu video advertising for feeding relevant video ads according to the user’s query. To testify the effectiveness of the proposed CAN offline, we built a Daily700K dataset collected from HaoKan APP. |
242 | Federated Doubly Stochastic Kernel Learning for Vertically Partitioned Data | Bin Gu; Zhiyuan Dang; Xiang Li; Heng Huang; | In this paper, we focus on nonlinear learning with kernels,and propose a federated doubly stochastic kernel learning (FDSKL) algorithm for vertically partitioned data. |
243 | To Tune or Not to Tune?: In Search of Optimal Configurations for Data Analytics | Ayat Fekry; Lucian Carata; Thomas Pasquier; Andrew Rice; Andy Hopper; | We adapt different ML techniques in order to obtain efficient incremental tuning in our problem domain, and propose Tuneful, a configuration tuning framework. |
244 | Reconstruction and Decomposition of High-Dimensional Landscapes via Unsupervised Learning | Jing Lei; Nasrin Akhter; Wanli Qiao; Amarda Shehu; | In this paper, we present a novel, hybrid method that combines strengths of these methods, allowing both visualization of the landscape and discovery of macrostates. |
245 | Map Generation from Large Scale Incomplete and Inaccurate Data Labels | Rui Zhang; Conrad Albrecht; Wei Zhang; Xiaodong Cui; Ulrich Finkler; David Kung; Siyuan Lu; | In this paper we present progress in developing an algorithmic pipeline and distributed compute system that automates the process of map creation using high resolution aerial images. |
246 | Grale: Designing Networks for Graph Learning | Jonathan Halcrow; Alexandru Mosoi; Sam Ruth; Bryan Perozzi; | In this work, we present Grale, a scalable method we have developed to address the problem of graph design for graphs with billions of nodes. |
247 | Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data | Yaqing Wang; Yifan Ethan Xu; Xian Li; Xin Luna Dong; Jing Gao; | In this paper, we propose to develop an automatic validation approach that verifies the correctness of textual attribute values for products. |
248 | CLARA: Confidence of Labels and Raters | Viet-An Nguyen; Peibei Shi; Jagdish Ramakrishnan; Udi Weinsberg; Henry C. Lin; Steve Metz; Neil Chandra; Jane Jing; Dimitris Kalimeris; | In this paper, we present CLARA (Confidence of Labels and Raters), a system developed and deployed at Facebook for aggregating reviewer decisions and estimating their uncertainty. |
249 | Embedding-based Retrieval in Facebook Search | Jui-Ting Huang; Ashish Sharma; Shuying Sun; Li Xia; David Zhang; Philip Pronin; Janani Padmanabhan; Giuseppe Ottaviano; Linjun Yang; | In this paper, we discuss the techniques for applying EBR to a Facebook Search system. |
250 | Lumos: A Library for Diagnosing Metric Regressions in Web-Scale Applications | Jamie Pool; Ebrahim Beyrami; Vishak Gopal; Ashkan Aazami; Jayant Gupchup; Jeff Rowland; Binlong Li; Pritesh Kanani; Ross Cutler; Johannes Gehrke; | In this work, we open sourceLumos and present our results from applying it to two different components within the RTC group over millions of sessions. |
251 | Order Fulfillment Cycle Time Estimation for On-Demand Food Delivery | Lin Zhu; Wei Yu; Kairong Zhou; Xing Wang; Wenxing Feng; Pengyu Wang; Ning Chen; Pei Lee; | In this paper, we present the OFCT prediction model that is currently deployed at Ele.me, which is one of the world’s largest OFD platforms and delivers over 10 million meals in more than 200 Chinese cities every day. |
252 | Calendar Graph Neural Networks for Modeling Time Structures in Spatiotemporal User Behaviors | Daheng Wang; Meng Jiang; Munira Syed; Oliver Conway; Vishal Juneja; Sriram Subramanian; Nitesh V. Chawla; | In this work, we propose a novel model based on graph neural networks for learning user representations from spatiotemporal behavior data. |
253 | Privileged Features Distillation at Taobao Recommendations | Chen Xu; Quan Li; Junfeng Ge; Jinyang Gao; Xiaoyong Yang; Changhua Pei; Fei Sun; Jian Wu; Hanxiao Sun; Wenwu Ou; | Inspired by the distillation techniques which bridge the gap between training and inference, in this work, we propose privileged features distillation (PFD). |
254 | Cracking Tabular Presentation Diversity for Automatic Cross-Checking over Numerical Facts | Hongwei Li; Qingping Yang; Yixuan Cao; Jiaquan Yao; Ping Luo; | This paper introduces the key module of such a system, which aims to identify whether a pair of table cells are semantically equivalent, namely referring to the same fact. |
255 | GrokNet: Unified Computer Vision Model Trunk and Embeddings For Commerce | Sean Bell; Yiqun Liu; Sami Alsheikh; Yina Tang; Edward Pizzi; M. Henning; Karun Singh; Omkar Parkhi; Fedor Borisyuk; | In this paper, we present GrokNet, a deployed image recognition system for commerce applications. |
256 | Learning Instrument Invariant Characteristics for Generating High-resolution Global Coral Reef Maps | Ata Akbari Asanjan; Kamalika Das; Alan Li; Ved Chirayath; Juan Torres-Perez; Soroosh Sorooshian; | In this work, we develop a deep learning model for extracting domain invariant features from multimodal remote sensing imagery and creating high-resolution global maps of coral reefs by combining various sources of imagery and limited hand-labeled data available for certain regions. |
257 | Causal Meta-Mediation Analysis: Inferring Dose-Response Function From Summary Statistics of Many Randomized Experiments | Zenan Wang; Xuan Yin; Tianbo Li; Liangjie Hong; | We model the online evaluation metric as a mediator and formalize its causality with the business KPI as dose-response function (DRF). |
258 | AutoFIS: Automatic Feature Interaction Selection in Factorization Models for Click-Through Rate Prediction | Bin Liu; Chenxu Zhu; Guilin Li; Weinan Zhang; Jincai Lai; Ruiming Tang; Xiuqiang He; Zhenguo Li; Yong Yu; | In this work, we propose a two-stage algorithm called Automatic Feature Interaction Selection (AutoFIS). |
259 | City Metro Network Expansion with Reinforcement Learning | Yu Wei; Minjia Mao; Xi Zhao; Jianhua Zou; Ping An; | To address these limitations, we propose a reinforcement learning based method for the city metro network expansion problem. |
260 | Game Action Modeling for Fine Grained Analyses of Player Behavior in Multi-player Card Games (Rummy as Case Study) | Sharanya Eswaran; Mridul Sachdeva; Vikram Vimal; Deepanshi Seth; Suhaas Kalpam; Sanjay Agarwal; Tridib Mukherjee; Samrat Dattagupta; | We present a deep learning framework for game action modeling, which enables fine-grained analyses of player behavior. |
261 | Cascade-LSTM: A Tree-Structured Neural Classifier for Detecting Misinformation Cascades | Francesco Ducci; Mathias Kraus; Stefan Feuerriegel; | As a remedy, we propose a novel tree-structured neural network named Cascade-LSTM. |
262 | Personalized Prefix Embedding for POI Auto-Completion in the Search Engine of Baidu Maps | Jizhou Huang; Haifeng Wang; Miao Fan; An Zhuo; Ying Li; | In this paper, we present an end-to-end neural-based framework for POI-AC, which has been recently deployed in the search engine of Baidu Maps, one of the largest Web mapping applications with hundreds of millions monthly active users worldwide. |
263 | Category-Specific CNN for Visual-aware CTR Prediction at JD.com | Hu Liu; Jing Lu; Hao Yang; Xiwei Zhao; Sulong Xu; Hao Peng; Zehua Zhang; Wenjie Niu; Xiaokun Zhu; Yongjun Bao; Weipeng Yan; | To overcome the two challenges, we propose Category-specific CNN (CSCNN) specially for CTR prediction. |
264 | ConSTGAT: Contextual Spatial-Temporal Graph Attention Network for Travel Time Estimation at Baidu Maps | Xiaomin Fang; Jizhou Huang; Fan Wang; Lingke Zeng; Haijin Liang; Haifeng Wang; | In this paper, we propose an end-to-end neural framework named ConSTGAT, which integrates traffic prediction and contextual information to address these two problems. |
265 | Faster Secure Data Mining via Distributed Homomorphic Encryption | Junyi Li; Heng Huang; | In this paper, we propose a novel general distributed HE-based data mining framework towards one step of solving the scaling problem. |
266 | Contagious Chain Risk Rating for Networked-guarantee Loans | Dawei Cheng; Zhibin Niu; Yiyi Zhang; | To this end, we propose a novel approach to rate the risk of contagion chains in the bank industry with the deep neural network. |
267 | AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types | Xin Luna Dong; Xiang He; Andrey Kan; Xian Li; Yan Liang; Jun Ma; Yifan Ethan Xu; Chenwei Zhang; Tong Zhao; Gabriel Blanco Saldana; Saurabh Deshpande; Alexandre Michetti Manduca; Jay Ren; Surender Pal Singh; Fan Xiao; Haw-Shiuan Chang; Giannis Karamanolakis; Yuning Mao; Yaqing Wang; Christos Faloutsos; Andrew McCallum; Jiawei Han; | We describe AutoKnow, our automatic (self-driving) system that addresses these challenges. |
268 | Personalized Image Retrieval with Sparse Graph Representation Learning | Xiaowei Jia; Handong Zhao; Zhe Lin; Ajinkya Kale; Vipin Kumar; | In this paper, we develop a novel method CA-GCN for personalized image retrieval in the Adobe Stock image system. |
269 | Comprehensive Information Integration Modeling Framework for Video Titling | Shengyu Zhang; Ziqi Tan; Zhou Zhao; Jin Yu; Kun Kuang; Tan Jiang; Jingren Zhou; Hongxia Yang; Fei Wu; | To bridge this gap, we integrate comprehensive sources of information, including the content of consumer-generated videos, the narrative comment sentences supplied by consumers, and the product attributes, in an end-to-end modeling framework. |
270 | Acoustic Measures for Real-Time Voice Coaching | Ying Li; Abraham Miller; Arthur Liu; Kyle Coburn; Luis J. Salazar; | This paper presents methodologies for computing a set of physical properties from sound waves of a speaker’s voice directly, referred to as acoustic measures. |
271 | Geodemographic Influence Maximization | Kaichen Zhang; Jingbo Zhou; Donglai Tao; Panagiotis Karras; Qing Li; Hui Xiong; | In this paper, we address the natural problem that arises such data: given a distribution of population and point-to-point movement statistics over a network, find a set of locations within a budget that achieves maximum expected reach. |
272 | A Self-Evolving Mutually-Operative Recurrent Network-based Model for Online Tool Condition Monitoring in Delay Scenario | Monidipa Das; Mahardhika Pratama; Tegoeh Tjahjowidodo; | In order to tackle these issues, we propose SERMON as a novel learning model based on a pair of self-evolving mutually-operative recurrent neural networks. |
273 | Maximizing Cumulative User Engagement in Sequential Recommendation: An Online Optimization Perspective | Yifei Zhao; Yu-Hang Zhou; Mingdong Ou; Huan Xu; Nan Li; | In this paper, we study this problem from an online optimization perspective, and propose a flexible and practical framework to explicitly tradeoff longer user browsing length and high immediate user engagement. |
274 | Domain Specific Knowledge Graphs as a Service to the Public: Powering Social-Impact Funding in the US | Ying Li; Vitalii Zakhozhyi; Daniel Zhu; Luis J. Salazar; | This paper explores the practical applications of Domain Specific Knowledge Graphs that allow for the extraction of information from trusted published and unpublished sources, to map the extracted information to an ontology defined in collaboration with sector experts, and to enable the public to go from single queries into ongoing conversations meeting their knowledge needs reliably. |
275 | LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition | Jin Xu; Xu Tan; Yi Ren; Tao Qin; Jian Li; Sheng Zhao; Tie-Yan Liu; | In this paper, we develop LRSpeech, a TTS and ASR system under the extremely low-resource setting, which can support rare languages with low data cost. |
276 | Doing in One Go: Delivery Time Inference Based on Couriers’ Trajectories | Sijie Ruan; Zi Xiong; Cheng Long; Yiheng Chen; Jie Bao; Tianfu He; Ruiyuan Li; Shengnan Wu; Zhongyuan Jiang; Yu Zheng; | To this end, we propose Delivery Time Inference (DTInf), to automatically infer the delivery time of waybills based on couriers’ trajectories. |
277 | Improving Deep Learning for Airbnb Search | Malay Haldar; Prashant Ramanathan; Tyler Sax; Mustafa Abdool; Lanbo Zhang; Aamir Mansawala; Shulin Yang; Bradley Turnbull; Junshuo Liao; | In this paper we describe the journey beyond, discussing what we refer to as the ABCs of improving search: A for architecture, B for bias and C for cold start. |
278 | General-Purpose User Embeddings based on Mobile App Usage | Junqi Zhang; Bing Bai; Ye Lin; Jian Liang; Kun Bai; Fei Wang; | In this paper, we report our recent practice at Tencent for user modeling based on mobile app usage. |
279 | Unsupervised Translation via Hierarchical Anchoring: Functional Mapping of Places across Cities | Takahiro Yabe; Kota Tsubouchi; Toru Shimizu; Yoshihide Sekimoto; Satish V. Ukkusuri; | We address this problem by proposing an unsupervised translation method that translates embeddings by exploiting common hierarchical structures that exist across imbalanced domains. |
280 | Debiasing Grid-based Product Search in E-commerce | Ruocheng Guo; Xiaoting Zhao; Adam Henderson; Liangjie Hong; Huan Liu; | In this work, we extend unbiased learning to rank to the world of e-commerce search via considering a grid-based product search scenario. |
281 | Forecasting the Evolution of Hydropower Generation | Fan Zhou; Liang Li; Kunpeng Zhang; Goce Trajcevski; Fuming Yao; Ying Huang; Ting Zhong; Jiahao Wang; Qiao Liu; | In this paper, we present DeepHydro, a novel stochastic method for modeling multivariate time series (e.g., water inflow/outflow and temperature) and forecasting power generation of hydropower stations. |
282 | Salience and Market-aware Skill Extraction for Job Targeting | Baoxu Shi; Jaewon Yang; Feng Guo; Qi He; | In this work, we show that the commonly used text-based salience and market-agnostic skill extraction approach is sub-optimal because it only considers skill mention and ignores the salient level of a skill and its market dynamics, i.e., the market supply and demand influence on the importance of skills. |
283 | DATE: Dual Attentive Tree-aware Embedding for Customs Fraud Detection | Sundong Kim; Yu-Che Tsai; Karandeep Singh; Yeonsoo Choi; Etim Ibok; Cheng-Te Li; Meeyoung Cha; | This paper proposes DATE, a model of Dual-task Attentive Tree-aware Embedding, to classify and rank illegal trade flows that contribute the most to the overall customs revenue when caught. |
284 | User Sentiment as a Success Metric: Persistent Biases Under Full Randomization | Ercan Yildiz; Joshua Safyan; Marc Harper; | We study user sentiment (reported via optional surveys) as a metric for fully randomized A/B tests. |
285 | Improving Recommendation Quality in Google Drive | Suming J. Chen; Zhen Qin; Zac Wilson; Brian Calaci; Michael Rose; Ryan Evans; Sean Abraham; Donald Metzler; Sandeep Tata; Michael Colagrosso; | In this paper, we discuss both the challenges of iteratively improving the quality of a personal recommendation system as well as the variety of approaches that we took in order to improve this feature. |
286 | Large-Scale Training System for 100-Million Classification at Alibaba | Liuyihan Song; Pan Pan; Kang Zhao; Hao Yang; Yiming Chen; Yingya Zhang; Yinghui Xu; Rong Jin; | In this paper, we propose a large-scale training system to address these challenges. |
287 | Mining Implicit Relevance Feedback from User Behavior for Web Question Answering | Linjun Shou; Shining Bo; Feixiang Cheng; Ming Gong; Jian Pei; Daxin Jiang; | In this paper, we make the first study to explore the correlation between user behavior and passage relevance, and propose a novel approach for mining training data for Web QA. |
288 | Controllable Multi-Interest Framework for Recommendation | Yukuo Cen; Jianwei Zhang; Xu Zou; Chang Zhou; Hongxia Yang; Jie Tang; | In this paper, we propose a novel controllable multi-interest framework for the sequential recommendation, called ComiRec. |
289 | Managing Diversity in Airbnb Search | Mustafa Abdool; Malay Haldar; Prashant Ramanathan; Tyler Sax; Lanbo Zhang; Aamir Manaswala; Lynn Yang; Bradley Turnbull; Qing Zhang; Thomas Legrand; | In this paper, we describe our journey in tackling the problem of diversity for Airbnb search, starting from heuristic based approaches and concluding with a novel deep learning solution that produces an embedding of the entire query context by leveraging Recurrent Neural Networks (RNNs). |
290 | Molecular Inverse-Design Platform for Material Industries | Seiji Takeda; Toshiyuki Hama; Hsiang-Han Hsu; Victoria A. Piunova; Dmitry Zubarev; Daniel P. Sanders; Jed W. Pitera; Makoto Kogoh; Takumi Hongo; Yenwei Cheng; Wolf Bocanett; Hideaki Nakashika; Akihiro Fujita; Yuta Tsuchiya; Katsuhiko Hino; Kentaro Yano; Shuichi Hirose; Hiroki Toda; Yasumitsu Orii; Daiju Nakano; | In this paper, we present a material industry-oriented web platform of an AI-driven molecular inverse-design system, which automatically designs brand new molecular structures rapidly and diversely. |
291 | Learning to Score Economic Development from Satellite Imagery | Sungwon Han; Donghyun Ahn; Sungwon Park; Jeasurk Yang; Susang Lee; Jihee Kim; Hyunjoo Yang; Sangyoon Park; Meeyoung Cha; | In this paper, we introduce a novel approach for measuring economic development from high-resolution satellite images in the absence of ground truth statistics. |
292 | A Request-level Guaranteed Delivery Advertising Planning: Forecasting and Allocation | Hong Zhang; Lan Zhang; Lan Xu; Xiaoyang Ma; Zhengtao Wu; Cong Tang; Wei Xu; Yiguo Yang; | Facing the challenges, we present a holistic design of a request-level guaranteed delivery advertising planning system with careful optimization for all three critical components including impression forecasting, selling and serving. |
293 | Two Sides of the Same Coin: White-box and Black-box Attacks for Transfer Learning | Yinghua Zhang; Yangqiu Song; Jian Liang; Kun Bai; Qiang Yang; | To figure out this problem, we conduct extensive empirical evaluations to show that fine-tuning effectively enhances model robustness under white-box FGSM attacks. We also propose a black-box attack method for transfer learning models which attacks the target model with the adversarial examples produced by its source model. |
294 | Learning to Generate Personalized Query Auto-Completions via a Multi-View Multi-Task Attentive Approach | Di Yin; Jiwei Tan; Zhe Zhang; Hongbo Deng; Shujian Huang; Jiajun Chen; | In this paper, we study the task of Query Auto-Completion (QAC), which is a very significant feature of modern search engines. |
295 | A Sleeping, Recovering Bandit Algorithm for Optimizing Recurring Notifications | Kevin P. Yancey; Burr Settles; | In this paper, we introduce the Recovering Difference Softmax Algorithm to address the particular challenges of this problem domain, and use it to successfully optimize millions of daily reminders for the online language-learning app Duolingo. |
296 | Multi-objective Optimization for Guaranteed Delivery in Video Service Platform | Hang Lei; Yin Zhao; Longjun Cai; | In this paper, we study the problem of how to maximize certain gains, such as video view (VV) or fairness of different contents (CTR variations between contents) under the GD constraints. |
297 | Delivery Scope: A New Way of Restaurant Retrieval for On-demand Food Delivery Service | Xuetao Ding; Runfeng Zhang; Zhen Mao; Ke Xing; Fangxiao Du; Xingyu Liu; Guoxing Wei; Feifan Yin; Renqing He; Zhizhao Sun; | In order to draw suitable delivery scopes for millions of restaurant partners, we propose a pioneering delivery scope generation framework. |
298 | Fraud Transactions Detection via Behavior Tree with Local Intention Calibration | Can Liu; Qiwei Zhong; Xiang Ao; Li Sun; Wangli Lin; Jinghua Feng; Qing He; Jiayu Tang; | In this paper, we devise a tree-like structure named behavior tree to reorganize the user behavioral data, in which a group of successive sequential actions denoting a specific user intention are represented as a branch on the tree. |
299 | Balanced Order Batching with Task-Oriented Graph Clustering | Lu Duan; Haoyuan Hu; Zili Wu; Guozheng Li; Xinhang Zhang; Yu Gong; Yinghui Xu; | In this paper, rather than designing heuristics, we propose an end-to-end learning and optimization framework named Balanced Task-orientated Graph Clustering Network (BTOGCN) to solve the BOBP by reducing it to balanced graph clustering optimization problem. |
300 | Efficiently Solving the Practical Vehicle Routing Problem: A Novel Joint Learning Approach | Lu Duan; Yang Zhan; Haoyuan Hu; Yu Gong; Jiangwen Wei; Xiaodong Zhang; Yinghui Xu; | We propose a strategy that combines the reinforcement learning manner with the supervised learning manner to train the model. |
301 | Meta-Learning for Query Conceptualization at Web Scale | Fred X. Han; Di Niu; Haolan Chen; Weidong Guo; Shengli Yan; Bowei Long; | In this paper, we study the problem of query conceptualization, which is to find the most appropriate matching concepts for any given search query from a large pool of pre-defined concepts. |
302 | Hybrid Spatio-Temporal Graph Convolutional Network: Improving Traffic Prediction with Navigation Data | Rui Dai; Shenkun Xu; Qian Gu; Chenguang Ji; Kaikui Liu; | Specifically, we propose an algorithm to acquire the upcoming traffic volume from an online navigation engine. |
303 | Multitask Mixture of Sequential Experts for User Activity Streams | Zhen Qin; Yicheng Cheng; Zhe Zhao; Zhe Chen; Donald Metzler; Jingzheng Qin; | In this work, we study the challenging problem of how to model sequential user behavior in the neural multi-task learning settings. |
304 | Identifying Homeless Youth At-Risk of Substance Use Disorder: Data-Driven Insights for Policymakers | Maryam Tabar; Heesoo Park; Stephanie Winkler; Dongwon Lee; Anamika Barman-Adhikari; Amulya Yadav; | This work aims to fill this gap by making the following three contributions: (i) we use a real-world dataset collected from ~1,400 homeless youth (across six American states) to build accurate Machine Learning (ML) models for predicting the susceptibility of homeless youth to SUD; (ii) we find a representative set of factors associated with SUD among this population by analyzing feature importance values associated with our ML models; and (iii) we investigate the effect of geographical heterogeneity on the factors associated with SUD. |
305 | Interleaved Sequence RNNs for Fraud Detection | Bernardo Branco; Pedro Abreu; Ana Sofia Gomes; Mariana S. C. Almeida; João Tiago Ascensão; Pedro Bizarro; | We present a complete RNN framework to detect fraud in real-time, proposing an efficient ML pipeline from preprocessing to deployment. |
306 | Attention based Multi-Modal New Product Sales Time-series Forecasting | Vijay Ekambaram; Kushagra Manglik; Sumanta Mukherjee; Surya Shravan Kumar Sajja; Satyam Dwivedi; Vikas Raykar; | In this paper, we propose and empirically evaluate several novel attention-based multi-modal encoder-decoder models to forecast the sales for a new product purely based on product images, any available product attributes and also external factors like holidays, events, weather, and discount. |
307 | Pest Management In Cotton Farms: An AI-System Case Study from the Global South | Aman Dalmia; Jerome White; Ankit Chaurasia; Vishal Agarwal; Rajesh Jain; Dhruvin Vora; Balasaheb Dhame; Raghu Dharmaraju; Rahul Panicker; | We address this problem by presenting a new solution for pesticide management that uses deep learning, smartphone cameras, inexpensive pest traps, existing digital pipelines, and agricultural extension-worker programs. |
308 | TIES: Temporal Interaction Embeddings for Enhancing Social Media Integrity at Facebook | Nima Noorshams; Saurabh Verma; Aude Hofleitner; | In this paper, we present our efforts to protect various social media entities at Facebook from people who try to abuse our platform. |
309 | Price Investment using Prescriptive Analytics and Optimization in Retail | Prakhar Mehrotra; Linsey Pang; Karthick Gopalswamy; Avinash Thangali; Timothy Winters; Ketki Gupte; Dnyanesh Kulkarni; Sunil Potnuru; Supreeth Shastry; Harshada Vuyyuri; | In this paper, we apply Machine Learning (ML) algorithms and Operations Research techniques for forecasting and optimization to build a new price recommendation system, which improves our ability to generate price recommendations accurately and automatically. |
310 | Climate Downscaling Using YNet: A Deep Convolutional Network with Skip Connections and Fusion | Yumin Liu; Auroop R. Ganguly; Jennifer Dy; | In this paper, we proposed YNet, a novel deep convolutional neural network (CNN) with skip connections and fusion capabilities to perform downscaling for climate variables, on multiple GCMs directly rather than on reanalysis data. |
311 | Cracking the Black Box: Distilling Deep Sports Analytics | Xiangyu Sun; Jack Davis; Oliver Schulte; Guiliang Liu; | We propose and compare several scalable model tree learning heuristics to address the computational challenge from datasets with millions of data points. |
312 | Taming Pretrained Transformers for Extreme Multi-label Text Classification | Wei-Cheng Chang; Hsiang-Fu Yu; Kai Zhong; Yiming Yang; Inderjit S. Dhillon; | In this paper, we propose X-Transformer, the first scalable approach to fine-tuning deep transformer models for the XMC problem. |
313 | Prediction of Hourly Earnings and Completion Time on a Crowdsourcing Platform | Anna Lioznova; Alexey Drutsa; Vladimir Kukushkin; Anastasia Bezzubtseva; | We study the problem of predicting future hourly earnings and task completion time for a crowdsourcing platform user who sees the list of available tasks and wants to select one of them to execute. |
314 | SimClusters: Community-Based Representations for Heterogeneous Recommendations at Twitter | Venu Satuluri; Yao Wu; Xun Zheng; Yilei Qian; Brian Wichers; Qieyun Dai; Gui Ming Tang; Jerry Jiang; Jimmy Lin; | In this paper, we present SimClusters, a general-purpose representation layer based on overlapping communities into which users as well as heterogeneous content can be captured as sparse, interpretable vectors to support a multitude of recommendation tasks. |
315 | Time-Aware User Embeddings as a Service | Martin Pavlovski; Jelena Gligorijevic; Ivan Stojkovic; Shubham Agrawal; Shabhareesh Komirishetty; Djordje Gligorijevic; Narayan Bhamidipati; Zoran Obradovic; | To that end, we address the limitations of the current state-of-the-art self-supervised methods for task-independent (unsupervised) sequence embedding, and propose a novel Time-Aware Sequential Autoencoder (TASA) that accounts for the temporal aspects of sequences of activities. |
316 | Shop The Look: Building a Large Scale Visual Shopping System at Pinterest | Raymond Shiau; Hao-Yu Wu; Eric Kim; Yue Li Du; Anqi Guo; Zhiyuan Zhang; Eileen Li; Kunlong Gu; Charles Rosenberg; Andrew Zhai; | In this work, we provide a holistic view of how we built Shop The Look, a shopping oriented visual search system, along with lessons learned from addressing shopping needs. |
317 | Dynamic Heterogeneous Graph Neural Network for Real-time Event Prediction | Wenjuan Luo; Han Zhang; Xiaodi Yang; Lin Bo; Xiaoqing Yang; Zang Li; Xiaohu Qie; Jieping Ye; | In this paper, we propose to use dynamically constructed heterogeneous graph for each ongoing event to encode the attributes of the event and its surroundings. |
318 | Bandit based Optimization of Multiple Objectives on a Music Streaming Platform | Rishabh Mehrotra; Niannan Xue; Mounia Lalmas; | This paper aims at extending contextual bandits to multi-objective setting so as to power recommendations in a multi-stakeholder platforms. |
319 | Multimodal Deep Learning Based Crop Classification Using Multispectral and Multitemporal Satellite Imagery | Krishna Karthik Gadiraju; Bharathkumar Ramachandra; Zexi Chen; Ranga Raju Vatsavai; | In this paper, we present a multimodal deep learning solution that jointly exploits spatial-spectral and phenological properties to identify major crop types. |
320 | BusTr: Predicting Bus Travel Times from Real-Time Traffic | Richard Barnes; Senaka Buthpitiya; James Cook; Alex Fabrikant; Andrew Tomkins; Fangzhou Xu; | We present BusTr, a machine-learned model for translating road traffic forecasts into predictions of bus delays, used by Google Maps to serve the majority of the world’s public transit systems where no official real-time bus tracking is provided. |
321 | Characterizing and Learning Representation on Customer Contact Journeys in Cellular Services | Shuai Zhao; Wen-Ling Hsu; George Ma; Tan Xu; Guy Jacobson; Raif Rustamov; | We propose to learn journey embeddings using a sequence-to-sequence framework that converts each customer journey into a fixed-length latent embedding. |
322 | CrowdQuake: A Networked System of Low-Cost Sensors for Earthquake Detection via Deep Learning | Xin Huang; Jangsoo Lee; Young-Woo Kwon; Chul-Ho Lee; | In this paper, we present CrowdQuake, a networked system based on low-cost acceleration sensors, which monitors ground motions and detects earthquakes, by developing a convolutional-recurrent neural network model. |
323 | An Empirical Analysis of Backward Compatibility in Machine Learning Systems | Megha Srivastava; Besmira Nushi; Ece Kamar; Shital Shah; Eric Horvitz; | We consider how updates, intended to improve ML models, can introduce new errors that can significantly affect downstream systems and users. |
324 | DeepTriage: Automated Transfer Assistance for Incidents in Cloud Services | Phuong Pham; Vivek Jain; Lukas Dauterman; Justin Ormont; Navendu Jain; | To address these challenges, we introduce DeepTriage, an intelligent incident transfer service combining multiple machine learning techniques – gradient boosted classifiers, clustering methods, and deep neural networks – in an ensemble to recommend the responsible team to triage an incident. |
325 | An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images | Zekun Li; Yao-Yi Chiang; Sasan Tavakkol; Basel Shbita; Johannes H. Uhl; Stefan Leyk; Craig A. Knoblock; | This paper presents an end-to-end approach to address the real-world problem of finding and indexing historical map images. |
326 | Bootstrapping Complete The Look at Pinterest | Eileen Li; Eric Kim; Andrew Zhai; Josh Beal; Kunlong Gu; | In this paper, we will describe how we bootstrapped the Complete The Look (CTL) system at Pinterest. |
327 | Explainable Classification of Brain Networks via Contrast Subgraphs | Tommaso Lanciano; Francesco Bonchi; Aristides Gionis; | In this paper we introduce a novel approach for classifying brain networks based on extracting contrast subgraphs, i.e., a set of vertices whose induced subgraphs are dense in one class of graphs and sparse in the other. |
328 | Jointly Learning to Recommend and Advertise | Xiangyu Zhao; Xudong Zheng; Xiwang Yang; Xiaobing Liu; Jiliang Tang; | To this end, in this paper, we propose a novel two-level reinforcement learning framework to jointly optimize the recommending and advertising strategies, where the first level generates a list of recommendations to optimize user experience in the long run; then the second level inserts ads into the recommendation list that can balance the immediate advertising revenue from advertisers and the negative influence of ads on long-term user experience. |
329 | Fitbit for Chickens?: Time Series Data Mining Can Increase the Productivity of Poultry Farms | Alireza Abdoli; Sara Alaee; Shima Imani; Amy Murillo; Alec Gerry; Leslie Hickle; Eamonn Keogh; | In this work, we propose a general-purpose framework to robustly learn and classify from datasets exhibiting these issues. |
330 | CompactETA: A Fast Inference System for Travel Time Prediction | Kun Fu; Fanlin Meng; Jieping Ye; Zheng Wang; | In this paper, we develop a novel ETA learning system named as CompactETA, which provides an accurate online travel time inference within 100 microseconds. |
331 | Intelligent Exploration for User Interface Modules of Mobile App with Collective Learning | Jingbo Zhou; Zhenwei Tang; Min Zhao; Xiang Ge; Fuzheng Zhuang; Meng Zhou; Liming Zou; Chenglei Yang; Hui Xiong; | To this end, we introduce FEELER, a framework to fast and intelligently explore design solutions of user interface modules with a collective machine learning approach. |
332 | Gemini: A Novel and Universal Heterogeneous Graph Information Fusing Framework for Online Recommendations | Jixing Xu; Zhenlong Zhu; Jianxin Zhao; Xuanye Liu; Minghui Shan; Jiecheng Guo; | To solve the above problems, we propose a universal and effective framework named Gemini, which only relies on the common interaction logs, avoiding the dependence on auxiliary information and ensuring a better generality. |
333 | Hypergraph Convolutional Recurrent Neural Network | Jaehyuk Yi; Jinkyoo Park; | In this study, we present a hypergraph convolutional recurrent neural network (HGC-RNN), which is a prediction model for structured time-series sensor network data. |
334 | Towards Building an Intelligent Chatbot for Customer Service: Learning to Respond at the Appropriate Time | Che Liu; Junfeng Jiang; Chao Xiong; Yi Yang; Jieping Ye; | In this paper, we propose a multi-turn response triggering model (MRTM) to address this problem. |
335 | Ads Allocation in Feed via Constrained Optimization | Jinyun Yan; Zhiyuan Xu; Birjodh Tiwana; Shaunak Chatterjee; | The paper describes how large-scale recommender system like feed ranking works, and why it is useful to consider ads allocation as a post-operation once the ranking of organic items and (separately) the ranking of ads are done. |
336 | USAD: UnSupervised Anomaly Detection on Multivariate Time Series | Julien Audibert; Pietro Michiardi; Frédéric Guyard; Sébastien Marti; Maria A. Zuluaga; | In this paper, we propose a fast and stable method called UnSupervised Anomaly Detection for multivariate time series (USAD) based on adversely trained autoencoders. |
337 | A Dual Heterogeneous Graph Attention Network to Improve Long-Tail Performance for Shop Search in E-Commerce | Xichuan Niu; Bofang Li; Chenliang Li; Rong Xiao; Haochuan Sun; Hongbo Deng; Zhenzhong Chen; | Specifically, we propose a dual heterogeneous graph attention network (DHGAT) integrated with the two-tower architecture, using the user interaction data from both shop search and product search. |
338 | Learning with Limited Labels via Momentum Damped & Differentially Weighted Optimization | Rishabh Mehrotra; Ashish Gupta; | In this paper, we consider the task of learning from limited labeled data, wherein we aim at jointly leveraging strong supervision data (e.g. explicit judgments) along with weak supervision data (e.g. implicit feedback or labels from the related task) to train neural models. |
339 | Learning to Simulate Human Mobility | Jie Feng; Zeyu Yang; Fengli Xu; Haisu Yu; Mudan Wang; Yong Li; | To solve this problem, we propose a model-free generative adversarial framework, which effectively integrates the domain knowledge of human mobility regularity utilized in the model-based methods. |
340 | Data-driven Simulation and Optimization for Covid-19 Exit Strategies | Salah Ghamizi, Renaud Rwemalika, Maxime Cordy, Lisa Veiber, Tegawendé F. Bissyandé, Mike Papadakis, Jacques Klein, Yves Le Traon; | In this paper, we propose to augment epidemiological forecasting with actual data-driven models that will learn to fine-tune predictions for different contexts (e.g., per country). |
341 | Understanding the Impact of the COVID-19 Pandemic on Transportation-related Behaviors with Human Mobility Data | Jizhou Huang; Haifeng Wang; Miao Fan; An Zhuo; Yibo Sun; Ying Li; | To be specific, we conduct data-driven analysis on transportation-related behaviors during the pandemic from the perspectives of 1) means of transportation, 2) type of visited venues, 3) check-in time of venues, 4) preference on "origin-destination” distance, and 5) "origin-transportation-destination” patterns. |
342 | Simulating the Impact of Hospital Capacity and Social Isolation to Minimize the Propagation of Infectious Diseases | Shaon Bhatta Shuvo; Bonaventure C. Molokwu; Ziad Kobti; | In this paper, we used artificial agent-based simulation modeling to identify the importance of social distancing and hospitals’ capacity in terms of the number of beds to shorten the length of an outbreak and reduce the total number of infections and deaths during an epidemic. |
343 | Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs | Clara H. McCreery; Namit Katariya; Anitha Kannan; Manish Chablani; Xavier Amatriain; | In this paper, we show how a double fine-tuning approach of pretraining a neural network on medical question-answer pairs followed by fine-tuning on medical question-question pairs is a particularly useful intermediate task for the ultimate goal of determining medical question similarity. |
344 | Hi-COVIDNet: Deep Learning Approach to Predict Inbound COVID-19 Patients and Case Study in South Korea | Minseok Kim; Junhyeok Kang; Doyoung Kim; Hwanjun Song; Hyangsuk Min; Youngeun Nam; Dongmin Park; Jae-Gil Lee; | In this paper, to aid in such allocation by predicting the number of inbound COVID-19 cases, we propose Hi-COVIDNet, which takes advantage of the geographic hierarchy. |
345 | Exploring Automatic Diagnosis of COVID-19 from Crowdsourced Respiratory Sound Data | Chloë Brown, Jagmohan Chauhan, Andreas Grammenos, Jing Han, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Cecilia Mascolo; | In this paper we describe our data analysis over a large-scale crowdsourced dataset of respiratory sounds collected to aid diagnosis of COVID-19. |
346 | Understanding the Urban Pandemic Spreading of COVID-19 with Real World Mobility Data | Qianyue Hao; Lin Chen; Fengli Xu; Yong Li; | To address these challenges, we build a data-driven epidemic simulator with COVID-19 specific features, which incorporates real-world mobility data capturing the heterogeneity in urban environments. |