# JMLR Workshop and Conference Proceedings

## Volume 37: Proceedings of The 32nd International Conference on Machine Learning

**Editors:** Francis Bach, David Blei

### Accepted Papers

Attribute Efficient Linear Regression with Distribution-Dependent Sampling

Finding Linear Structure in Large Datasets with Scalable Canonical Correlation Analysis

A Relative Exponential Weighing Algorithm for Adversarial Utility-based Dueling Bandits

Accelerated Online Low Rank Tensor Learning for Multivariate Spatiotemporal Streams

A Modified Orthant-Wise Limited Memory Quasi-Newton Method with Convergence Analysis

Generalization error bounds for learning to rank: Does the length of document lists matter?

PeakSeg: constrained optimal segmentation and supervised penalty learning for peak detection in count data

Paired-Dual Learning for Fast Training of Latent Variable Hinge-Loss MRFs

A Provable Generalized Tensor Spectral Method for Uniform Hypergraph Partitioning

Budget Allocation Problem with Multiple Advertisers: A Game Theoretic View

Tracking Approximate Solutions of Parameterized Optimization Problems over Multi-Dimensional (Hyper-)Parameter Domains

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Distributed Estimation of Generalized Matrix Rank: Efficient Algorithms and Lower Bounds

The Fundamental Incompatibility of Scalable Hamiltonian Monte Carlo and Naive Data Subsampling

Ordered Stick-Breaking Prior for Sequential MCMC Inference of Bayesian Nonparametric Models

A Unifying Framework of Anytime Sparse Gaussian Process Regression Models with Stochastic Variational Inference for Big Data

Online Tracking by Learning Discriminative Saliency Map with Convolutional Neural Network

Fast Kronecker Inference in Gaussian Processes with non-Gaussian Likelihoods

Statistical and Algorithmic Perspectives on Randomized Sketching for Ordinary Least-Squares

On TD(0) with function approximation: Concentration bounds and a centered variant with exponential convergence

Latent Gaussian Processes for Distribution Estimation of Multivariate Categorical Data

Improving the Gaussian Process Sparse Spectrum Approximation by Representing Uncertainty in Frequency Inputs

Ranking from Stochastic Pairwise Preferences: Recovering Condorcet Winners and Tournament Solution Sets at the Top

Log-Euclidean Metric Learning on Symmetric Positive Definite Manifold with Application to Image Set Classification

Multi-view Sparse Co-clustering via Proximal Alternating Linearized Minimization

Latent Topic Networks: A Versatile Probabilistic Programming Framework for Topic Models

Random Coordinate Descent Methods for Minimizing Decomposable Submodular Functions

DP-space: Bayesian Nonparametric Subspace Clustering with Small-variance Asymptotics

HawkesTopic: A Joint Model for Network Inference and Topic Modeling from Text-Based Cascades

Large-scale log-determinant computation through stochastic Chebyshev expansions

Enabling scalable stochastic gradient-based inference for Gaussian processes by employing the Unbiased LInear System SolvEr (ULISSE)

Large-Scale Markov Decision Problems with KL Control Cost and its Application to Crowdsourcing

Safe Subspace Screening for Nuclear Norm Regularized Least Squares Problems

Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays

Non-Linear Cross-Domain Collaborative Filtering via Hyper-Structure Transfer

The Power of Randomization: Distributed Submodular Maximization on Massive Datasets

Non-Gaussian Discriminative Factor Models via the Max-Margin Rank-Likelihood

Convergence rate of Bayesian tensor estimator and its minimax optimality

On Identifying Good Options under Combinatorially Structured Feedback in Finite Noisy Environments

Rebuilding Factorized Information Criterion: Asymptotically Accurate Marginal Likelihood

Double Nyström Method: An Efficient and Accurate Nyström Scheme for Large-Scale Data Sets

A Deterministic Analysis of Noisy Sparse Subspace Clustering for Dimensionality-reduced Data

\(\ell_{1,p}\)-Norm Regularization: Error Bounds and Convergence Rate Analysis of First-Order Methods

Entropy evaluation based on confidence intervals of frequency estimates: Application to the learning of decision trees

An Empirical Study of Stochastic Variational Inference Algorithms for the Beta Bernoulli Process

Coordinate Descent Converges Faster with the Gauss-Southwell Rule Than Random Selection

Predictive Entropy Search for Bayesian Optimization with Unknown Constraints

Kernel Interpolation for Scalable Structured Gaussian Processes (KISS-GP)

Robust Estimation of Transition Matrices in High Dimensional Heavy-tailed Vector Autoregressive Processes

Probabilistic Backpropagation for Scalable Learning of Bayesian Neural Networks

Preference Completion: Large-scale Collaborative Ranking from Pairwise Comparisons

Causal Inference by Identification of Vector Autoregressive Processes with Hidden Components

Unsupervised Riemannian Metric Learning for Histograms Using Aitchison Transformations

Algorithms for the Hard Pre-Image Problem of String Kernels and the General Problem of String Prediction

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

A Fast Variational Approach for Learning Markov Random Field Language Models

Scalable Nonparametric Bayesian Inference on Point Processes with Gaussian Processes

Intersecting Faces: Non-negative Matrix Factorization With New Guarantees

Global Convergence of Stochastic Gradient Descent for Some Non-convex Matrix Problems

Safe Policy Search for Lifelong Reinforcement Learning with Sublinear Regret

Optimizing Neural Networks with Kronecker-factored Approximate Curvature

A Convex Exemplar-based Approach to MAD-Bayes Dirichlet Process Mixture Models

Multi-instance multi-label learning in the presence of novel class instances

An Asynchronous Distributed Proximal Gradient Method for Composite Convex Optimization

Boosted Categorical Restricted Boltzmann Machine for Computational Prediction of Splice Junctions

Privacy for Free: Posterior Sampling and Stochastic Gradient Monte Carlo

A trust-region method for stochastic variational inference with applications to streaming data

Inference in a Partially Observed Queuing Model with Applications in Ecology

On the Optimality of Multi-Label Classification under Subset Zero-One Loss for Distributions Satisfying the Composition Property

Un-regularizing: approximate proximal point and faster stochastic algorithms for empirical risk minimization

