Please do advise with any papers I may have missed due to title quirks, etc. I will likely update this to publicly available papers that are not on arxiv.
- Approval Voting and Incentives in Crowdsourcing
- [1406.3852] A low variance consistent test of relative ... - arXiv
- Spectral Clustering via the Power Method -- Provably - arXiv
- [1312.4564] Adaptive Stochastic Alternating Direction ... - arXiv
- A Lower Bound for the Optimization of Finite Sums
- Learning Word Representations with Hierarchical Sparse ...
- Learning Transferable Features with Deep Adaptation ...
- How transferable are features in deep neural networks?
- On the Relationship between Sum-Product Networks ... - arXiv
- [1505.00526] An Explicit Sampling Dependent Spectral ...
- A Stochastic PCA and SVD Algorithm with an Exponential ...
- Learning Local Invariant Mahalanobis Distances
- [1501.03273] Classification with Low Rank and Missing Data
- Telling cause from effect in deterministic linear dynamical ...
- High Dimensional Bayesian Optimisation and Bandits via ...
- [1504.03991] Theory of Dual-sparse Regularized ... - arXiv
- A General Analysis of the Convergence of ADMM
- Stochastic Primal-Dual Coordinate Method for Regularized ...
- Spectral MLE: Top-$ K $ Rank Aggregation from Pairwise ...
- Exploring Algorithmic Limits of Matrix Rank Minimization ...
- Batch Normalization: Accelerating Deep Network Training ...
- Distributed Estimation of Generalized Matrix Rank: Efficient ...
- [1402.5876] Manifold Gaussian Processes for Regression
- Online Regret Bounds for Undiscounted Continuous ... - arXiv
- The Fundamental Incompatibility of Hamiltonian Monte ...
- Faster Rates for the Frank-Wolfe Method over Strongly ...
- Online Tracking by Learning Discriminative Saliency Map ...
- A Statistical Perspective on Randomized Sketching for ...
- [1411.3224] On TD(0) with function approximation ... - arXiv
- Learning Parametric-Output HMMs with Two Aliased States
- Latent Gaussian Processes for Distribution Estimation of ...
- Variational inference for sparse spectrum Gaussian process ...
- Stochastic Dual Coordinate Ascent with Adaptive Probabilities
- JUMP-Means: Small-Variance Asymptotics for Markov Jump ...
- [1211.0358] Deep Gaussian Processes - arXiv
- Fast Bilingual Distributed Representations without Word ...
- Cascading Bandits
- Random Coordinate Descent Methods for Minimizing ...
- Counterfactual Risk Minimization: Learning from Logged ...
- A Linear Dynamical System Model for Text
- Unsupervised Learning of Video Representations using ...
- MADE: Masked Autoencoder for Distribution Estimation
- Large-scale Log-determinant Computation through ...
- Differentially Private Bayesian Optimization
- Rademacher Observations, Private Data, and Boosting
- Bayesian and empirical Bayesian forests
- The Ladder: A Reliable Leaderboard for Machine Learning ...
- Enabling scalable stochastic gradient-based inference for ...
- Reified Context Models
- Learning Fast-Mixing Models for Structured Prediction
- [1406.6947] Deep Learning Multi-View Representation for ...
- [1406.7443] Efficient Learning in Large-Scale Combinatorial ...
- [1406.4311] Sparse Estimation with the Swept ... - arXiv
- Unsupervised Domain Adaptation by Backpropagation
- Markov Chain Monte Carlo and Variational Inference ...
- The Power of Randomization: Distributed Submodular ...
- Non-Gaussian Discriminative Factor Models via the Max ...
- Nested Sequential Monte Carlo Methods
- [1402.1389] Distributed Variational Inference in Sparse ...
- [1402.1412] Variational Inference in Sparse Gaussian ...
- Rebuilding Factorized Information Criterion: Asymptotically ...
- [1311.0776] The Composition Theorem for Differential Privacy
- Strongly Adaptive Online Learning
- [1411.0860] CUR Algorithm for Partially Observed Matrices
- Scaling-up Empirical Risk Minimization: Optimization of ...
- Towards a Learning Theory of Causation
- DRAW: A Recurrent Neural Network For Image Generation
- Distributed Gaussian Processes
- [1302.2684] A Tensor Approach to Learning Mixed ... - arXiv
- Consistent Estimation of Dynamic and Multi-layer Networks
- [1405.3229] Rate of Convergence and Error Bounds for ...
- Convex Learning of Multiple Tasks and their Structure - arXiv
- [1304.5610] Tight Performance Bounds for Approximate ...
- Approximate Modified Policy Iteration
- Long Short-Term Memory Over Tree Structures
- Predictive Entropy Search for Bayesian Optimization with ...
- Generative Moment Matching Networks
- Deep Learning with Limited Numerical Precision
- Teaching Deep Convolutional Neural Networks to Play Go
- Kernel Interpolation for Scalable Structured Gaussian ...
- [1407.2538] Learning Deep Structured Models - arXiv
- Personalized PageRank Solution Paths
- Scalable Variational Inference in Log-supermodular Models
- Variational Inference for Gaussian Process Modulated ...
- Probabilistic Backpropagation for Scalable Learning of ...
- Trust Region Policy Optimization
- [1410.5518] On Symmetric and Asymmetric LSHs for Inner ...
- Adding vs. Averaging in Distributed Primal-Dual Optimization
- Feature-Budgeted Random Forest
- Show, Attend and Tell: Neural Image Caption Generation ...
- Learning to Search Better Than Your Teacher
- Gated Feedback Recurrent Neural Networks
- [1502.03671] Phrase-based Image Captioning - arXiv
- Gradient-based Hyperparameter Optimization through ...
- [1406.1901] Subsampling Methods for Persistent Homology
- Binary Embedding: Fundamental Limits and Fast Algorithm
- Scalable Bayesian Optimization Using Deep Neural Networks
- Scalable Nonparametric Bayesian Inference on Point ...
- Deep Unsupervised Learning using Nonequilibrium ...
- Compressing Neural Networks with the Hashing Trick - arXiv
- Optimal and Adaptive Algorithms for Online Boosting
- [1411.1134] Global Convergence of Stochastic Gradient ...
- [1504.06785] Complete Dictionary Recovery over the Sphere
- PASSCoDe: Parallel ASynchronous Stochastic dual Co ...
- Optimizing Neural Networks with Kronecker-factored ...
- Novelty Detection Under Multi-Instance Multi-Label ... - arXiv
- [1212.4663] Concentration of Measure Inequalities in ...
- PU Learning for Matrix Completion
- A Distributed Proximal Method for Composite Convex ...
- Posterior Sampling and Stochastic Gradient Monte Carlo
- Inference for Partially Observed Multitype Branching ...