Parameterized complexity
Deeper, Broader and Artier Domain Generalization
The problem of domain generalization is to learn from multiple training domains, and extract a domain-agnostic model that can then be applied to an unseen domain. Domain generalization (DG) has a clear motivation in contexts where there ar…
Joint Analysis of BICEP2/Keck Array and Planck Data
We report the results of a joint analysis of data from BICEP2/Keck Array and Planck. BICEP2 and Keck Array have observed the same approximately 400 deg² patch of sky centered on RA 0 h, Dec. -57.5°. The combined maps reach a depth of 57…
ECCO version 4: an integrated framework for non-linear inverse modeling and global ocean state estimation
This paper presents the ECCO v4 non-linear inverse modeling framework and its baseline solution for the evolving ocean state over the period 1992–2011. Both components are publicly available and subjected to regular, automated regression t…
Meta-Learning for Semi-Supervised Few-Shot Classification
In few-shot classification, we are interested in learning algorithms that train a classifier from only a handful of labeled examples. Recent progress in few-shot classification has featured meta-learning, in which a parameterized model for…
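As a rough sketch of the semi-supervised refinement idea, the snippet below runs a soft k-means step that pulls class prototypes toward unlabeled embeddings; it is a simplification in our own notation (the paper also anchors prototypes to the labeled support set and models a distractor cluster, both omitted here).

```python
import torch

def refine_prototypes(protos, unlabeled, n_iters=1):
    """One or more soft k-means steps: softly assign unlabeled embeddings to
    the current class prototypes, then re-estimate each prototype as the
    assignment-weighted mean of those embeddings."""
    for _ in range(n_iters):
        dists = torch.cdist(unlabeled, protos) ** 2   # (n_unlabeled, n_classes)
        assign = torch.softmax(-dists, dim=1)         # soft assignments
        protos = (assign.T @ unlabeled) / assign.sum(dim=0, keepdim=True).T
    return protos
```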
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. To address this cost, a number of quantization schemes have been proposed - but most of these techniques focused on quantizing we…
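A minimal PACT-style sketch, assuming a PyTorch layout, a straight-through estimator for the rounding step, and illustrative defaults for the bit-width and initial clipping level:

```python
import torch
import torch.nn as nn

class PACT(nn.Module):
    """PACT-style activation: ReLU clipped at a learnable threshold alpha,
    then uniformly quantized to k bits with a straight-through estimator.
    k_bits and alpha_init are illustrative assumptions."""
    def __init__(self, k_bits=4, alpha_init=6.0):
        super().__init__()
        self.k_bits = k_bits
        self.alpha = nn.Parameter(torch.tensor(alpha_init))

    def forward(self, x):
        # Clip to [0, alpha]; alpha gets gradient wherever the input exceeds it.
        y = torch.clamp(x, min=0.0)
        y = torch.minimum(y, self.alpha)
        # Uniform quantization, with rounding made transparent to autograd.
        scale = (2 ** self.k_bits - 1) / self.alpha
        y_q = torch.round(y * scale) / scale
        return y + (y_q - y).detach()
```

Because alpha is trained like any other parameter, the quantization range can adapt per layer instead of being fixed in advance.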
A Model of Text for Experimentation in the Social Sciences
Statistical models of text have become increasingly popular in statistics and computer science as a method of exploring large document collections. Social scientists often want to move beyond exploration, to measurement and experimentation…
Deep forest
Current deep-learning models are mostly built upon neural networks, i.e. multiple layers of parameterized differentiable non-linear modules that can be trained by backpropagation. In this paper, we explore the possibility of building deep …
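A toy cascade in the spirit of this idea, assuming sklearn random forests and in-sample probabilities (the paper uses cross-validated class vectors and a multi-grained scanning stage, both omitted here):

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def cascade_predict_proba(X_train, y_train, X_test, n_levels=3):
    """Each level is a random forest whose class-probability outputs are
    appended to the raw features before the next level is trained."""
    aug_train, aug_test = X_train, X_test
    proba_test = None
    for _ in range(n_levels):
        rf = RandomForestClassifier(n_estimators=100).fit(aug_train, y_train)
        proba_train = rf.predict_proba(aug_train)
        proba_test = rf.predict_proba(aug_test)
        aug_train = np.hstack([X_train, proba_train])
        aug_test = np.hstack([X_test, proba_test])
    return proba_test
```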
Attention-based Deep Multiple Instance Learning
Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the …
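The pooling step can be sketched as a small tanh-attention network that weights instances before averaging; dimensions and module names below are our assumptions, not the authors' reference code.

```python
import torch
import torch.nn as nn

class AttentionMILPooling(nn.Module):
    """Attention pooling over a bag of instance embeddings: a small tanh
    network scores each instance, and the bag embedding is the softmax-
    weighted average of the instances."""
    def __init__(self, in_dim=512, attn_dim=128):
        super().__init__()
        self.V = nn.Linear(in_dim, attn_dim)
        self.w = nn.Linear(attn_dim, 1)

    def forward(self, H):                        # H: (n_instances, in_dim)
        scores = self.w(torch.tanh(self.V(H)))   # (n_instances, 1)
        a = torch.softmax(scores, dim=0)         # attention over the bag
        z = (a * H).sum(dim=0)                   # (in_dim,) bag embedding
        return z, a.squeeze(-1)
```

The attention weights double as interpretable per-instance scores, which is part of the appeal of this pooling choice.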
To prune, or not to prune: exploring the efficacy of pruning for model compression
Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep network…
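The paper's gradual schedule raises sparsity from an initial to a final value along a cubic ramp; a sketch with illustrative constants:

```python
def gradual_sparsity(step, s_init=0.0, s_final=0.90, t0=0, n=100, dt=100):
    """Cubic sparsity ramp: hold s_init before t0, reach s_final at
    t0 + n * dt, and interpolate with a cubic in between. The constants
    here are illustrative, not the paper's experimental settings."""
    t_end = t0 + n * dt
    if step < t0:
        return s_init
    if step >= t_end:
        return s_final
    frac = (step - t0) / (n * dt)
    return s_final + (s_init - s_final) * (1.0 - frac) ** 3
```

At each pruning step, weights with the smallest magnitudes are masked until the layer reaches the scheduled sparsity.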
Measuring and Mitigating Unintended Bias in Text Classification
We introduce and illustrate a new approach to measuring and mitigating unintended bias in machine learning models. Our definition of unintended bias is parameterized by a test set and a subset of input features. We illustrate how this can …
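One simplified stand-in for a bias metric parameterized this way (the paper's actual metrics include error-rate equality differences and a pinned AUC; this only compares false-positive rates across a subgroup split):

```python
import numpy as np

def fpr(y_true, y_pred):
    """False-positive rate: fraction of true negatives predicted positive."""
    neg = y_true == 0
    return (y_pred[neg] == 1).mean()

def subgroup_fpr_gap(y_true, y_pred, in_subgroup):
    """Difference in false-positive rate between examples that contain a
    given identity term and those that do not; a positive gap means the
    model raises more false alarms on the subgroup."""
    return (fpr(y_true[in_subgroup], y_pred[in_subgroup])
            - fpr(y_true[~in_subgroup], y_pred[~in_subgroup]))
```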
The SLX Model
We provide a comprehensive overview of the strengths and weaknesses of different spatial econometric model specifications in terms of spillover effects. Based on this overview, we advocate taking the SLX model as point of departure in case…
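For reference, the SLX specification augments the linear model with spatially lagged regressors (standard form, notation assumed):

```latex
y = X\beta + WX\theta + \varepsilon
```

Here W is the n-by-n spatial weights matrix, so β carries the direct effects and θ the local spillovers, which is what makes the specification straightforward to interpret as a point of departure.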
DropBlock: A regularization method for convolutional networks
Deep neural networks often work well when they are over-parameterized and trained with a massive amount of noise and regularization, such as weight decay and dropout. Although dropout is widely used as a regularization technique for fully …
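A compact DropBlock-style sketch, assuming an odd block size and sampling mask centers over the whole map (the paper restricts centers to the valid interior region; treat this as an approximation rather than reference code):

```python
import torch
import torch.nn.functional as F

def dropblock(x, drop_prob=0.1, block_size=7, training=True):
    """Zero out contiguous block_size x block_size regions of a feature map
    (B, C, H, W) instead of independent units, then rescale to preserve the
    activation mean. block_size is assumed odd."""
    if not training or drop_prob == 0.0:
        return x
    _, _, h, w = x.shape
    # Rate of block centers chosen so the expected drop fraction is drop_prob.
    gamma = (drop_prob / block_size ** 2) * (h * w) / ((h - block_size + 1) * (w - block_size + 1))
    centers = (torch.rand_like(x) < gamma).float()
    # Expand each sampled center to a full block with a max-pool.
    block_mask = 1.0 - F.max_pool2d(centers, block_size, stride=1, padding=block_size // 2)
    return x * block_mask * block_mask.numel() / block_mask.sum()
```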
Variational Dropout and the Local Reparameterization Trick
We investigate a local reparameterization technique for greatly reducing the variance of stochastic gradients for variational Bayesian inference (SGVB) of a posterior over model parameters, while retaining parallelizability. This local repa…
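For a linear layer with a factorized Gaussian posterior over its weights, the trick samples pre-activations directly rather than weight matrices; a minimal sketch in our notation:

```python
import torch

def linear_local_reparam(x, w_mu, w_logvar):
    """Given a factorized Gaussian posterior N(w_mu, exp(w_logvar)) over the
    weights of a linear layer, sample the pre-activations directly: their
    mean is x @ w_mu and their variance is (x ** 2) @ exp(w_logvar). This
    yields lower-variance gradients than sampling one weight matrix per
    minibatch."""
    act_mu = x @ w_mu                      # (batch, out)
    act_var = (x ** 2) @ w_logvar.exp()    # (batch, out)
    eps = torch.randn_like(act_mu)
    return act_mu + act_var.sqrt() * eps
```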
ResNeSt: Split-Attention Networks
It is well known that featuremap attention and multi-path representation are important for visual recognition. In this paper, we present a modularized architecture, which applies the channel-wise attention on different network branches to …
Born Again Neural Networks
Knowledge Distillation (KD) consists of transferring “knowledge” from one machine learning model (the teacher) to another (the student). Commonly, the teacher is a high-capacity model with formidable performance, while the student is more …
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
One of the mysteries in the success of neural networks is that randomly initialized first-order methods like gradient descent can achieve zero training loss even though the objective function is non-convex and non-smooth. This paper demystifies…
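A commonly quoted form of the paper's key object and the resulting rate, stated informally in our notation (H∞ is the Gram matrix of ReLU features at random initialization, λ₀ its smallest eigenvalue, and u(t) the network's predictions under gradient flow):

```latex
H^{\infty}_{ij} \;=\; \mathbb{E}_{w \sim \mathcal{N}(0, I)}\!\left[\, x_i^{\top} x_j \,\mathbf{1}\{ w^{\top} x_i \ge 0,\ w^{\top} x_j \ge 0 \} \right],
\qquad
\lVert u(t) - y \rVert_2^2 \;\le\; e^{-\lambda_0 t}\, \lVert u(0) - y \rVert_2^2 .
```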
Gradient-based Hyperparameter Optimization through Reversible Learning
Tuning hyperparameters of learning algorithms is hard because gradients are usually unavailable. We compute exact gradients of cross-validation performance with respect to all hyperparameters by chaining derivatives backwards through the e…
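A memory-hungry but self-contained way to see the idea: unroll a few SGD steps on a toy least-squares problem and backpropagate through the whole trajectory to get the hypergradient of the learning rate. The paper instead reverses SGD with momentum exactly to avoid storing the trajectory; everything below (problem, shapes, names) is our assumption.

```python
import torch

def lr_hypergradient(lr_init, w0, x_tr, y_tr, x_val, y_val, steps=10):
    """Differentiate validation loss w.r.t. the learning rate by unrolling
    SGD on a least-squares problem and backpropagating through all steps."""
    lr = torch.tensor(lr_init, requires_grad=True)
    w = w0.clone().requires_grad_(True)
    for _ in range(steps):
        train_loss = ((x_tr @ w - y_tr) ** 2).mean()
        (g,) = torch.autograd.grad(train_loss, w, create_graph=True)
        w = w - lr * g                  # keep each update in the graph
    val_loss = ((x_val @ w - y_val) ** 2).mean()
    return torch.autograd.grad(val_loss, lr)[0]
```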
Computational homogenization of nonlinear elastic materials using neural networks
In this work, a decoupled computational homogenization method for nonlinear elastic materials is proposed using neural networks. In this method, the effective potential is represented as a response surface parameterized by the macr…
Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition
One essential problem in skeleton-based action recognition is how to extract discriminative features over all skeleton joints. However, the complexity of the State-Of-The-Art (SOTA) models of this task tends to be exceedingly sophistica…
A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space
This paper presents a comparison of a graph-based genetic algorithm (GB-GA) and machine learning (ML) results for the optimization of log P values with a constraint for synthetic accessibility and shows that the GA is as good as or better …
Martini Coarse-Grained Force Field: Extension to DNA
We systematically parameterized a coarse-grained (CG) model for DNA that is compatible with the Martini force field. The model maps each nucleotide into six to seven CG beads and is parameterized following the Martini philosophy. The CG no…
On Lazy Training in Differentiable Programming
In a series of recent theoretical works, it was shown that strongly over-parameterized neural networks trained with gradient-based methods could converge exponentially fast to zero training loss, with their parameters hardly varying. In th…
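The "lazy" regime can be summarized as training that never leaves the model's first-order Taylor expansion around initialization (informal statement, our notation):

```latex
f(x; w) \;\approx\; f(x; w_0) + \nabla_w f(x; w_0)^{\top} (w - w_0)
```

In this regime the parameters barely move, so the network behaves like a linear model in w with fixed features ∇_w f(x; w₀).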
Online Knowledge Distillation with Diverse Peers
Distillation is an effective knowledge-transfer technique that uses predicted distributions of a powerful teacher model as soft targets to train a less-parameterized student model. A pre-trained high capacity teacher, however, is not alway…
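The soft-target term referred to here is the standard distillation loss; a sketch with an assumed temperature:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, T=3.0):
    """KL divergence between temperature-softened teacher and student
    distributions, scaled by T**2 so its gradient magnitude matches the
    usual hard-label loss. The temperature is an assumed value."""
    p_teacher = F.softmax(teacher_logits / T, dim=-1)
    log_p_student = F.log_softmax(student_logits / T, dim=-1)
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * T * T
```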
CondConv: Conditionally Parameterized Convolutions for Efficient Inference
Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized co…
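A CondConv-style sketch that loops over the batch for clarity (the paper fuses the per-example kernels more efficiently; the expert count and routing details below are illustrative):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CondConv2d(nn.Module):
    """Keep n_experts candidate kernels, compute per-example routing weights
    from the globally pooled input, and convolve each example with its own
    mixture of the experts. The batch loop is for clarity only."""
    def __init__(self, in_ch, out_ch, k=3, n_experts=4):
        super().__init__()
        self.experts = nn.Parameter(0.01 * torch.randn(n_experts, out_ch, in_ch, k, k))
        self.router = nn.Linear(in_ch, n_experts)
        self.pad = k // 2

    def forward(self, x):                                     # x: (B, C, H, W)
        r = torch.sigmoid(self.router(x.mean(dim=(2, 3))))    # (B, n_experts)
        outs = []
        for i in range(x.size(0)):
            w = (r[i][:, None, None, None, None] * self.experts).sum(dim=0)
            outs.append(F.conv2d(x[i:i + 1], w, padding=self.pad))
        return torch.cat(outs, dim=0)
```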
Federated Meta-Learning with Fast Convergence and Efficient Communication
Statistical and systematic challenges in collaboratively training machine learning models across distributed networks of mobile devices have been the bottlenecks in the real-world application of federated learning. In this work, we show th…
Very High Resolution Object-Based Land Use–Land Cover Urban Classification Using Extreme Gradient Boosting
In this letter, the recently developed extreme gradient boosting (Xgboost) classifier is implemented in a very high resolution (VHR) object-based urban land use-land cover application. In detail, we investigated the sensitivity of Xgboost …
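A minimal usage sketch, with synthetic features standing in for the object-based image statistics used in the letter and assumed hyperparameter values:

```python
import numpy as np
from xgboost import XGBClassifier

# Synthetic stand-ins for per-object features (e.g. spectral and texture
# statistics) and five land-use / land-cover classes.
X = np.random.rand(500, 20)
y = np.random.randint(0, 5, 500)

model = XGBClassifier(n_estimators=300, max_depth=6, learning_rate=0.1)
model.fit(X, y)
print(model.predict(X[:5]))
```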
Implicit Neural Representations with Periodic Activation Functions
Implicitly defined, continuous, differentiable signal representations parameterized by neural networks have emerged as a powerful paradigm, offering many possible benefits over conventional representations. However, current network archite…
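A SIREN-style layer sketch following the paper's sine activation and initialization scheme (ω₀ = 30 is the paper's default; the module layout is ours):

```python
import math
import torch
import torch.nn as nn

class SineLayer(nn.Module):
    """y = sin(omega0 * (W x + b)). The first layer uses a wider uniform
    init (1 / in_dim); hidden layers use sqrt(6 / in_dim) / omega0 so the
    distribution of activations stays stable with depth."""
    def __init__(self, in_dim, out_dim, omega0=30.0, first=False):
        super().__init__()
        self.omega0 = omega0
        self.linear = nn.Linear(in_dim, out_dim)
        bound = 1.0 / in_dim if first else math.sqrt(6.0 / in_dim) / omega0
        nn.init.uniform_(self.linear.weight, -bound, bound)

    def forward(self, x):
        return torch.sin(self.omega0 * self.linear(x))
```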
PremPS: Predicting the impact of missense mutations on protein stability
Computational methods that predict protein stability changes induced by missense mutations have made a lot of progress over the past decades. Most of the available methods however have very limited accuracy in predicting stabilizing mutati…
Assessing the reliability of species distribution projections in climate change research
Forecasting changes in species distribution under future scenarios is one of the most prolific areas of application for species distribution models (SDMs). However, no consensus yet exists on the reliability of such models for drawing …