Parameterized complexity
Deeper, Broader and Artier Domain Generalization
The problem of domain generalization is to learn from multiple training domains, and extract a domain-agnostic model that can then be applied to an unseen domain. Domain generalization (DG) has a clear motivation in contexts where there ar…
Meta-Learning for Semi-Supervised Few-Shot Classification
In few-shot classification, we are interested in learning algorithms that train a classifier from only a handful of labeled examples. Recent progress in few-shot classification has featured meta-learning, in which a parameterized model for…
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Deep learning algorithms achieve high classification accuracy at the expense of significant computation cost. To address this cost, a number of quantization schemes have been proposed - but most of these techniques focused on quantizing we…
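To make the clipping idea concrete, here is a minimal NumPy sketch of a PACT-style activation: values are clipped to a learnable level alpha and then uniformly quantized to k bits. The function name, the 4-bit default and the omission of the straight-through gradient used in training are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def pact_activation(x, alpha, k=4):
    """Clip activations to [0, alpha], then quantize uniformly to k bits.

    alpha plays the role of PACT's learnable clipping level; here it is a
    plain float and the straight-through gradient used for training is omitted.
    """
    y = np.clip(x, 0.0, alpha)          # bounded activation
    scale = (2 ** k - 1) / alpha        # k-bit uniform quantizer
    return np.round(y * scale) / scale

x = np.array([-0.5, 0.3, 1.2, 7.0])
print(pact_activation(x, alpha=2.0, k=4))   # values snapped to 2^k - 1 levels in [0, 2]
```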
Modeling the dynamics of novel coronavirus (2019-nCov) with fractional derivative
The present paper describes the mathematical modeling and dynamics of a novel coronavirus (2019-nCoV). We describe the brief details of interaction among the bats and unknown hosts, then among the people and the infection reservoir (sea…
A Model of Text for Experimentation in the Social Sciences
Statistical models of text have become increasingly popular in statistics and computer science as a method of exploring large document collections. Social scientists often want to move beyond exploration, to measurement and experimentation…
Deep forest
Current deep-learning models are mostly built upon neural networks, i.e. multiple layers of parameterized differentiable non-linear modules that can be trained by backpropagation. In this paper, we explore the possibility of building deep …
Attention-based Deep Multiple Instance Learning
Multiple instance learning (MIL) is a variation of supervised learning where a single class label is assigned to a bag of instances. In this paper, we state the MIL problem as learning the Bernoulli distribution of the bag label where the …
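The attention-based pooling this line of work uses can be sketched in a few lines: each instance embedding gets a learned attention weight and the bag embedding is their weighted sum, which then parameterizes the Bernoulli bag label. The weight names (V, w) and all shapes below are illustrative placeholders, not the paper's exact architecture.

```python
import numpy as np

def attention_mil_pool(H, V, w):
    """Attention-based MIL pooling: score each instance embedding h_k with
    w^T tanh(V h_k), softmax the scores over the bag, and return the
    attention-weighted bag embedding plus the weights themselves."""
    scores = np.tanh(H @ V.T) @ w            # one score per instance
    a = np.exp(scores - scores.max())
    a /= a.sum()                             # attention weights sum to 1
    return a @ H, a

rng = np.random.default_rng(0)
H = rng.normal(size=(5, 16))                 # a bag of 5 instance embeddings
V, w = rng.normal(size=(8, 16)), rng.normal(size=8)
z, a = attention_mil_pool(H, V, w)
bag_prob = 1.0 / (1.0 + np.exp(-(z @ rng.normal(size=16))))  # Bernoulli bag-label parameter
```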
To prune, or not to prune: exploring the efficacy of pruning for model compression
Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep network…
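A one-shot magnitude-pruning step illustrates the basic mechanism: zero out the smallest-magnitude weights until a target sparsity is reached. The paper studies a gradual sparsity schedule applied during training; this standalone NumPy sketch only shows the masking step.

```python
import numpy as np

def magnitude_prune(w, sparsity):
    """Zero out the smallest-magnitude entries of w so that roughly a
    `sparsity` fraction of the weights becomes exactly zero."""
    k = int(np.floor(sparsity * w.size))
    if k == 0:
        return w.copy()
    threshold = np.partition(np.abs(w).ravel(), k - 1)[k - 1]
    return w * (np.abs(w) > threshold)

w = np.random.default_rng(1).normal(size=(4, 4))
print(magnitude_prune(w, sparsity=0.75))     # 12 of 16 weights set to zero
```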
Measuring and Mitigating Unintended Bias in Text Classification
We introduce and illustrate a new approach to measuring and mitigating unintended bias in machine learning models. Our definition of unintended bias is parameterized by a test set and a subset of input features. We illustrate how this can …
DropBlock: A regularization method for convolutional networks
Deep neural networks often work well when they are over-parameterized and trained with a massive amount of noise and regularization, such as weight decay and dropout. Although dropout is widely used as a regularization technique for fully …
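For intuition, here is a single-channel NumPy sketch of the block-wise dropping idea: contiguous block_size x block_size regions are zeroed rather than independent units, and the surviving activations are rescaled as in dropout. The keep_prob default, shapes and centre-sampling rate gamma follow the commonly cited formulation; treat this as a sketch, not the reference implementation.

```python
import numpy as np

def dropblock(x, keep_prob=0.9, block_size=3, rng=None):
    """Drop contiguous block_size x block_size regions of a (H, W) feature map
    and rescale the survivors, in the spirit of dropout applied to whole blocks."""
    rng = rng or np.random.default_rng()
    h, w = x.shape
    # rate for sampling block centres so that roughly (1 - keep_prob) of units drop
    gamma = ((1 - keep_prob) / block_size**2) * (h * w) / ((h - block_size + 1) * (w - block_size + 1))
    mask = np.ones_like(x)
    centres = rng.random((h - block_size + 1, w - block_size + 1)) < gamma
    for i, j in zip(*np.nonzero(centres)):
        mask[i:i + block_size, j:j + block_size] = 0.0
    return x * mask * mask.size / max(mask.sum(), 1.0)

x = np.random.default_rng(0).random((8, 8))
print(dropblock(x))
```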
ResNeSt: Split-Attention Networks
It is well known that featuremap attention and multi-path representation are important for visual recognition. In this paper, we present a modularized architecture, which applies the channel-wise attention on different network branches to …
Gradient Descent Provably Optimizes Over-parameterized Neural Networks
One of the mysteries in the success of neural networks is randomly initialized first order methods like gradient descent can achieve zero training loss even though the objective function is non-convex and non-smooth. This paper demystifies…
Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition
One essential problem in skeleton-based action recognition is how to extract discriminative features over all skeleton joints. However, the complexity of the State-Of-The-Art (SOTA) models of this task tends to be exceedingly sophistica…
A graph-based genetic algorithm and generative model/Monte Carlo tree search for the exploration of chemical space
This paper presents a comparison of a graph-based genetic algorithm (GB-GA) and machine learning (ML) results for the optimization of log P values with a constraint for synthetic accessibility and shows that the GA is as good as or better …
On Lazy Training in Differentiable Programming
In a series of recent theoretical works, it was shown that strongly over-parameterized neural networks trained with gradient-based methods could converge exponentially fast to zero training loss, with their parameters hardly varying. In th…
Online Knowledge Distillation with Diverse Peers
Distillation is an effective knowledge-transfer technique that uses predicted distributions of a powerful teacher model as soft targets to train a less-parameterized student model. A pre-trained high capacity teacher, however, is not alway…
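The soft-target loss underlying this family of distillation methods can be written compactly: temperature-softened teacher and student distributions are compared with a KL divergence scaled by T^2 so soft-target gradients keep their magnitude. The temperature value and array shapes below are illustrative, not taken from the paper.

```python
import numpy as np

def softmax(z, T=1.0):
    z = z / T - (z / T).max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, T=4.0):
    """KL(teacher || student) on temperature-softened distributions, scaled by T^2."""
    p_t, p_s = softmax(teacher_logits, T), softmax(student_logits, T)
    return T * T * np.sum(p_t * (np.log(p_t) - np.log(p_s)), axis=-1).mean()

student = np.array([[2.0, 0.5, -1.0]])
teacher = np.array([[3.0, 1.0, -2.0]])
print(distillation_loss(student, teacher))
```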
Federated Meta-Learning with Fast Convergence and Efficient Communication
Statistical and systematic challenges in collaboratively training machine learning models across distributed networks of mobile devices have been the bottlenecks in the real-world application of federated learning. In this work, we show th…
Born Again Neural Networks
Knowledge Distillation (KD) consists of transferring “knowledge” from one machine learning model (the teacher) to another (the student). Commonly, the teacher is a high-capacity model with formidable performance, while the student is more …
Very High Resolution Object-Based Land Use–Land Cover Urban Classification Using Extreme Gradient Boosting
In this letter, the recently developed extreme gradient boosting (Xgboost) classifier is implemented in a very high resolution (VHR) object-based urban land use-land cover application. In detail, we investigated the sensitivity of Xgboost …
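As a minimal usage sketch (not the letter's actual pipeline or data), an XGBoost classifier with the scikit-learn-style API trained on synthetic per-object features; the hyperparameter values are placeholders for the kind of sensitivity analysis the abstract describes.

```python
import numpy as np
from xgboost import XGBClassifier

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))                          # per-object features (synthetic)
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(int)          # two synthetic "land-cover" classes

clf = XGBClassifier(n_estimators=200, max_depth=4, learning_rate=0.1)
clf.fit(X[:150], y[:150])
print("held-out accuracy:", (clf.predict(X[150:]) == y[150:]).mean())
```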
Implicit Neural Representations with Periodic Activation Functions
Implicitly defined, continuous, differentiable signal representations parameterized by neural networks have emerged as a powerful paradigm, offering many possible benefits over conventional representations. However, current network archite…
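A single sine-activated layer conveys the core idea: y = sin(w0 (Wx + b)) with a frequency factor w0 and a matching uniform initialisation range. The sketch below is a generic illustration under those assumptions, not the authors' code.

```python
import numpy as np

def siren_layer(x, in_dim, out_dim, w0=30.0, is_first=False, rng=None):
    """One sine-activated layer y = sin(w0 * (W x + b)), with the uniform
    initialisation range commonly used for such layers."""
    rng = rng or np.random.default_rng(0)
    bound = 1.0 / in_dim if is_first else np.sqrt(6.0 / in_dim) / w0
    W = rng.uniform(-bound, bound, size=(out_dim, in_dim))
    b = rng.uniform(-bound, bound, size=out_dim)
    return np.sin(w0 * (x @ W.T + b))

coords = np.linspace(-1.0, 1.0, 5).reshape(-1, 1)   # e.g. coordinates of a 1-D signal
h = siren_layer(coords, in_dim=1, out_dim=16, is_first=True)
```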
Assessing the reliability of species distribution projections in climate change research
Aim: Forecasting changes in species distribution under future scenarios is one of the most prolific areas of application for species distribution models (SDMs). However, no consensus yet exists on the reliability of such models for drawing …
PremPS: Predicting the impact of missense mutations on protein stability
Computational methods that predict protein stability changes induced by missense mutations have made a lot of progress over the past decades. Most of the available methods however have very limited accuracy in predicting stabilizing mutati…
CondConv: Conditionally Parameterized Convolutions for Efficient Inference
Convolutional layers are one of the basic building blocks of modern deep neural networks. One fundamental assumption is that convolutional kernels should be shared for all examples in a dataset. We propose conditionally parameterized convo…
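The conditional parameterization can be sketched as follows: a per-example routing vector, computed from globally pooled input features, mixes several expert kernels into the kernel actually used for that example (the convolution itself is omitted). Shapes, names and the sigmoid routing below are illustrative of the idea rather than the paper's exact configuration.

```python
import numpy as np

def condconv_kernel(x, experts, routing_W):
    """Mix expert kernels with per-example routing weights
    r = sigmoid(routing_W @ global_avg_pool(x)); the mixed kernel would then
    be used for an ordinary convolution on this example."""
    pooled = x.mean(axis=(1, 2))                        # global average pool -> (C_in,)
    r = 1.0 / (1.0 + np.exp(-(routing_W @ pooled)))     # one weight per expert
    return np.tensordot(r, experts, axes=1)             # example-specific kernel

rng = np.random.default_rng(0)
x = rng.normal(size=(3, 8, 8))                          # one example, C_in = 3
experts = rng.normal(size=(4, 16, 3, 3, 3))             # 4 experts, C_out = 16, 3x3 kernels
routing_W = rng.normal(size=(4, 3))
kernel = condconv_kernel(x, experts, routing_W)         # shape (16, 3, 3, 3)
```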
Global Convergence of Policy Gradient Methods for the Linear Quadratic Regulator
Direct policy gradient methods for reinforcement learning and continuous control problems are a popular approach for a variety of reasons: 1) they are easy to implement without explicit knowledge of the underlying model 2) they are an "end…
DAGs with NO TEARS: Continuous Optimization for Structure Learning
Estimating the structure of directed acyclic graphs (DAGs, also known as Bayesian networks) is a challenging problem since the search space of DAGs is combinatorial and scales superexponentially with the number of nodes. Existing approache…
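The continuous reformulation rests on a smooth acyclicity function; the sketch below evaluates h(W) = tr(exp(W ∘ W)) - d, which is zero exactly when the weighted adjacency matrix W encodes a DAG. The example matrices are made up for illustration.

```python
import numpy as np
from scipy.linalg import expm

def acyclicity(W):
    """Smooth acyclicity measure h(W) = tr(exp(W ∘ W)) - d; it equals zero
    exactly when the weighted adjacency matrix W has no directed cycles."""
    return np.trace(expm(W * W)) - W.shape[0]

W_dag = np.array([[0.0, 1.5, 0.0],
                  [0.0, 0.0, 2.0],
                  [0.0, 0.0, 0.0]])
W_cyc = W_dag.copy()
W_cyc[2, 0] = 0.7                      # add an edge that closes a cycle
print(acyclicity(W_dag))               # ~0 for a DAG
print(acyclicity(W_cyc))               # > 0 once a cycle exists
```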
Separating Neural Oscillations from Aperiodic 1/f Activity: Challenges and Recommendations
Electrophysiological power spectra typically consist of two components: an aperiodic part usually following a 1/f power law $P \propto 1/f^{\beta}$ and periodic components appearing as spectral peaks. While the investigation of the pe…
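To illustrate the aperiodic component, the sketch below generates a synthetic $P \propto 1/f^{\beta}$ spectrum and recovers beta from the slope of a log-log fit; real spectra additionally contain the periodic peaks the paper is concerned with, so this is only a toy estimate under that simplification.

```python
import numpy as np

rng = np.random.default_rng(0)
f = np.linspace(1.0, 40.0, 200)                               # frequencies in Hz
beta_true = 1.5
P = f ** (-beta_true) * np.exp(rng.normal(0.0, 0.05, f.size)) # noisy 1/f^beta spectrum

# for a purely aperiodic spectrum, the slope in log-log space is -beta
slope, intercept = np.polyfit(np.log10(f), np.log10(P), 1)
print(f"estimated beta = {-slope:.2f}")
```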
Stochastic Gradient Descent Optimizes Over-parameterized Deep ReLU Networks
We study the problem of training deep neural networks with Rectified Linear Unit (ReLU) activation function using gradient descent and stochastic gradient descent. In particular, we study the binary classification problem and show that for…
Differentiable Programming Tensor Networks
Differentiable programming is a fresh programming paradigm which composes parameterized algorithmic components and trains them using automatic differentiation (AD). The concept emerges from deep learning but is not only limited to train…