Zeming Lin

Simulating 500 million years of evolution with a language model

Thomas Hayes, Roshan Rao, Halil Akin, Nicholas J. Sofroniew, Deniz Oktay, et al. · 2024

Computer science, Biology, Geography

More than three billion years of evolution have produced an image of biology encoded into the space of natural proteins. Here we show that language models trained on tokens generated by evolution can act as evolutionary simulators to gener…

Systematic identification and validation of the reference genes from 447 transcriptome datasets of moso bamboo (Phyllostachys edulis)

Yan Liu, Chenglei Zhu, Zeming Lin, Hui Li, Xiaolin Di, et al. · 2024

Biology

Bamboo was one of the first plants to be cultivated in China and is widely used in industry and daily life. The study of gene function has become an important part of bamboo breeding, whereas quantitative real-time PCR (qRT-PCR) is a power…

Evolutionary relationship of moso bamboo forms and a multihormone regulatory cascade involving culm shape variation

Yan Liu, Chenglei Zhu, Xianghua Yue, Zeming Lin, Hui Li, et al. · 2024

Biology

Summary: Moso bamboo (Phyllostachys edulis), known as Mao Zhu (MZ) in Chinese, exhibits various forms with distinct morphological characteristics. However, the evolutionary relationship among MZ forms and the mechanisms of culm shape variat…

A bamboo ‘PeSAPK4‐PeMYB99‐PeTIP4‐3’ regulatory model involved in water transport

Chenglei Zhu, Zeming Lin, Kebin Yang, Yongfeng Lou, Yan Liu, et al. · 2024

Biology, Chemistry, Engineering

Summary: Water plays crucial roles in the expeditious growth and osmotic stress of bamboo. Nevertheless, the molecular mechanism of water transport remains unclear. In this study, an aquaporin gene, PeTIP4‐3, was identified through a joint ana…

Evolutionary-scale prediction of atomic-level protein structure with a language model

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, et al. · 2023

Computer science, Biology, Geography

Recent advances in machine learning have leveraged evolutionary information in multiple sequence alignments to predict protein structure. We demonstrate direct inference of full atomic-level protein structure from primary sequence using a …

A high-level programming language for generative protein design

Brian Hie, Salvatore Candido, Zeming Lin, Ori Kabeli, Roshan Rao, et al. · 2022

Computer science, Engineering, Psychology

Combining a basic set of building blocks into more complex forms is a universal design principle. Most protein designs have proceeded from a manual bottom-up approach using parts created by nature, but top-down design of proteins is fundam…

ESM Atlas v0 random sample of high confidence predicted protein structures

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, et al. · 2022

Computer science, Mathematics, Geology

A random sample out of the 225M high confidence predictions in the ESM Atlas v0 dataset introduced in "Evolutionary-scale prediction of atomic level protein structure with a language model". All predictions can be accessed in the ESM Meta…

ESM Atlas v0 representative random sample of predicted protein structures

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, et al. · 2022

Geography, Geology, Physics

A representative random sample of the ESM Atlas v0 dataset introduced in "Evolutionary-scale prediction of atomic level protein structure with a language model". All predictions can be accessed in the ESM Metagenomic Atlas (https://esmatl…

ESM Atlas v0 random sample of high confidence predicted protein structures

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, et al. · 2022

Mathematics, Geology, Chemistry

A random sample out of the 225M high confidence predictions in the ESM Atlas v0 dataset introduced in "Evolutionary-scale prediction of atomic level protein structure with a language model". All predictions can be accessed in the ESM Meta…

ESM Atlas v0 representative random sample of predicted protein structures

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, et al. · 2022

Physics, Geology

A representative random sample of the ESM Atlas v0 dataset introduced in "Evolutionary-scale prediction of atomic level protein structure with a language model". All predictions can be accessed in the ESM Metagenomic Atlas (https://esmatl…

Evolutionary-scale prediction of atomic level protein structure with a language model

Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, et al. · 2022

Computer science, Biology, Geography

Artificial intelligence has the potential to open insight into the structure of proteins at the scale of evolution. It has only recently been possible to extend protein structure prediction to two hundred million cataloged proteins. Charac…

Learning inverse folding from millions of predicted structures

Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, et al. · 2022

Computer science, Mathematics, Chemistry

We consider the problem of predicting a protein sequence from its backbone atom coordinates. Machine learning approaches to this problem to date have been limited by the number of available experimentally determined protein structures. We …

STARDATA: A StarCraft AI Research Dataset

Zeming Lin, Jonas Gehring, Vasil Khalidov, Gabriel Synnaeve · 2021

Computer science, Psychology

We release a dataset of 65646 StarCraft replays that contains 1535 million frames and 496 million player actions. We provide full game state data along with the original replays that can be viewed in StarCraft. The game state data was reco…

Neural Potts Model

Tom Sercu, Robert Verkuil, Joshua Meier, Brandon Amos, Zeming Lin, et al. · 2021

Computer science, Mathematics, Biology

Abstract: We propose the Neural Potts Model objective as an amortized optimization problem. The objective enables training a single model with shared parameters to explicitly model energy landscapes across multiple protein families. Given …

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, et al. · 2021

Computer science, Biology

Significance: Learning biological properties from sequence data is a logical step toward generative and predictive artificial intelligence for biology. Here, we propose scaling a deep contextual language model with unsupervised learning to …

Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching

Kalika Bali, Pushpak Bhattacharyya, Marina Fomicheva, Philipp Koehn, Holger Schwenk, et al. · 2021

Computer science, Philosophy

Bienvenidos to the proceedings of the fifth edition of the workshop on computational approaches for linguistic code-switching (CALCS-2021)! Code-switching is this very interesting phenomenon where multilingual speakers communicate by moving…

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, Sam Gross, Francisco Massa, Adam Lerer, James Bradbury, et al. · 2019

Computer science, Art

Deep learning frameworks have often focused on either usability or speed, but not both. PyTorch is a machine learning library that shows that these two goals are in fact compatible: it provides an imperative and Pythonic programming style …

Growing Action Spaces

Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, et al. · 2019

Computer science, Mathematics, Sociology

In complex tasks, such as those with large combinatorial action spaces, random exploration may be too inefficient to achieve meaningful learning progress. In this work, we use a curriculum of progressively growing action spaces to accelera…

Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences

Alexander Rives, Joshua Meier, Tom Sercu, Siddharth Goyal, Zeming Lin, et al. · 2019

Computer science, Biology, Mathematics

In the field of artificial intelligence, a combination of scale in data and model capacity enabled by unsupervised learning has led to major advances in representation learning and statistical generation. In the life sciences, the anticip…

Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger

Gabriel Synnaeve, Zeming Lin, Jonas Gehring, Daniel B. Gant, Vegard Mella, et al. · 2018

Computer science, Economics, Biology

We formulate the problem of defogging as state estimation and future state prediction from previous, partial observations in the context of real-time strategy games. We propose to employ encoder-decoder neural networks for this task, and i…

Value Propagation Networks

Nantas Nardelli, Gabriel Synnaeve, Zeming Lin, Pushmeet Kohli, Philip H. S. Torr, et al. · 2018

Computer science, Mathematics, History

We present Value Propagation (VProp), a set of parameter-efficient differentiable planning modules built on Value Iteration which can successfully be trained using reinforcement learning to solve unseen tasks, has the capability to general…

Value Propagation Networks.

Nantas Nardelli, Gabriel Synnaeve, Zeming Lin, Pushmeet Kohli, Philip H. S. Torr, et al. · 2018

Computer science, Mathematics, History

We present Value Propagation (VProp), a parameter-efficient differentiable planning module built on Value Iteration which can successfully be trained in a reinforcement learning fashion to solve unseen tasks, has the capability to generali…

An Analysis of Model-Based Heuristic Search Techniques for StarCraft Combat Scenarios

David G. Churchill, Zeming Lin, Gabriel Synnaeve · 2017

Computer science, Geography

Real-Time Strategy games have become a popular test-bed for modern AI systems due to their real-time computational constraints, complex multi-unit control problems, and imperfect information. One of the most important aspects of any RTS AI …

STARDATA: A StarCraft AI Research Dataset

Zeming Lin, Jonas Gehring, Vasil Khalidov, Gabriel Synnaeve · 2017

Computer science

We release a dataset of 65646 StarCraft replays that contains 1535 million frames and 496 million player actions. We provide full game state data along with the original replays that can be viewed in StarCraft. The game state data was reco…

Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play

Sainbayar Sukhbaatar, Zeming Lin, Ilya Kostrikov, Gabriel Synnaeve, Arthur Szlam, et al. · 2017

Psychology, Computer science

We describe a simple scheme that allows an agent to learn about its environment in an unsupervised manner. Our scheme pits two versions of the same agent, Alice and Bob, against one another. Alice proposes a task for Bob to complete; and t…

DeepCloak: Masking Deep Neural Network Models for Robustness Against Adversarial Samples

Ji Gao, Beilun Wang, Zeming Lin, Weilin Xu, Yanjun Qi · 2017

Computer science, Art, Biology

Recent studies have shown that deep neural networks (DNN) are vulnerable to adversarial samples: maliciously-perturbed samples crafted to yield incorrect model outputs. Such attacks can severely undermine DNN systems, particularly in secur…

TorchCraft: a Library for Machine Learning Research on Real-Time Strategy Games

Gabriel Synnaeve, Nantas Nardelli, Alex Auvolat, Soumith Chintala, Timothée Lacroix, et al. · 2016

Computer science, Geography

We present TorchCraft, a library that enables deep learning research on Real-Time Strategy (RTS) games such as StarCraft: Brood War, by making it easier to control these games from a machine learning framework, here Torch. This white paper…

Episodic Exploration for Deep Deterministic Policies: An Application to StarCraft Micromanagement Tasks

Nicolas Usunier, Gabriel Synnaeve, Zeming Lin, Soumith Chintala · 2016

Computer science, Mathematics, Political science

We consider scenarios from the real-time strategy game StarCraft as new benchmarks for reinforcement learning algorithms. We propose micromanagement tasks, which present the problem of the short-term, low-level control of army members duri…

MUST-CNN: A Multilayer Shift-and-Stitch Deep Convolutional Architecture for Sequence-based Protein Structure Prediction

Zeming Lin, Jack Lanchantin, Yanjun Qi · 2016

Computer science, Mathematics, Art

Predicting protein properties such as solvent accessibility and secondary structure from its primary amino acid sequence is an important task in bioinformatics. Recently, a few deep learning models have surpassed the traditional window bas…

Deep Motif: Visualizing Genomic Sequence Classifications

Jack Lanchantin, Ritambhara Singh, Zeming Lin, Yanjun Qi · 2016

Computer science, Biology, Physics

This paper applies a deep convolutional/highway MLP framework to classify genomic sequences on the transcription factor binding site task. To make the model understandable, we propose an optimization driven strategy to extract "motifs", or…
