User profiles for Weizhong Li

Weizhong Li

- Verified email at mail.sysu.edu.cn - Cited by 45146

Weizhong Li

- Verified email at sdsc.edu - Cited by 32022

Fast, scalable generation of high‐quality protein multiple sequence alignments using Clustal Omega

…, D Dineen, TJ Gibson, K Karplus, W Li… - Molecular systems …, 2011 - embopress.org
Multiple sequence alignments are fundamental to many sequence analysis methods. Most
alignments are computed using the progressive alignment heuristic. These methods are …

InterProScan 5: genome-scale protein function classification

P Jones, D Binns, HY Chang, M Fraser, W Li… - …, 2014 - academic.oup.com
Motivation: Robust large-scale sequence analysis is a major challenge in modern genomic
science, where biologists are frequently trying to characterize many millions of sequences. …

Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences

W Li, A Godzik - Bioinformatics, 2006 - academic.oup.com
Motivation: In 2001 and 2002, we published two papers (Bioinformatics, 17, 282–283,
Bioinformatics, 18, 77–82) describing an ultrafast protein sequence clustering program called cd-…

CD-HIT: accelerated for clustering the next-generation sequencing data

L Fu, B Niu, Z Zhu, S Wu, W Li - Bioinformatics, 2012 - academic.oup.com
CD-HIT is a widely used program for clustering biological sequences to reduce sequence
redundancy and improve the performance of other sequence analyses. In response to the …

CD-HIT Suite: a web server for clustering and comparing biological sequences

Y Huang, B Niu, Y Gao, L Fu, W Li - Bioinformatics, 2010 - academic.oup.com
CD-HIT is a widely used program for clustering and comparing large biological sequence
datasets. In order to further assist the CD-HIT users, we significantly improved this program …

A new bioinformatics analysis tools framework at EMBL–EBI

M Goujon, H McWilliam, W Li, F Valentin… - Nucleic acids …, 2010 - academic.oup.com
The EMBL-EBI provides access to various mainstream sequence analysis applications.
These include sequence similarity search services such as BLAST, FASTA, InterProScan and …

Analysis tool web services from the EMBL-EBI

H McWilliam, W Li, M Uludag, S Squizzato… - Nucleic acids …, 2013 - academic.oup.com
Since 2004 the European Bioinformatics Institute (EMBL-EBI) has provided access to a wide
range of databases and analysis tools via Web Services interfaces. This comprises services …

The EMBL-EBI bioinformatics web and programmatic tools framework

W Li, A Cowley, M Uludag, T Gur… - Nucleic acids …, 2015 - academic.oup.com
Since 2009 the EMBL-EBI Job Dispatcher framework has provided free access to a range of
mainstream sequence analysis applications. These include sequence similarity search …

Clustering of highly homologous sequences to reduce the size of large protein databases

W Li, L Jaroszewski, A Godzik - Bioinformatics, 2001 - academic.oup.com
We present a fast and flexible program for clustering large protein databases at different
sequence identity levels. It takes less than 2 h for the all-against-all sequence comparison and …

The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein Families

…, W Li, L Jaroszewski, P Cieplak, CS Miller, H Li… - PLoS …, 2007 - journals.plos.org
Metagenomics projects based on shotgun sequencing of populations of micro-organisms
yield insight into protein families. We used sequence similarity clustering to explore proteins …