|
|
|
|
| | | |
Release Announcements
VectorBase is pleased to announce the VB-2012-04 release. Highlights of this release include: - Entrez-style search box. Our site search now automatically queries multiple domains and allows one click access to search results across VectorBase data.
- New gene sets. Aedes, Culex and Ixodes gene sets have all been updated with community-supplied gene improvements.
- Updated genome browser. VectorBase now runs e! version 66.
More detailed release notes are available on the Wiki.
- The guide provides the list of resistant, susceptible and humanized mouse models used in the infection studies.
- Pathogens (bacteria, eukaryotic pathogens and viruses) are linked out
directly to their corresponding BRC websites (PATRIC, EuPathDB, ViPR and IRD).
- Mouse strains selected for the pathogen infection studies are linked out to
the JAX mice database for detailed mouse strain information and purchasing information (if any).
- The PubMed and Mouse Genome Informatics (MGI) references are
provided.
The next IRD release will be in May 2012.
The next ViPR release will be in May 2012.
Click to view genomes.
show more...
The Pathogen Portal team at the Virginia Bioinformatics Institute (VBI)
is pleased to announce a new version of the RNA-Seq pipeline.
This release has an enhanced user interface, as well as new data and features, including:
- Improved pipeline navigation and organization
- Mammalian host genomes
- Customized help and informational documentation
- Advanced options for pipeline tools.
Built on Galaxy, the VBI Portal team has modified selected features of
Galaxy to simplify some of the most typical user workflows for mapping
RNA-Seq reads to reference genomes from EuPathDB, PATRIC, and VectorBase
and for estimating gene expression values.
VectorBase is pleased to announce the VB-2012-02 release.
Highlights of this release include:
- Improved search functionality. You can now search the expression data, population data and ontologies from the same search box.
- Simplified, enhanced new graphical interface to our expression maps.
- You can associate publications to genes from within the genome browser via the PubMed link in the left hand menu of a gene page
More details are available in the Release notes.
Click for a summary of improvements and genomes available on PATRIC.
show more...
Improved Phylogenetic Tree Viewer:
Recent improvements to the phylogenetic tree viewer allow users to select their genomes of interest from the phylogenetic tree and save them as a genome group to their workspace.
Other Improvements and Bug Fixes:
- This release also includes various minor improvements and bug fixes in some of the existing data and functionalities at PATRIC.
- All of the tables on the PATRIC website now include mouse over feature which allows user to see full content of any cell in the table when the cell content is very long and is displayed only partially.
New genomes and annotations
Since November 2011, 105 new genomes have been added to PATRIC and 104 of them have been annotated using RAST.
A summary of the genomes available on the PATRIC website through February, 2012 is provided in the table below:
|
PATRIC
|
Legacy BRC
|
RefSeq
|
| Number of genomes |
3911
|
337
|
4014
|
| Number of Complete genomes |
1723
|
237
|
1711
|
| Number of WGS genomes |
2188
|
96
|
1906
|
| Number of Plasmid only genomes |
0
|
4
|
397
|
The Pathogen Portal team is pleased to announce the public availability of the RNA-Seq pipeline to map RNA-Seq reads to
reference genomes from EuPathDB, PATRIC, and VectorBase and to estimate gene expression values. The Pipeline is built on
Galaxy, with modifications to help simplify the process for routine use. The RNA-Seq Pipeline is available at
rnaseq.pathogenportal.org.
We are pleased to
announce the release
of TriTrypDB 4.0
New Data:
- To search this data, go to the "Identify
Genes based on Mass Spec. Evidence" and select "Procyclic form
palmitoylated proteins (Emmer et al.)" under Trypanosoma brucei. This
search will return all genes that have mass spec palmitoylation
evidence.
- Mass spec data can also be visualized in the protein section of
gene pages (for example, scroll down to the "Protein" section of this gene page) and in the genome browser. Mousing
over peptide graphics displays additional peptide and modification site
information.
- Single nucleotide
polymorphism (SNP) calls based on high throughput sequencing data of T. brucei 927
and 427 provided by George Cross. Genes with SNPs can be
identified based on their type (synonymous, nonsynonymous, etc.), their
frequency and/or their depth of coverage.
- SNP data is represented as a table on gene pages. For example,
if you visit the gene page for Tb927.1.1740,
you can view a table under the heading "SNPs summary" which
includes information about the SNPs found in this gene based on this
comparison.
- An additional view of SNPs is available in the genome
browser where
SNPs can be visualized based on their coding potential (represented as
colored diamonds). Mousing over the diamonds reveals a popup with
additional information about the SNP at that specific location.
Also, the mouse over includes a link to the SNP record page with additional
information.
- The actual sequence reads in a particular region can be visualized in gbrowse by turning on
the "Sequence Reads" track under the heading "High Throughput
Sequencing (HTS) SNPs".
- Transcriptome-wide mRNA degradation data for T. brucei (Manful et. al.).
This data is represented on gene pages in graphical (fold change and
percentile) and tabular (raw data) formats. To see this data
visit any T. brucei gene page
(for example Tb927.8.620)
and scroll down to the section called "T.
brucei RNA Sequence of transcriptome-wide mRNA degradation"
under "Expression".
- Leishmania tarentolae Parrot-TarII
genome
sequence
and annotation (Raymond et. al.).
Chromosome sequence and annotations for Leishmania tarentolae are
provided by the CIHR Group on host pathogen interactions (Marc
Ouellette, Jacques Corbeil, Barbara Papadopoulou, Michel J. Tremblay,
Fr�d�ric Raymond, S�bastien Boisvert from Universit� Laval, and Martin
Olivier from McGill University).
- Genes may be queried based on gene IDs, text terms, EC numbers,
GO terms, sequence features (length, introns, signal sequences, Pfam
domains), etc.
- Genes and sequences may also be interrogated based on BLAST
similarity, user-defined motifs, etc.
- Gene pages and synteny data are also available.
- Draft assemblies of the genome sequences from the Trypanosomatid
sequencing white paper. This data is graciously provided
prepublication and full credit and data usage information is available
by clicking on the sequence name:
- New
Gene IDs -- T. cruzi
IDs have been truncated by GeneDB to make them more user
friendly. Old IDs have been mapped to the new ones (ie. you can
search with either old or new IDs). Currently the old ID is
displayed with the new gene ID on gene pages (old ID/new ID). In
a future release the only the new IDs will be displayed at the top of
the page.
New functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
We are pleased to announce the release
of ToxoDB 7.2
This release of ToxoDB contains the
following new datasets:
- Strand-specific RNA-sequence data from T. gondii ME49 oocyst stages
provided prepublication by the John Boothroyd and Brian Gregory
groups. These data are available as genome browser tracks.
- Note: RNA-sequence data is also used to
visualize intron/exon boundaries (based on intron-spanning
reads). Data specific or unified splice junction tracks
can be visualized in the genome browser.
- Microarray data from wild type and several bradyzoite
differentiation mutants of T. gondii
type I RH strain (Lescault et. al., 2010, PLoS One).
Microarray data can be searched in the transcript
expression
section of ToxoDB. Searching this data allows you to
identify differences between wild type and mutant parasites as well as
genes that are differentially regulated during a bradyzoite
differentiation time
course. Here is
a
search strategy that
identifies genes that are upregulated during the differentiation
series. In addition, microarray data is displayed on gene pages
in the expression section where data can be viewed as fold change
differences, percentile or in tabular format.
New functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
We are pleased to
announce new features in MicrosporidiaDB
New
functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
We are pleased to announce the release
of AmoebaDB 1.7
New Data:
- E. histolytica microarray data
graciously provided prepublication by Rivka Bracha, Carol A.
Gilchrist and David Mirelman. This data is based on E. histolytica G3 vs HM1:IMSS
Transcript Expression and can be searched in the microarray
section of AmoebaDB under the heading "G3 v. HM1". Genes can be
identified based on fold change or percentile searches. In
addition, microarray data is displayed graphically in the expression
section on gene pages.
- E. histolytica
proteomics data from bead-purified phagosomes (Marion et. al.). To search this
data click on the
protein expression link under the "Identify Genes by:" heading, then
click on "Mass
Spec. Evidence". On this page you can expand the
"Entamoeba histolytica" item
and select "Human Serum Coated Magnetic Bead-Purified Phagosomes from
Trophozoite (Marion et. al.)". This
search strategy identifies all genes with evidence of protein
expression based on this data.
- E. histolytica
proteomics data from Purified Trophozoite Uropods (Markiewicz et. al.).
To search this
data click on the
protein expression link under the "Identify Genes by:" heading, then
click on "Mass
Spec. Evidence". On this page you can expand the
"Entamoeba histolytica" item
and select "Purified Uropods from Trophozoite (Markiewicz et.
al.)". This
search strategy identifies all genes with evidence of protein
expression based on this data that also have a predicted signal
peptide.
New
functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
We are pleased to
announce new features in TrichDB
New
functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
We are pleased to
announce new features in PiroplasmaDB
New
functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook
and Twitter
11 January 2012
We are pleased to
announce the release of PlasmoDB 8.2
New data in release 8.1 and 8.2:
- P. vivax Schizont stage
proteome from human blood samples (Roobsoong et. al.).
This data can be searched by choosing "Schizont stage proteome from
human blood sample (Roobsoong et al.)" in the P. vivax section of the Mass Spec. Evidence search page.
In
addition, peptide evidence is displayed on gene pages in the Protein
Section
and in the genome browser by turning on
the track specific for this data. This
search strategy identifies all P.
vivax genes with mass spec
evidence from this study.
- P. falciparum nuclear
and cytosolic fractions from rings, trophozoites, and schizonts
graciously provided prepublication by Till Voss and colleagues.
This
data can be searched by choosing "P.
falciparum
cytoplasmic and nuclear fractions from rings, trophozoites and
schizonts (3D7) (Oehring and Woodcroft et al. - unpublished)" in the P. falciparum section of the Mass spec. evidence search page.
In
addition, peptide evidence is displayed on gene pages in the Protein
Section
and in the genome browser by turning on
the track specific for this data. This
search strategy identifies all P.
falciparum genes with mass spec
evidence from this study.
- P. falciparum
ChIP-sequence data from the histone variant H2Bv (three time points)
and CenH3 (schizonts) represented as coverage plots in the genome browser.
This data was graciously provided prepublication by the Stunnenberg
group.
- Genome-wide expression level polymorphisms (ELPs) from a genetic
cross between phenotypically distinct parasite clones of P. falciparum (HB3 and Dd2) (Gonzales et. al.).
This data may be searched and visualized in many ways:
- Genes may be identified based on their association to genomic
segments, expression profile similarity or similarity of genetic
association. These searches are based on the microarray
expression profiles generated in each of the progeny and can be
accessed in the microarray search page under the
heading "eQTL studies on Hb3 X Dd2 progeny (Ferdig)".
- Regions/spans that are associated by eQTL are displayed in a
table on gene pages (example gene page). The table
"Regions/Spans associated by eQTL experiment on HB3 x DD2 progeny"
includes a column for haplotype blocks associated the gene, the
coordinates of the haplotype block, LOD scores, and links to searches
for genes contained within
the region or associated with the region.
- New
Gene IDs -- Please read the news
item from GeneDB regarding assignment of new P. falciparum gene IDs. We will be
updating to these new IDs in the next release. Rest assured that both
PlasmoDB and GeneDB will proper mapping of old IDs to the new ones (in
other words, you will always be able to search with the old or new
IDs). Currently in PlasmoDB we are displaying the new Gene ID next to
the old one at the top of gene pages (old ID/new ID).
New functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
We
are pleased to announce new data and features in GiardiaDB 2.5
New Data:
Giardia lamblia Genotype A1
WB C6 (assemblage A) contigs and scaffolds have been assembled into
chromosomes based on an optical map ( Perry et. al.). The optical map
has been incorporated into GiardiaDB where you can use it in searches
(ie. identifying
genes by chromosomal location) and is graphically represented in
the genomic context section of gene pages and in the genome
browser.
Note that while overall the optical map data and assembly data published by the Upcroft group
are in agreement, differences do exist. We encourage all
interested to review the published literature including the following:
- Upcroft JA, Krauer
KG, Upcroft P. Chromosome sequence maps of the Giardia lamblia
assemblage A isolate WB. Trends Parasitol. 2010 Oct;26(10):484-91.
- Perry DA, Morrison
HG, Adam RD. Optical map of the genotype A1 WB C6 Giardia lamblia
genome isolate. Mol Biochem Parasitol. 2011 Dec;180(2):112-4.
- Krauer KG, Burgess
AG, Dunn LA, Upcroft P, Upcroft JA. Sequence map of the 2 Mb Giardia
lamblia assemblage A chromosome. J Parasitol. 2010 Jun;96(3):660-2.
- Upcroft JA, Krauer KG, Burgess AG, Dunn LA,
Chen N, Upcroft P. Sequence map of the 3-Mb Giardia duodenalis
assemblage A chromosome. Chromosome Res. 2009;17(8):1001-14.
New
functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook
and Twitter
11 January 2012
We are pleased to
announce new features in CryptoDB 4.6
New Data:
C. parvum post-infection
semi-quantitative Real Time PCR graciously provided prepublication by
the Mark Rutherford (University of Minnesota) and Jessica C. Kissinger
(University of Georgia) labs.
- You can find differentially expressed genes based on this data by
clicking on RT PCR Evidence
under the "Transcript Expression" heading. Clicking on "Identify Genes
based on C.p. post-infection semi-quantitative Real Time PCR (fold
change)" reveals search options that allow you to identify genes with a
desired change in expression.
- Clicking on "C.p. post-infection semi-quantitative Real Time PCR
(similarity)"
allows you to find genes that have a similar expression profile to an
input gene. For example, if you were interested in finding genes
with a similar profile to a "kinesin-related protein K2", you can enter
the gene ID "cgd3_2590" in the similarity search and return genes
with a similar expression profile.
- In addition, a graph of the expression profiles is made available
in the expression section of each C. parvum gene page.
New
functionality:
- The genome browser has been updated. This newer version
includes several improvements and functionalities. A tutorial highlighting some of
the new features is also available.
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
- Several new tutorials have been generated:
Note:
Our bimonthly database releases incorporate new data and correct old
data when necessary. Changes in annotation and new experimental
data
may slightly alter your search results by increasing or decreasing the
number of hits. When search parameters change with a new release, we
invalidate (�) the search and ask you
to rerun it. When IDs are updated or removed, we map the old IDs to the
new ones, remove the old IDs from your Basket, and leave your Favorites
page alone.
EuPathDB
is now on FaceBook and Twitter
11 January 2012
Includes over 280 new genomes.
show more...
Search PATRIC Data Using Single Site-wide Search Box:
- Every PATRIC page now includes a search box (top right, in the blue banner) that will search across PATRIC data types.
- Results are presented in a “Google-like” fashion, including a summary view and several data-specific views (e.g., features by function, genomes, etc.).
- Users can search on locus tags, function names, genome names, and much more.
- Search results include a collapsible toolbar, allowing the user to select specific search results of interest and perform actions (e.g., view sequence data, add features to workspace, etc.).
- PATRIC’s existing specialized searches (Feature Finder, GO Term Search, ) are still available from the main “Searches & Tools” navigation menu.
User Workspace Overhaul:
- This release includes a completely redesigned user workspace that embraces an iTunes-like interface.
- The user can view groups or lists of workspace items, and filter them based on certain metadata (e.g., group name, tag, etc.).
- All users (including registered users and guests) can add both features and genomes to workspace, allowing for multiple levels of analysis and comparison.
- When adding items to the workspace, users can chose whether to group them, or simply tag them for more flexible, ad hoc grouping in the workspace.
- Registered users’ workspace are persistent, meaning that all their work is saved between working sessions.
- Genome groups can be used to scope all PATRIC’s advanced Searches and Tools, allowing users to create reusable sets of genomes in which to, for example, explore pathways, browse protein families, and more.
Genomes and Annotations
Since August 2011, 284 new genomes have been added to PATRIC. In addition, 16 genomes have been updated or replaced with their newest versions. In total, 279 new genomes have been annotated using RAST.
A summary of the genomes available on the PATRIC website through November, 2011 is provided in the table below:
|
PATRIC |
Legacy BRC |
RefSeq |
| Number of genomes |
3807 |
337 |
3920 |
| Number of Complete genomes |
1619 |
237 |
1617 |
| Number of WGS genomes |
2188 |
96 |
1906 |
| Number of Plasmid only genomes |
0 |
4 |
397 |
|
We are pleased to announce the release
of ToxoDB 7.1
This release of ToxoDB contains the
following new datasets:
- Genes with mass spec evidence from each of these fractions may
be interrogated in ToxoDB. For example, this search
strategy identifies genes that have mass spec evidence from the
outer wall fraction (not treated) that do not have proteomics
expression evidence from the inner wall (bleach treated)
fraction. This search can
be further refined to ask how many of these genes also contain
predicted transmembrane domains.
- This search
strategy identifies Toxoplasma (GT1 and ME49 genes) genes with
evidence of phosphorylation during the intracellular stage that do not
have evidence of phosphorylation in purified extracellular parasites.
- Calcium-dependent phosphoproteomic data from intracellular T. gondii RH strain. This
data was graciously provided prepublication and is now in press (Nebl
et. al., 2011, PLoS Pathogens).
- This search
strategy identifies genes that have evidence of calcium-dependent
phosphorylation based on SEQUEST and MASCOT analysis.
- Note: all proteomic data
can be viewed graphically on gene pages or the genome browser in
ToxoDB. In addition, moving your cursor over peptide graphics
displays additional information including the peptide sequence and
phosphorylation sites when available. For example, you can
view peptide evidence from all the above described data in the "protein
features" section on this
gene page. Also, all protein expression data in ToxoDB can be
searched in the "Protein
expression" section.
- Single nucleotide polymorphism (SNP) data based on comparisons of
the T. gondii
RH strain (This data was graciously provided prepublication by Marc-Jan
Gubbles). Using this data, genes in ToxoDB may be identified
based
on their high throughput sequencing (HTS) SNP characteristics
(comparing RH to GT1 and ME49). On the SNP
search page (under the heading "Population Biology"), selecting
"HTS SNP characteristics" allows you to define the type of SNPs you are
interested in.
- This search
identifies all genes in ME49 that contain 20 or more nonsynonymous SNPs
compared to RH.
- HTS SNPs are summarized on gene pages in the "SNPs Summary"
table.
- SNP records can also be searched using the SNP search tools
under the heading "Identify other data types". Searches include
identifying SNPs based on ID, genomic location or GeneID. These
searches return SNP records which are populated with additional
information such as sequence surrounding a SNP.
- In addition, an alignment
pileup of reads around a specific SNP can be viewed to assess the
quality of the SNP call.
- The unannotated genome sequence T.
gondii
strains CatBR9 and the early diverging apicomplexan Gregarina niphandrodes provided by
the Toxoplasma white paper
sequencing project are now available. This data can be searched
from the
"Genomic Sequences" section under
the heading "Identify other data types" on the home page. Genome
sequences from this effort will be incorporated into ToxoDB release as
they become available. A webpage
with updated information on the
status of sequencing is maintained by Hernan Lorenzi at the J. Craig
Venter Institute. In addition, a PDF of the sequencing white
paper is available for download.
New functionality:
- A new column analysis tool is now available that allows you to
generate a word cloud or histogram based on the data in result
columns. To access this feature, run any search that returns a
list of results then click on the icon next to the product description
column name. This will reveal a popup with the option to select
"Word Cloud". Look for this analysis feature to be activated on
additional columns in future releases.
EuPathDB
is now on FaceBook and Twitter
11 October 2011
VectorBase is pleased to announce the VB-2011-08 release.
Highlights of this release include:
- First announcement of the new population biology resource prototype web interface
- Updated Ensembl genome browser (to schema 63)
More details are available in the Release notes.
Includes over 750 new genomes.
show more...
New Searches and Tools
Integration of Genome Metadata
PATRIC has collected metadata associated with bacterial genomes from multiple sources, such as NCBI’s BioProject database, GenBank records, and the Human Microbiome Project. Following the automated collection, the metadata was manually curated for consistency and accuracy, organized into a relational database, and integrated with the genomes available in PATRIC. This metadata provides useful information about a genome, including genome status, isolation source, isolation country, collection date, host organism, associated disease, and more. The metadata is organized into the following broad categories: organism info, project info, sequence info, isolate info, host info, and phenotype info.
Genome metadata can be accessed in multiple ways on the PATRIC website:
- The improved Genome Finder allows users to locate genomes available in PATRIC by their associated metadata. A keyword search allows for the search across all available metadata fields, including the free-text comments. These search results are summarized by key metadata attributes, including genome status, isolation country, host name, disease, collection date, and completion date. Progressive filtering of the results is also available. http://patricbrc.org/portal/portal/patric/GenomeFinder?cType=taxon&cId=&dm=
- A summary of key metadata attributes is provided on every Taxon Overview page.
For an example see: http://patricbrc.org/portal/portal/patric/Taxon?cType=taxon&cId=561
- Metadata is also available at the Genome List tab, present at all taxonomy levels. Progressive filtering by key metadata attributes is provided. The Genome List table allows users to choose any of the 61 currently available metadata attributes and add them to the table as columns. In addition, users can also download all available genomes and associated metadata as a tab delimited or Excel file. For an example see:
http://patricbrc.org/portal/portal/patric/GenomeList?cType=taxon&cId=561&dataSource=&displayMode=&pk=9379#key=9379&pS=20&aP=1&aT=0&cwG=false&cF=&gId=&gName=&gdir=ASC&gsort=genome_name&sdir=&ssort=
- All available metadata for an individual genome is located on the Genome Overview Page under Genome Summary.
http://patricbrc.org/portal/portal/patric/Genome?cType=genome&cId=200960
To learn more about the genome metadata available at PATRIC, or how to search for a genome by its available metadata, visit Genome Metadata FAQs or Genome Finder FAQs, respectively.
The collection and refinement of genome metadata is an ongoing task. Future plans include integrating the genome metadata with various analysis tools available on the PATRIC website.
Data Download in GFF Format
PATRIC annotations are now available for download as GFF files, which can be accessed via the Downloads tab at the top of any PATRIC page. Alternatively, GFF files can be accessed by clicking “Download genome data” in the upper-right of any PATRIC Genome Overview page. For an example, see:
http://brcdownloads.vbi.vt.edu/patric2/genomes/Escherichia_coli_O104-H4_str_LB226692/Escherichia_coli_O104-H4_str_LB226692.PATRIC.gff
Genomes and Annotations
Since the June release, 756 new genomes have been added to PATRIC. Many of them are draft assemblies available in GenBank but not in RefSeq. In addition, 62 genomes have been updated or replaced with their newest versions. In total, 749 new genomes have been annotated using RAST.
A summary of the genomes available on the PATRIC website through July, 2011 is provided in the table below:
|
PATRIC |
Legacy BRC |
RefSeq |
| Number of genomes |
3535 |
337 |
3685 |
| Number of Complete genomes |
1491 |
237 |
1488 |
| Number of WGS genomes |
2044 |
96 |
1800 |
| Number of Plasmid only genomes |
0 |
4 |
397 |
|
To view this Sequence Summary along with Genomic and Protein Feature Summaries, please visit: http://patricbrc.org/portal/portal/patric/Taxon?cType=taxon&cId=2
We are pleased to announce the release
of AmoebaDB 1.5
New Data:
- E. histolytica Rahman and moshkovskii genome
sequence is now available in AmoebaDB. This data was graciously
provided prepublication by Gareth Weedall and Neil Hall (Institute of
Integrative Biology, University of Liverpool, UK). Genomic
sequence data
from unannotated genomes can be searched from the "Genomic Sequences"
section under the heading "Identify other data types" on the home
page. Sequence data may be searched based on ID,
organism
or BLAST.
EuPathDB
is now on FaceBook and Twitter
21 July 2011
We are pleased to announce the release
of TriTrypDB 3.2
New Data:
- Graphs and tables representing the expression and
percentile values for individual genes are available in the
"Expression" section of gene pages.
- T. cruzi comparative
genomic hybridization of 16 strains (Minning et. al.).
- This data is available as a genome browse track.
Amplifications are indicated as positive values on the graph while
deletions are indicated as negative values. A link showing this
data on the entire chromosome 37 of T.
cruzi can be accessed here.
EuPathDB
is now on FaceBook and Twitter
21 July 2011
We are pleased to
announce the release of PlasmoDB 8.0
New data:
- Sequence and annotation of the
P. falciparum IT strain produced
by the Parasite Genomics Group at the Wellcome Trust Sanger Institute
with funding from the EVIMalaR Consortium (a European Commission Funded
Network of Excellence). While
this data has been graciously provided prepublication to the community
its usage for large scale genomic analysis is bound by a data
usage
policy.
- Updated linkouts to the Malaria Parasite Metabolic
Pathways
(MPMP) database. Numerous pathways have been added and
updated by
Dr. Hagai Ginsburg. A full description of new pathways is
available here
and genes included in the MPMP database can be searched based on
metabolic pathway in PlasmoDB here.
New functionality:
- You can now view intron/exon boundaries based on available RNA
sequence data. boundary determination was performed using the RNA-Seq Unified
Mapper (RUM). To view intron/exon boundary calls, turn on the
"RNA-seq alignments RUM Intron" track in the genome browser. Here
is a link to a gene
that is predicted to have multiple splice variants based on the
annotation provided by GeneDB with the RUM intron tracks turned
on. To get additional information mouse over the tracks and/or
the track names.
EuPathDB
is now on FaceBook and Twitter
21 July 2011
We are pleased to announce the release
of ToxoDB 7.0
This release of ToxoDB contains the
following new datasets:
- Strand-specific RNA sequence and non coding RNA (ncRNA) data for
T. gondii and N. caninum (graciously provided
prepublication by Brian Gregory, University of Pennsylvania).
This data was generated as part of one of the funded driving biological
projects.
- An example
region of the Toxoplasma
genome with RNA sequence coverage-plot tracks turned on can be accessed
here (note: experiment was
performed using VEG parasites and was mapped to both VEG and ME49
sequence in ToxoDB). This data is represented in two
tracks:
- "T.gondii
VEG
mRNA RNAseq - Unique Aligners - (log2 Coverage) (Gregory)" which
includes strand-specific coverage plots (Blue for the forward strand
and red for the reverse strand).
- "T.gondii VEG
mRNA
RNAseq - Nonunique Aligners - (log2 Coverage) (Gregory)" which shows
coverage plots for regions where probes did not uniquely align (ie.
gives and indication of possible repetitive regions).
- An example
region of the Toxoplasma
genome with ncRNA sequence coverage-plot tracks turned on can be
accessed here (note: experiment was
performed using RH and ME49 parasites and was mapped to ME49 sequence
in ToxoDB. RH data was also mapped against the GT1 strain).
- New T. gondii ME49 cosmid end library mapping to
the genome are now available in the genome browser. The libraries
were generated in Laura Knolls laboratory, University of Wisconsin.
- Updated restriction fragment polymorphism (RFLP) data provided by
Chunlei Su, University of Tennessee. RFLP data may be searched
based on Genotype
or Genotype Number.
- The unannotated genome sequence T.
gondii
strains FOU provided by the Toxoplasma white paper sequencing project
is now available. This data can be searched from the
"Genomic Sequences" section under
the heading "Identify other data types" on the home page. Genome
sequences from this effort will be incorporated into ToxoDB release as
they become available. The latest updated information on the
status of sequencing is available here. The sequencing white
paper is available here.
- Genome sequence and annotation of Eimeria tenella graciously provided
prepublication by Adam Reid and colleagues (Parasite Genomics group at
the Wellcome Trust Sanger Institute, UK). Annotated genomes may
be investigated in ToxoDB using multiple searches for example, genes
may be identified based on text, ID, BLAST, GO
terms, presence of transmembrane domains or signal peptides, orthology,
etc. Here is a search strategy that identifies all E. tenella genes predicted to
contain a secretory signal peptide and are possible kinases (note: the
orthology step at the end allows identification of possible gene
families):
New functionality:
- You can now view intron/exon boundaries based on available RNA
sequence data. boundary determination was performed using the RNA-Seq Unified
Mapper (RUM). To view intron/exon boundary calls, turn on the
"RNA-seq alignments RUM Intron" track in the genome browser. Here
is a link to a gene
that may require reevaluation based on RUM intron analysis. To
get additional information mouse over the tracks and/or
the track names.
EuPathDB
is now on FaceBook and Twitter
21 July 2011
We are pleased to
announce the release of MicrosporidiaDB 2.0
New data:
- Whole genome shotgun sequence of the following species
generated by The
Broad Institute Genome Sequencing Platform with
Federal funds from the National Institute of Allergy and Infectious
Diseases under Contract No.: HHSN2722009000018C:
- Encephalitozoon cuniculi
EcI
- Encephalitozoon cuniculi
EcII
- Encephalitozoon cuniculi
EcIII
- Nematocida parisii ERTm1
- Nematocida parisii ERTm2
- Vavraia culicis
floridensis
- Vittaforma corneae
- Genomic sequence data
from unannotated genomes can be searched from the "Genomic Sequences"
section under the heading "Identify other data types" on the home
page. Sequence data may be searched based on ID,
organism
or BLAST.
- Sequence and annotation for Encephalitozoon
hellem (graciously provided prepublication by Patrick Keeling's
group).
- Annotated genomes can be searched using any of the searches
under the heading "Identify Genes by:" on the MicrosporidiaDB home
page. Searches include, identifying genes by text, ID, BLAST, GO
terms, presence of transmembrane domains or signal peptides, orthology,
etc. Here is a search that identifies all E. hellem genes that have at least
7
transmembrane domains and do not have orthologs in mammals:
http://microsporidiadb.org/micro/im.do?s=d4174608fb1a4879
- Here is a search that quickly defines E. hellem orthologs of N. ceranae genes (note the column
labeled as "Input orthologs" refers to the N. ceranae genes in the same
orthology group as the E. hellem "Gene"
in the first column):
http://microsporidiadb.org/micro/im.do?s=9aa2e7a6ef1710f9
EuPathDB
is now on FaceBook and Twitter
21 July 2011
VectorBase is pleased to announce the VB-2011-06 release.
Highlights of this release include:
- Pre-site for Glossina moritans. Features mapped to the genome and preliminary gene sets available.
- New microarray data from Christian et al. and Cassone et al.
More details are available in the Release notes.
PATRIC has annotated and preformed comparative analyses on two isolates from the recent outbreak.
show more...

An outbreak of Escherichia coli causing a severe illness called hemolytic-uremic syndrome (HUS) began in Germany in May 2, 2011 and has killed more than 20 people and sickened more than 2,000. The organism causing the outbreak has been identified as a strain of E. coli O104:H4 that produces a Shiga toxin and causes an illness similar to infection with E. coli O157:H7. Two isolates from this outbreak have been sequenced. Both strains, TY-2482 (sequenced by the Beijing Genomics Institute in collaboration with University Medical Centre Hamburg-Eppendorf with 12X coverage from IonTorrent PGM: assembly) and LB226692, (sequenced by Life Tech in collaboration with the University of Muenster with 28X coverage from IonTorrent PGM: assembly) have been annotated and are now available at PATRIC Bioinformatics Resource Center.
The two genomes have been annotated using RAST, making them consistent with the 184 E. coli genomes and the total 2,865 bacterial genomes available at PATRIC. The proteins conserved across all E. coli have been used to generate a preliminary phylogenetic tree that is based on 166640 characters across 527 genes in 354 taxa. This tree, shown below shows that the two new strains are most closely related to the pathogenic, enteroaggregative strain 559899.

Click Image to enlarge.
Click here to view this tree in its interactive form on the PATRIC Website.
The proteins from these two new pathogenic strains can be compared to other bacterial genomes using the PATRIC Protein Family Sorter. An example of this, showing two of several unique islands that have been identified in these two genomes is provided in the figure below and in interactive form on the PATRIC Website by clicking here and then selecting the “Heatmap” tab.
See the Protein Family Sorter FAQs for help in using the sorter. An Excel file with tabs containing lists of proteins in these islands is available by clicking E coli protein islands.

Click Image to enlarge.
For a comparison of the RAST annotations with the other publicized annotation click here.
Much of the information in PATRIC is updated on an ongoing basis. Such as:
- Interactive Disease maps with outbreak information. Click here and then select the Disease Map tab.
- The PATRIC Google news search for countermeasures and other information. Click here.
Many laboratories are analyzing these genomes and providing data to the research community. PATRIC is performing additional analyses, including collecting a list of the important genes identified, and will be providing gene trees and multiple sequence alignments of the genes with their closest homologs, which we will release as additional news items. For updates:
For quarterly PATRIC updates on current PATRIC and PATRIC-related research, new PATRIC functionality, and PATRIC grant opportunities, please sign up for our PATRIC Newsletter.
We
are pleased to announce the release
of AmoebaDB 1.4
New Data:
- E. histolytica glycoproteins
proteomics data from Carpentieri et. al. To search this
data click on the
protein expression link under the "Identify Genes by:" heading, then
click on "Mass
Spec. Evidence". On this page you can expand the
"Entamoeba histolytica" item
and select your data "Mass
Spec/Glycosylation" and click "Get Answer".
- As usual you can combine search results with other searches in
a strategy. Here is an example
search strategy that leverages orthology to identify all Entamoeba orthologs of E. histolytica genes with
proteomics evidence from this data, that are predicted to have a
predicted signal peptide.
New functionality:
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for example, to return all genes that contain a specific DNA motif
within 1000 nucleotides upstream of the annotated start site. Or you
could return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with an AP-2
like motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
3 May 2011
We are pleased to announce the release
of TriTrypDB 3.1
New Data:
- Graphs and tables representing the expression and
percentile values for individual genes are available in the
"Expression" section of gene pages.
- T. brucei high-throughput
phenotyping of bloodstream and procyclic form parasites (Alsford et. al.).
- To search this data click on "Putative Function" under the
heading "Identify Genes by" on the home page then click on "High-Throughput
Phenotyping".
A sample strategy that searches this data for genes that are likely
essential in all stages or time points examined can be accessed here.
- Graphs and tables representing the expression and
percentile values for individual genes are available in the
"Phenotype" section of gene pages.
- T. brucei microarray
data based on induced DHH1 RNA helicase in wild type and DEAD:DQAD
mutant procyclic trypanosomes (Kramer
et. al.).
- Graphs and tables representing the expression and percentile
values for individual genes are available in the "Expression" section
of gene pages.
- T. brucei microarray
data based on heat shock treated and control procyclic trypanosomes (Kramer et. al.).
- Graphs and tables representing the expression and percentile
values for individual genes are available in the "Expression" section
of gene pages.
- L. major and L. infantum RNA sequence data highlighting
splice leader and polyA sites are available as genome browser
tracks. This data was graciously made available prepublication by
Dr. Peter Myler's group
at Seattle BioMed.
- This data can be searched by going to the "Protein
Expression" section under the heading "Identify Genes by" on the
home page, then selecting the "L.
mexicana intracellular
Gel free analysis" option. As usual results from any search can
be combined with other searches in TriTrypDB. For example, here
is a search strategy that identifies
genes with proteomics evidence from this data that have a predicted
signal peptide.
- Proteomics data may
also be viewed on gene pages as tables and graphical representation
(scroll down to the "Protein Expression" section) or in the genome
browser by selecting the appropriate track(s) and updating the
image. Here is a link to a genome browser section
highlighting this proteomics data.
- Draft assemblies of the genome sequences from the Trypanosomatid
sequencing white paper. This data is graciously provided
prepublication and full credit and data usage information is available
by clicking on the sequence name:
New functionality:
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with a
putative RNA binding protein-like site in their 3' region.
In addition, several new tutorials have been developed:
3 May 2011
We are pleased to
announce the release of MicrosporidiaDB 1.4
New data:
- Whole genome shotgun sequence of Nematocida parisii strain ERTm3
generated by The
Broad Institute Genome Sequencing Platform with
Federal funds from the National Institute of Allergy and Infectious
Diseases National under Contract No.: HHSN2722009000018C and obtained
from GenBank.
- Genomic sequence data
from unannotated genomes can be searched from the "Genomic Sequences"
section under the heading "Identify other data types" on the home
page.
New functionality:
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with an AP-2
like motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
3 May 2011
New Features in GiardiaDB
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with an AP-2
like motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
Follow EuPathDB on
Twitter
3 May 2011
We are pleased to
announce the release of PlasmoDB 7.2
New data:
- P. falciparum mRNA
half-life data intraerythrocytic developmental cycle (Shock et. al.)
is now available as mRNA expression and half-life graphs on gene
pages. To view this data visit any gene page and scroll down to
the section called "mRNA half-lives during intraerythrocytic
development - 3D7 post treatment with actD" in the Expression section
of any gene page.
- Updated linkouts to the Malaria Parasite Metabolic
Pathways
(MPMP) database. Numerous pathways have been added and
updated by
Dr. Hagai Ginsburg. A full description of new pathways is
available here
and genes included in the MPMP database can be searched based on
metabolic pathway in PlasmoDB here.
New functionality:
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with an AP-2
like motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
Follow EuPathDB on
Twitter
3 May 2011
We are pleased to announce the release
of ToxoDB 6.4
New Data:
- The first unannotated sequences from the Toxoplasma sequencing
white paper are now available via ToxoDB. You can access the
genome sequence of T. gondii
strains MAS, P89 and VAND from the "Genomic Sequences" section under
the heading "Identify other data types" on the home page. Genome
sequences from this effort will be incorporated into ToxoDB release as
they become available. The latest updated information on the
status of sequencing is available here. The sequencing white
paper is available here.
New functionality:
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example search
strategy that identifies protein coding genes with an AP-2 like
motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
3 May 2011
New Features in TrichDB
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with an AP-2
like motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
Follow EuPathDB on
Twitter
3 May 2011
We
are pleased to announce the release
of CryptoDB 4.4
New Data:
New functionality:
- You can now search for DNA segments defined by their chromosomal
location or their nucleotide sequence (DNA motif pattern). This
new search is available under "Identify Other Data Types"; click on
"Genomic Segments (DNA motif)" then select either DNA
motif pattern or Genomic
Location.
- We introduced a new combine operation that allows users to
combine
steps based on relative locations on the genome. This enables
you, for
example, to return all genes that contain a specific DNA motif within
1000 nucleotides upstream of the annotated start site. Or you could
return all genes that have a SNP between two strains within 500
nucleotides of the start of the gene (or return all SNPs that are
located in this region relative to genes).
- Here is an example
search strategy that identifies protein coding genes with an AP-2
like motif within 1000 bp of their 5' end.
In addition, several new tutorials have been developed:
3 May 2011
PATRIC’s April website release includes 155 new or updated genomes; new protein family and comparative pathway visualizations; user registration to save PATRIC data in your personal workspace for future use; new host-pathogen interactions; new ID mapping functionality; new compare region viewer; and much more.
show more...
Website Enhancements
Enhanced Protein Family Sorter Can Now Sort Based on Gene Order
- The Heatmap Viewer associated with the protein family sorter now allows one to choose a reference genome to act as an anchor and then sort all the protein families based on the gene order in that genome. This functionality allows the user to quickly identify groups of contiguous genes present or absent in one or more of the selected genomes. In addition, the user can quickly visualize areas of loss or gain of genomic segments across genomes that are closely or more distantly related. Some of these areas may correspond to pathogenicity islands.
- The color scheme of the Heatmap has been changed to improve visual field differentiation,
- Advanced filters have been added that allow the user to search for perfect or non-perfect protein families and filter them by the number of members and/or genomes per protein family.
For an example, please visit the following page and click on the “Heatmap” tab:
http://patricbrc.org/portal/portal/patric/FIGfamSorterB?cType=taxon&cId=1392&dm=result
Improved Comparative Pathway Analysis with New Heatmap Viewer
- A new interactive feature that shows the presence or absence of EC numbers in pathways has been added. This feature is in the form of a Heatmap Viewer and it allows users to see the presence or absence of individual EC numbers within a pathway, with the ability to make comparisons across a wide number of genomes.
- In addition, the data supporting the Heatmap visualization, such as the individual locus tags or the presence/absence across genomes of interest, is available for download by users.
For an example, please visit the following page and click on the “Heatmap” tab:
http://patricbrc.org/portal/portal/patric/CompPathwayMap?cType=taxon&cId=234&map=00362&algorithm=PATRIC
Save Your PATRIC Data With Our New User Registration, Login, and Workspace
Users of PATRIC can now register and create their own user-accounts. During registration, you can subscribe to the PATRIC Newsletter where you’ll receive updates about data and software releases, upcoming workshops, grant solicitations, presentations, and meetings. Once registered, logging into your personal PATRIC account allows you to save and manage the data groups you’ve gathered throughout the PATRIC site to your personal Workspace. Future Workspace plans include allowing users to save their preferences, thereby customizing searching and browsing at PATRIC, and allowing users to upload their personal data in PATRIC and incorporate it into our various visualization and analysis tools.
To read about the benefits of registration and register as a PATRIC user, please visit:
https://patricbrc.org/portal/portal/patric/MyAccount
New Integrated Text Mining
Text mining capability, previously provided in PATRIC by the standalone tool KLEIO, is now integrated as a component of the enhanced literature search and text mining feature. The text mining results are delivered using technology developed in conjunction with the UK National Text Mining Centre (NaCTeM). Text mining results are based on indexes of UK Medline abstracts and identify key entities from that text such as genes, proteins, metabolites, drugs, diseases, symptoms, and other entities of interest. For a taxon, genome, or gene of interest, results are available under the “Literature” tab. Results are summarized by entity types and allow progressive filtering of the results. Abstracts are also provided with key entities highlighted in different colors.
For an example, select the Text Mining tab at:
http://patricbrc.org/portal/portal/patric/Literature?cType=taxon&cId=1386&time=a&kw=none
PATRIC Now Has YouTube Tutorial and Use Case Videos
Tutorials focus on general site navigation and features while Use Cases offer more in-depth examples of gathering and analyzing data within PATRIC. Videos are available on PATRIC’s YouTube page.
Videos can also be accessed from the About menu in the top navigation bar of the PATRIC website or by visiting PATRIC eNews at http://enews.patricbrc.org/
New Searches and Tools
Compare Region Viewer
The new Compare Region Viewer, implemented using JBrowse and data APIs provided by the SEED, allows researchers to compare genomic regions around a gene of interest across other closely related genomes. Using this viewer, one can quickly detect differences in translation start sites, potential frame shifts, or missing genes. Coloring is based on protein functions and allows users to visually group proteins with similar functions. Users can also change the size of the region and number of genomes compared. This viewer can be accessed via the Compare Region Viewer tab available on feature-level pages.
For an example see:
http://patricbrc.org/portal/portal/patric/CompareRegionViewer?cType=feature&cId=17814393&window=10000®ions=10
Host-Pathogen Interactions
PATRIC now offers a Host-Pathogen Interactions Search, which allows users to find and visualize networks of experimentally-confirmed, protein-protein interactions that occur between host and bacterial proteins. Interaction data are collected from multiple public resources, including BIND, DIP, GRID, IntAct, and MINT. Search tools allow users to combine interaction networks from up to 12 bacteria, and also to filter according to interaction type, detection method, and original source. Results are displayed as an interactive graph and as a table of pairs of interacting proteins. From either display, users can select subsets of the data for download, alignment, and other actions; alternatively, bacterial proteins can be saved to groups for later use.
The Host-Pathogen Interactions Search can be accessed from the Searches and Tools menu in the top navigation bar of the PATRIC website or by visiting:
http://patricbrc.org/portal/portal/patric/HPIFinder?cType=taxon&cId=&dm=
ID Mapping Tool
PATRIC’s new ID Mapping Tool is powered by UniProt’s ID Mapping service. It allows users to quickly map PATRIC identifiers to those from other prominent external databases such as GenBank, RefSeq, EMBL, UniProt, KEGG, etc. Alternatively, users can start with list of external database identifiers and map them to the corresponding features at PATRIC.
The ID Mapping tool can be accessed from the Searches and Tools menu in the top navigation bar of the PATRIC website or by visiting: http://patricbrc.org/portal/portal/patric/IDMapping?cType=taxon&cId=&dm=
This same ID Mapping tool is also available from the light blue Table Toolbar on any of the search results or feature tables found throughout the PATRIC site. Any feature(s) from tables or search results can be selected. Then clicking the “Map IDs” icon in the Table Toolbar and selecting an external database will instantly convert these feature identifiers to their corresponding identifiers in the database that was selected.
For an example see:
http://patricbrc.org/portal/portal/patric/FeatureTable?cType=genome&cId=87468#key=178&pS=20&aP=1&dir=ASC&sort=genome_name,accession,start_max&sS=All&fT=CDS&alg=PATRIC&kW=
Genomes and Annotations
Between PATRIC’s Feburary 2011 release and April 2011, 132 new genomes have been added and 23 genomes have been updated or replaced with the newer versions. In total, 115 new genomes have been annotated using RAST.
Twelve genomes previously present in PATRIC have since become obsolete and were removed from the website. In addition, the Legacy BRC annotations from 49 genomes have been removed from the database and newer versions of these genomes have been updated with both RefSeq and RAST annotations.
These Legacy BRC annotations are still available, but are now found under the file download tab: http://brcdownloads.vbi.vt.edu/patric2/LegacyBRC/
A Sequence Summary of the data currently available on the PATRIC website through April, 2011 is provided in the table below:
|
Total Count |
PATRIC Annotation |
Legacy BRC Annotation |
RefSeq Annotation |
| Number of Genomes |
3252 |
2786 |
356 |
3192 |
| Number of Complete Genomes |
1369 |
1358 |
245 |
1353 |
| Number of WGS Genomes |
1486 |
1428 |
105 |
1443 |
| Number of Plasmid-only Genomes |
397 |
0 |
6 |
396 |
View this Sequence Summary in addition to Genomic and Protein Feature Summaries on the PATRIC website: http://patricbrc.org/portal/portal/patric/Taxon?cType=taxon&cId=2
We are pleased to announce the release
of PiroplasmaDB 1.0
This first release of PiroplasmaDB contains the
following datasets:
15 April 2011
VectorBase is pleased to announce the VB-2011-04 release.
Highlights of this release include:
- New microarray expression data for Culex quinquefasciatus and Aedes aegypti
- Updated mass spec. peptides for Anopheles gambiae available via DAS and simple browsing interface
- Anopheles funestus RNA-seq contigs available for BLAST and browsing and search
- GTF file format for use in tophat/cufflinks analysis of RNA-seq data available for download from all species.
More detailed release notes are available on the Wiki.
We are pleased to announce the release
of ToxoDB 6.3
This release of ToxoDB contains the
following new datasets:
- Histone 3 monomethyl-Lysine 4 (H3K4me1) chromatin
Immunoprecipitation chip data identifying active transcriptional
units.
This data was generated by Mathieu Gissot, J. Eduardo Fajardo, Louis M.
Weiss, Andras Fiser and Kami Kim (Albert Einstein College of Medicine)
and was graciously provided pre-publication to the Toxoplasma research community via
ToxoDB.
- This data is available as genome browser tracks. An example
region of the Toxoplasma
genome with these tracks turned on can be accessed here.
- T. gondii (VEG) and N. caninum (Liverpool) tachyzoite
RNA-sequence data. This data was generated by Adam Reid, Mandy
Sanders, Mike Quail, Matthew Berriman, Arnab Pain (Wellcome Trust
Sanger Institute) Rebecca Norton, Dong Xia, Sarah Vermont and Jonathan
Wastling (University of Liverpool). This data was graciously
provided pre-publication to the Toxoplasma
research community via
ToxoDB.
- Genes in ToxoDB may be identified using this data based on
their expression percentile. This
search
is available under the "Transcript Expression" section in ToxoDB. In addition, expression
coverage and percentile graphs are available on gene pages.
- RNA-sequence coverage graphs may also be visualized in the genome
browser. Data has been mapped to both the VEG and ME49
genomes.
22 February 2011
We are pleased to
announce the release of MicrosporidiaDB 1.3
This release of MicrosporidiaDB includes the following new data:
- Whole genome shotgun sequence of Nematocida parisii strain ERTm1
generated by The
Broad Institute Genome Sequencing Platform with Federal funds from
the National Institute of Allergy and Infectious Diseases National
under Contract No.: HHSN2722009000018C and obtained from GenBank.
22 February 2011
We are pleased to
announce the release of TriTrypDB 3.0
This release of TriTrypDB contains the following new datasets:
- This release of TriTrypDB includes a complete rebuild of the
genome sequences and annotation. Annotation improvements come
from manual curation (spearheaded by GeneDB) and direct contribution of
the broader kinetoplastid research community through publications and
User comments on TriTrypDB gene pages. While the annotation is
current as of November 2010, newer changes at GeneDB are prominently
highlighted on gene pages with linkouts to GeneDB.
- Trypanosoma brucei Lister
strain 427 genome sequence
and annotation provided prepublication by Dr. George Cross. The
assemblies were evaluated and annotations were transferred from the
genome reference strain TREU 927 by Matthew Rogers at GeneDB.
Additional annotation performed by the AutoMagi software by Dr. Peter
Myler's group at Seattle BioMed are available as a genome
browser track.
- Genes may be queried based on gene IDs, text terms, EC numbers,
GO terms, sequence features (length, introns, signal sequences, Pfam
domains), etc.
- Genes and sequences may also be interrogated based on BLAST
similarity, user-defined motifs, etc.
- Please note that Lister 427 genes assembled de-novo, rather
than using TREU 927 chromosome templates, and including all VSG genes,
will be available for BLAST searching and BOWTIE mapping at the Cross
Lab and for BLAST searching at TriTrypDB.
- Phosphorylated cytosolic fraction peptides from bloodstream T. brucei (Nett et. al.).
Genes with evidence of protein expression based on this experiment can
be searched via the Mass Spec. As usual, results
from this experiment can be combined with results from other searches
-- for example, find all genes with evidence of phosphorylated peptides
that are also likely kinases -- visit this link
(or share it with others) to see the results of this search.
- TriTrypDB now provides linkouts to LeishCyc (database of
Leishmania metabolic pathways) from relevant Leishmania gene
pages.
The linkouts are available in the external links section of gene pages
-- for example: LmjF.07.1070
- A draft assembly of the genome sequence of Trypanosoma cruzi Esmeraldo strain
cl3. The sequence data has been graciously made
available prepublication to the community via TriTrypDB by Dr. Gregory
A. Buck (Virginia Commonwealth University) and The Genome Center at
Washington University School of Medicine. Total sequence coverage
was ~63X (fragments ~47X, 3 Kb paired end (PE) ~11X, 8 Kb PE ~5X) using
a genome size estimate of 40 Mbp. The combined sequence reads were
assembled using the Newbler software (Roche 454). The first draft
assembly was referred to as T_cruzi_Esmeraldo_5.1 and has been cleaned
of contaminants. The assembly is made up of a total of 23,808 scaffolds
with an N50 scaffold length of over 52kb (N50 contig length for the
31,945 contigs was 3.3kb). The assembly spans 35.5Mb. This assembly is
subject to change due to ongoing assembly improvement efforts. Complete
data source information is available here.
22 February 2011
Click to view specific details of the Sequence Summary.
show more...
Between PATRIC’s December 2010 release and Feb 18th 2011, 143 new genomes were added and annotated by RAST. The following table summarizes the data available in the PATRIC database through the February 18th 2011 PATRIC Website Release.
Sequence Summary
|
PATRIC |
Legacy BRC |
RefSeq |
| Number of genomes |
2671 |
405 |
3068 |
| Number of Complete genomes |
1294 |
251 |
1281 |
| Number of WGS genomes |
1377 |
149 |
1403 |
| Number of Plasmid only genomes |
0 |
5 |
384 |
|
View Genomic and Protein Feature Summaries on the PATRIC website.
VectorBase is pleased to announce the VB-2011-02 release.
Highlights of this release include:
Preliminary Rhodnius prolixus genome browsers with transcript and protein similarities, repeats and preliminary gene predictions.
Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
Update to the Xrefs for Aedes, Anopheles.
More detailed release notes are available.
VectorBase is pleased to announce the VB-2010-12 release.
The December release of VectorBase (VB-2010-12) includes the first gene set update in over a year (for Anopheles gambiae PEST), and new expression, next-gen transcriptomics and variation data.
Highlights of this release of VectorBase include:
* New gene set AgamP3.6 for Anopheles gambiae PEST
* mRNAseq data for eight Anopheline species
* Wolbachia infection microarray experiment for Aedes aegypti added
* Genotype data for 60 M, S and Bamako Anopheles gambiae isolates
PATRIC’s December website release now includes 296 new genomes (with RAST annotation); improved FIGfams; updated functional annotation of proteins; new plasmid specific BLAST databases; new virulence and disease pages; new phylogeny species tree viewer; new 3D structure viewer; new interactive multiple sequence alignment viewer; integration with BEIResources and Structural Genomics Centers; and much more.
show more...
Website Enhancements
New Virulence and Disease Information
Information related to virulence, disease and outbreaks is now available through a new PATRIC tab at all taxonomy levels.
- Shows relationship between pathogens, diseases, virulence factors, and host genes.
- Represents data in tables, as well as, in a graph visualization.
- Includes global disease reports (e.g., incidents and other) in an interactive map as provided in collaboration with Health Map.
For an example, see http://patricbrc.org/portal/portal/patric/DiseaseOverview?cType=taxon&cId=1763
New Phylogenetic/Species Tree Viewer
We have updated all PATRIC phylogenetic trees to include latest genomes. These are accessed via a our new interactive, multicolor tree viewer which show large trees and phylogenetic relationships more efficiently and effectively than traditional coloring schemes.
For an example, see http://patricbrc.org/portal/portal/patric/Phylogeny?cType=taxon&cId=356
Improved Protein Family Sorter
The Protein Family Sorter (PFS) is now available as a tab from taxonomy pages at genus level and below. PFS now includes:
- PATRIC’s Protein Family Sorter allows users to search and filter on FIGfams and now provides improved coverage (as compared to previous releases) of proteins in PATRIC.
- New interactive heatmap visualization that allows users to visually sift through large numbers of protein families based on their presence or absence in the genomes of interest. Users can select collections of proteins (from genomes and/or protein families of interest) from the heatmap and download related data in multiple formats. For an example, see http://test.patricbrc.org/portal/portal/patric/FIGfamSorterB?cType=taxon&cId=1392&dm=result#gs_0=-1638842268.
- New interactive multiple sequence alignment (MSA) for protein families. More information on MSA is described below.
For an example, see http://patricbrc.org/portal/portal/patric/FIGfamSorter?cType=taxon&cId=234&dm=result
Protein Structure Viewer
To support exploration of protein structures, PATRIC now boasts an interactive JMOL-based viewer that displays 3D protein structures (accessible via the ‘Experiment Data’ tab at all taxon levels). Features, such as ligands, chains, and secondary structure annotations, can be dynamically displayed directly on these structures using interactive controls.
For an example of a structure list, see http://patricbrc.org/portal/portal/patric/Structure?cType=taxon&cId=234&filter=ssgcid#key=1747&pS=20&aP=1
For an example of our 3D viewer, see http://patricbrc.org/portal/portal/patric/Jmol?structureID=3P0X
Integration with BEIResources and Structural Genomics Centers
We now offer direct links from specific PATRIC structures to clone sets available from Biodefense & Emerging Infections Research Resources Repository (BEIRRR). Specifically, PATRIC links to BEIRRR clones which were submitted by the Seattle Structural Genomics Center for Infectious Disease (SSGCID) and the Center for Structural Genomics of Infectious Diseases (CSGID). These links are available from appropriate feature and structure pages.
New Interactive Multiple Sequence Alignment Viewer Now Available from Protein Family Sorter
PATRIC’s new multiple sequence alignment (MSA) viewer displays MSA and family tree side-by-side and allows users to collapse/expand tree branches in order to co-locate sequences of interest. The MSA viewer also allows interactive coloring of amino acids in the MSA based on conservation per position.
Enhanced Genome Selector in all Searches and Tools
Many of PATRIC’s searches and tools allow users to select specific orders, species, etc. in order to scope their search accordingly. To improve the selection process across all of these searches and tools, PATRIC now provides an improved genome selection panel that allows on-the-fly, in-line search to quickly find taxonomy nodes or genomes of interest. Searching and selecting is supported in the (previously released) taxonomy tree structure, as well as, the new alphabetized list.
For an example of the enhanced genome selector, see http://patricbrc.org/portal/portal/patric/GenomeFinder?cType=taxon&cId=&dm=
Improved Toolbar for All PATRIC Tables
Every table in PATRIC (e.g., feature tables, search result tables, tables of protein families), now sport an Improved toolbar design with intuitive icons and grouping of related actions. These new table toolbars allow users to select items in the table and perform various actions on them, such as view sequence data, download data, send selected features to analysis tools (e.g., MSA), and so on.
For an example, see http://patricbrc.org/portal/portal/patric/FeatureTable?cType=taxon&cId=234
PATRIC Group Explorer
Throughout the site, users can create custom ad-hoc groups of features using PATRIC’s ‘Add to Cart’ button on any table’s control bar. Once groups have been created, users can employ the Group Explorer to compare, contrast and even crate new groups using a Venn diagram-based interactive visualization. Users’ can conveniently access the Group Explorer from the Feature Cart icon at the top of every PATRIC page.
For more information, see the PATRIC Group Explorer FAQs.
Improved Protein Family Sorter
The Protein Family Sorter (PFS) is now available as a tab from taxonomy pages at genus level and below. PFS now includes:
- PATRIC’s Protein Family Sorter allows users to search and filter on FIGfams and now provides improved coverage (as compared to previous releases) of proteins in PATRIC.
- New interactive heatmap visualization that allows users to visually sift through large numbers of protein families based on their presence or absence in the genomes of interest. Users can select collections of proteins (from genomes and/or protein families of interest) from the heatmap and download related data in multiple formats. For an example, see http://test.patricbrc.org/portal/portal/patric/FIGfamSorterB?cType=taxon&cId=1392&dm=result#gs_0=-1638842268.
- New interactive multiple sequence alignment (MSA) for protein families. More information on MSA is described below.
For an example, see http://patricbrc.org/portal/portal/patric/FIGfamSorter?cType=taxon&cId=234&dm=result
Subscribe to PATRIC Newsletters
PATRIC is now encouraging users to sign up to receive quarterly newsletters. Newsletters will be used to announce new PATRIC data, tools, website features, and events. Recipients will also receive PATRIC-funded requests for proposals. To subscribe, use PATRIC’s new ‘Subscribe’ link located at the top of every PATRIC page, or use the subscription form on our PATRIC Facebook page.
PATRIC is Now on Facebook and YouTube
PATRIC now has a social media presence at both Facebook and YouTube. Our Facebook page shows our latest news and allows Facebook users to share PATRIC’s resrouces with their colleagues. PATRIC’s YouTube page, is newly established, but will contain short videos that describe PATRIC website features, as well as, videos of appropriate workshop content.
Miscellaneous Enhancements to the PATRIC Website
New Searches and Tools
Plasmid BLAST
PATRIC now supports plasmid-specific BLAST databases containing genomic sequences and proteins (annotated by PATRIC and RefSeq) from all of the plasmids in PATRIC database. As such, users can BLAST PATRIC or custom sequences specifically against all PATRIC bacterial plasmids. To view, see http://patricbrc.org/portal/portal/patric/Blast
GO Search
PATRIC now supports searching based on Enzyme Commission number (EC number); a numerical classification scheme for enzymes that is based on the chemical reactions they catalyze. This new tool allows users to enter a keyword, a known EC number, or a recognized EC name and view all related proteins with that specific annotation.
To view, see http://patricbrc.org/portal/portal/patric/GOSearch?cType=taxon&cId=&dm=
EC Search
Similarly, our GO Search allows users to search for proteins based on Gene Ontology (GO) terms representing gene product properties. Users enter a keyword, a known GO term number, or a recognized GO name to view all proteins with that specific annotation.
To view, see http://patricbrc.org/portal/portal/patric/ECSearch?cType=taxon&cId=&dm=
Data Download Tool
The new Download Tool allows users to select multiple genomes and download corresponding annotations in various different file formats in a single click. To view, see http://patricbrc.org/portal/portal/patric/Downloads
MG-RAST
We now provide an interface to the metagenome RAST annotation service (MG-RAST).
To view, see http://patricbrc.org/portal/portal/patric/MGRAST
Genomes and Annotations
In this release, 322 new genomes have been added and annotated by RAST since PATRIC’s August 2010 release. The following table summarizes the data available in the PATRIC database through December, 2010 PATRIC Website Release.
For summary, see http://patricbrc.org/portal/portal/patric/Taxon?cType=taxon&cId=2
Annotations Updates & Synchronization
Functional annotation of all the proteins (i.e. product, GO, EC, and pathway assignments) have been updated based on the latest FIGFams. Functional annotations are now consistent across all genomes in the PATRIC database.
We are pleased to
announce the release of TriTrypDB 2.5
This release of TriTrypDB contains the following new datasets:
New Data:
- L. braziliensis
secretome proteomics data from Cuervo et. al.
This data can be searched from the Mass spec. evidence search page
under the heading "Protein Expression" -- listed under Leishmania braziliensis. In
addition, peptide evidence is displayed on gene pages in the Protein Section
and in the genome browser by turning on
the track specific for this data.
- L. major exosome
proteomics data from Silverman et. al. This data can be
searched from the Mass spec. evidence search page
under the heading "Protein Expression" -- listed under Leishmania major In
addition, peptide evidence is displayed on gene pages in the Protein
Section
and in the genome browser by turning on
the track specific for this data.
- L. infantum differentiation microarray data from Rochette et. al. This data can be
searched from the Microarray
Evidence
search page under the heading "Axenic vs Intracellular Amastigotes
Comparison (Papadopoulou)". This data can be searched based on
fold change differences between life stages examined or based on
percentile expression. As usual, searches may be combined
with other searches in TriTrypDB -- for example this
search
combines a search of this data (gene upregulated in amastigotes v.
promastigotes) with a GO Term search. In addition, microarray
data is displayed on individual gene pages
as graphs and a table in the expression section. You can click on
"show" to see the table and you can select the type of graph to see
(fold change graph, percentile graph or both).
- A draft assembly of the genome sequence of Leishmania major
(strain SD 75.1). The sequence data has been graciously made
available prepublication to the community via TriTrypDB by Dr. Stephen
Beverley. The assembly spans 30.8 Mb and is made up of a total of
1904 scaffolds with an N50 scaffold length of over 691kb (N50 contig
length for the 4009 contigs is 27kb). Complete data source
information is available here.
- Gene names from GeneDB
are now included as part of the product description field on gene pages
in TriTrypDB.
New Feature in TriTrypDB: you
can now add publications to your user comments using their Digital Object Identifier
(DOI) numbers. To view a list of all genes that have a user
comment in TriTrypDB, click here.
18 November 2010
We are pleased to
announce the release of AmoebaDB 1.3
This release of AmoebaDB includes the following new data set:
Draft assemblies of the
following genomes (Complete data source information is available here):
- E. histolytica DS4
- E. histolytica KU50
- E. histolytica MS96
- E. histolytica HM-1:CA
- E. histolytica KU48
- E. histolytica KU27
Genomic sequence data in AmoebaDB may
be searched and accessed in multiple ways:
The sequence data has been graciously made available prepublication to
the community via AmoebaDB by Lis Caler at the J. Craig Venter
Institute (JCVI) and were generated as part of the Entamoeba strain sequencing white
paper. Additional information on the project at JCVI can be
accessed here.
Two presentations from JCVI are available here:
New Feature in AmoebaDB: you
can now add publications to your user comments using their Digital Object Identifier
(DOI) numbers. To view a list of all genes that have a user
comment in PlasmoDB, click here.
18 November 2010
We are pleased to
announce the release of PlasmoDB 7.1
This release of PlasmoDB contains the following new data:
- P. falciparum blood
stage proteomics data (trophozoites and early and mature gametocytes)
from Silvestrini et. al. This
data can be searched from the Mass spec. evidence search page
under the heading "Protein Expression" -- listed under Plasmodium falciparum and
selecting "Blood Stage Proteome - Trophozoites, Early Gametocytes (gct
I/II) and/or Mature Gametocytes (gct V)". In
addition, peptide evidence is displayed on gene pages in the Protein
Section
and in the genome browser by turning on
the track specific for this data. An example search strategy that
identifies genes that have mass spec evidence in the early gametocytes
but not in trophozoites or mature gametocytes can be accessed and
modified here.
New Feature in PlasmoDB:
- You can now add publications to your user comments using their Digital Object Identifier
(DOI) numbers. To view a list of all genes that have a user
comment in PlasmoDB, click here.
- You can view microarray oligo mapping for all arrays with data in
PlasmoDB. Oligo mapping can be viewed in the Genome
Browser by turning on tracks under the heading "Expression
Microarray Probes".
Note: We have corrected errors
inadvertently introduced into the last build of
PlasmoDB (7.0, released on Sept. 21, 2010) that resulted in inaccurate
representation of P. yoelii
liver stage expression profiling data. Please do not hesitate to contact us if you have any
questions.
18 November 2010
VectorBase is pleased to announce the VB-2010-10 release.
Highlights of this release include:
* Preliminary Anopheles gambiae M and S genome browsers with EST mappings and EST-based gene predictions.
* Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
* Update to the Xrefs for Aedes, Anopheles, Culex and Ixodes.
We are pleased to
announce new data and features in GiardiaDB 2.3
This release of GiardiaDB includes the following new data set:
- Giardia intestinalis microarray
data during encystation from Adrian Hehl's group (Morf et. al.). Three types of
searches can be performed to
interrogateinterrogatedinterrogatedinterrogatedinterrogatedinterrogated
this data and are available in
the microarray
search page. As an example, one may ask for all genes that
are differentially regulated 7 hours post encystation (under both
encystation methods used in this study). To see the results of
this question please follow the shared strategy here.
Gene pages also contain a graphical representation of the microarray
results.
New features in this release include:
21 September 2010
We are pleased to
announce the release of AmoebaDB 1.2
This release of AmoebaDB includes the following new data set:
- Proteomics data from E.
histolytica phagosomes of various degrees of maturation
(Boettner
et. al.). This data can be searched in AmoebaDB from the Mass
Spec. search page and results can be integrated with other search
types. An example search strategy that integrates different
search types can be viewed here.
This strategy asks for all E.
histolytica genes with phagosome proteomic evidence, that
contain transmembrane domains and contain a "kinase" PFAM
domain.
- Gene that have evidence of protein expression also have a
graphical representation of the peptide hits on their respective gene
pages (mousing over the graphics provides additional information such
as peptide sequence). Here is a link to a gene page with
proteomics evidence -- scroll down to the "Protein" section and show
"Protein features". Note that samples are color coded.
- Proteomics data can also be visualized in the genome
browser. This link
shows a 200Kb genomic region with the gene models and proteomics track
turned on.
New features in this release include:
- The search strategy window is now resizable.
21 September 2010
We are pleased to
announce the release of TriTrypDB 2.4
This release of TriTrypDB contains the following new datasets and
features:
New Data:
- Genome sequence of Crithidia
fasciculata (strain/isolate Cf-C1). The sequence data has
been graciously made available prepublication to the community via
TriTrypDB by Dr. Stephen Beverley. The C. fasciculata strain was obtained
from Dr. Larry Simpson, University of California, Los Angeles.
Genomic DNA was generated by Drs. Natalia Akopyants, Lon-Fye Lye and
Stephen Beverley, Washington University School of Medicine. A
combination of next-generation sequencing and Sanger reads were
produced for a draft assembly by The Genome Center at Washington
University. Complete data source information is available here.
- Additional ChIP-sequence data is available via the genome browser
provided by Drs. George Cross, Nicolai Siegel and Jessica Wright (Wright et. al. and Siegel et. al.). An example
genome browser view with ChIP-seq track turned on may be viewed here.
New features in this release include:
21 September 2010
We are pleased to
announce the release of MicrosporidiaDB 1.2
This release of MicrosporidiaDB includes genome sequence survey data from a number of microsporidial species
The following Microsporidia
sequences from Genbank's Genome Survey Sequences Database (GSS) are now
available via MicrosporidiaDB:
- Anncaliia algerae
- Edhazardia aedis
- Nosema bombycis
- Vittaforma corneae
GSS data is available under the heading "Genomic Sequences" on the home
page and may be searched by species,
genomic
sequence IDs and BLAST.
New features in this release include:
- The search strategy window is now resizable.
21 September 2010
We are pleased to
announce the release of ToxoDB 6.2
New Data in this release:
This release of ToxoDB includes the tachyzoite cell-cycle
microarray
data from Michael White ( Behnke et. al.). Searches
against this data are available in the microarray
search page of ToxoDB under the heading "Tachyzoite Cell Cycle
(White)". This data can be interrogated using a number of
searches:
- Genes can be identified based on the fold change in their
expression between selected time points from the experiment using the
T.g.
Cell Cycle (fold change).
- Genes can be identified based on their maximal or minimal
expression among selected time points using the T.g. Cell Cycle (exprn
timing) search.
- Genes with a similar expression profile throughout the cell cycle
compared to an input gene can be identified using the T.g. Cell Cycle
(similarity) search. For example, genes that have a similar
expression profile throughout the cell cycle compared to a putative
nucleoporin can be identified as illustrated by this search strategy.
A graphical representation of the search results is presented along
with the list of results showing the expression profile of the
reference gene and each of the comparator genes.
In addition to specific searches against the cell cycle data, each gene
page contains a graphical representation of the gene's expression
profile throughout the cell cycle.
New features in this release:
21 September 2010
We are pleased to
announce the release of PlasmoDB 7.0
This release of PlasmoDB contains the following new datasets and
features:
New Data:
- Updated sequence and
annotation for Plasmodium falciparum, P. chabaudi, P. knowlesi and P. berghei was obtained from the
Pathogen Sequencing Unit at the Wellcome Trust Sanger Institute.
Additional details may be viewed here.
- Updated linkouts to the Malaria Parasite Metabolic Pathways
(MPMP) database. Numerous pathways have been added and updated by
Dr. Hagai Ginsburg. A full description of new pathways is
available here
and genes included in the MPMP database can be searched based on
metabolic pathway in PlasmoDB here.
- Plasmodium falciparum, P.
vivax, P. yoelii, P. berghei and P. chabaudi predicted ApiAP2
transcription-factor (TF) binding site data. This data was
graciously provided prepublication by Dr. Manuel Llinas. Gene
with predicted binding sites to each ApiAP2 TF may be identified using
the TF
binding site search available via the Transcript Expression section
of PlasmoDB. Search parameter such as distance from the 5' end,
number of TF binding sites and probability score can be adjusted as
needed. In addition, TF binding sites can be viewed in the genome
browser with additional information available when mousing over TF
binding site graphics. An example genome browser page may be
viewed here.
- P. falciparum
RNA-sequence data from the intraerythrocytic life cycle. This
data was graciously provided prepublication to the community via
PlasmoDB by Dr. Rich�rd B�rtfai, Wieteke A.M. Hoeijmakers and Dr.
Hendrik G. Stunnenberg. Searches against this data may be
accessed from the RNA-seq
evidence section in the Transcript Expression section
of PlasmoDB. RNA-seq data may also be viewed on individual gene
pages where a graphical representation of the gene's expression profile
throughout the time points of this study is presented. In
addition, RNA-seq coverage graphs can be viewed through the
PlasmoDB genome
browser.
- P. falciparum
ChIP-sequence data from the intraerythrocytic life cycle. ChIP
was performed using antibodies to H2A.Z, H3K9ac & H3K4me3. This
data was graciously provided prepublication to the community via
PlasmoDB by Dr. Rich�rd B�rtfai, Wieteke A.M. Hoeijmakers and Dr.
Hendrik G. Stunnenberg. This data can be viewed through the
PlasmoDB genome
browser.
- P. falciparum
RNA-sequence data of lab-adapted field parasites from malaria-infected
pregnant women and children. This data was graciously provided
prepublication to the community via PlasmoDB by Drs. Marissa Vignali
and Patrick E. Duffy. Searches against this data may be accessed
from the RNA-seq
evidence section in the Transcript
Expression section
of PlasmoDB. RNA-seq data may also be viewed on individual gene
pages
where a graphical representation of the gene's expression profile
throughout the time points of this study is presented. In
addition,
RNA-seq coverage graphs can be viewed through the PlasmoDB genome
browser.
New features in this release include:
21 September 2010
Release 5.1 Sept 2010 (No newsletter)
Release 2.1 September 2010 (No newsletter)
VectorBase is pleased to announce the VB-2010-08 release.
Highlights of this release include:
* New genome browser look and feel.
* Three Aedes aegypti expression studies added.
* Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
* Update to the Xrefs for Aedes, Anopheles, Culex and Ixodes.
* Update to the EST data for Aedes and Anopheles.
PATRIC's latest website update now includes 93 new genomes; new Home Page and site-wide look and feel; new eNews content and outreach Web presence; improved FAQs; new comparative pathway tool; enhancements to Genome and Pathway Finders; post-genomic data; text-mining tools; and new data file types available via FTP download.
show more...
Genome and Annotations
In this release, 93 genomes have been annotated by RAST and added to PATRIC. These include genomes that have been newly released at RefSeq and genomes that have been updated. The following table summarizes the data available in the PATRIC database through August, 2010 PATRIC Website Release.
Website Enhancements
New PATRIC Home Page
We have redesigned the PATRIC home page to provide additional summary information about, and easy access into, our bacterial genome resource. Specifically, the home page now contains the following regions of interest:
- Watchlist Genera: Contains the list of 22 genera along with a detailed pane that displays a variety of information about genera of interest including a link to the genera overview page; taxon lineage; number of CDSs; number of Genomes; links to literature, taxonomy, phylogeny and post-genomics pages; and relevant images retrieved in real-time from Google.
- Searches and Tools: Provides lightweight interfaces to four of PATRIC’s tools including the Genome Finder, Feature Finder, Comparative Pathway Tool, and Protein Family Sorter. Users can also access the full featured front-end to each of these searches and tools by selecting the “Advanced Search” option.
- Most Viewed Bacteria Tag Cloud: Provides an “at a glance” view of the most popular bacteria pages visited by PATRIC users as tracked by Google Analytics. Users can visit the overview page for a specific bacteria by selecting one of the terms in the Tag Cloud.
- eNews: Displays a designated “Feature” article, as well as the most recent news related to PATRIC. A more complete description of eNews is provided below.
We have also upgraded the look and feel of the entire PATRIC website; including a new RSS feed, and social networking “share” capabilities. Site-wide, the main navigation remains consistent with the previous PATRIC website, with slight modifications to the “About” tab to accommodate the new FAQ and eNews web features.
PATRIC eNews
This release marks the introduction of PATRIC eNews; an interactive outreach-centered resource containing up-to-date information including PATRIC announcements, presentations, publications, website and data releases, and news stories that reference PATRIC.
PATRIC FAQs
PATRIC has new and expanded Frequently Asked Questions (FAQs) covering many of PATRIC’s key features, searches and tools. These FAQs allow users to fully leverage PATRIC’s capabilities. This release includes the FAQs about Annotation, Feature Cart, Feature Table, Comparative Pathway Tool, and Protein Family Sorter.
Comparative Pathway Analysis
PATRIC now supports comparative pathway analysis across multiple genomes. Data can be accessed in one of the two ways:
1) “Pathway” Tab: Is now available at any taxonomy level (for an example, see the Mycobacterium Pathway tab). Using the Pathway tab at this level provides all pathways, as well as the EC numbers and genes that have been annotated at this level. Clicking on any of the tabs available (Pathways, EC Numbers, or Genes) can take the user directly to specific annotation information. Additional features included on the opening page include:
- Genome Count: This gives the number of genomes that have some genes present in this pathway at the taxonomic level chosen.
- Unique Gene Count: This provides a list of all the genes in all the genomes that belong to this pathway. Clicking on any number in this column will provide a list of the annotated genes in each genome that belong to this family
- Unique EC Count: The Enzyme Commission number (EC number) is a numerical classification scheme for enzymes, based on the chemical reactions they catalyze and in a given KEGG metabolic pathway, each step has an EC number assigned to it. In a given genome there may be several genes that have been assigned the same EC number, meaning that several different genes have the possibility of doing the same job. The unique EC count tells how many steps within the pathway have at least one gene behind them.
- EC Conservation %: This number gives the percent of unique EC numbers present in all pathways. 100% describes a situation in which all the unique EC numbers are present in all the genomes being examined. Smaller numbers indicate that there is one or more genomes are missing some EC numbers.
- Gene Conservation: A genome can have several genes that have been assigned the same EC number. Gene conservation provides an estimate of pathways where there might be redundancies, or where EC numbers are missing. Numbers greater than one mean that in at least one genome, there is more than one gene that has been assigned a particular EC number. Numbers less than one mean indicate that in at least one genome, a particular EC number is missing. This provides a quick way to see which pathways have perfect conservation across all genomes (Gene Conservation = 1) to those pathways where there are differences among the genomes. The users are then able to explore these differences by drilling down on either the Unique Gene Count, or the Unique EC Count.
- User can also visualize data on pathway map by clicking on a “Pathway Name”. This takes the viewer to a KEGG pathway where all the annotation for all selected genomes has been mapped and displayed by either looking across the pathway, or by the table on the left that summarizes the information.
2) Comparative Pathway Tool : Allows users to select multiple genomes from different genus or taxonomic groups and compare metabolic pathways. Data is presented in a way similar to described above.
For more information, visit Comparative Pathway Tool FAQs.
Post-genomics Data
PATRIC now provides a summary of available post-genomic data across multiple sources, with consolidated access to specific experimental datasets, details, and results. At this time, PATRIC retrieves this data in real-time from the prominent post-genomic databases. Data is displayed by clicking the ‘Post-genomic Data’ tab at any taxonomic level. Actual experimental data and results can be accessed via link outs to respective databases. The post-genomic data is divided into categories that include Transcriptomics/Expression, Proteomics/Mass Spec data, Structure and Protein-Protein Interaction (for example, see Escherichia & Mycobacterium ).
Improved Search Tools
- Genome Finder: The usability of this tool has been improved both in speed and functionality. The results are organized as a list of genomes and a list of genomic sequences that match the specified search criteria.
- Pathway Finder now Comparative Pathway Tool: The usability of this tool has been improved both in speed and functionality. The results are presented as list of pathways, a list of EC numbers, and a list of genes that match the search parameters specified by the user. Comparative Pathway Tool also now supports comparative pathway analysis when multiple genomes are selected.
Data Downloads
PATRIC now offers downloadable data files in several different formats, including FASTA, tab delimited, and GenBank file formats. Download files are organized by genomes. Each genome directory includes FASTA sequence files for genomic sequences (*.fna), protein coding genes (*.ffn), RNA coding genes (*.frn), and proteins (*.faa). Each directory also includes annotations in GenBank file format (*.gbf). Genome directories include tab delimited files for all genomic features (*.features.tab), protein coding genes (*.cds.tab), RNAs (*.rna.tab), GO assignments (*.go), EC assignments (*.ec), and pathway assignments (*.path). Data from three different annotation sources, PATRIC, RefSeq, and Legacy BRCs, are provided in separate files (*.PATRIC.*, *.RefSeq.*, *.BRC.*). Each genome directory also includes a tab delimited file containing mappings between PATRIC and RefSeq identifiers (*.PATRIC2RefSeq). Files are available under genomes directory under Downloads. A “Download genome data” link is provided on the top right corner of all genome level pages.
We are pleased to announce a new release of ToxoDB. While this release does not include new data, there are some new features and improvements.
- The Basket and Favorites functionality has been extended to include all record classes.
- Queries have been optimized to increase site performance.
- A number of bugs have been fixed.
15 July 2010
We are pleased to
announce new features in Giardia 2.2
This release of GiardiaDB includes the following new data sets:
- Giardia intestinalis microarray
data during host parasite interaction. This data was graciously
provided prepublication by Emma Ringqvist and Staffan Sv�rd (Uppsala
University, Sweden).
- Manually curated SAGE tag data provided by Andrew McArthur and
described by Birkeland
et. al.
15 July 2010
We are pleased to
announce the release of TriTrypDB 2.3
This release of TriTrypDB contains the following new datasets:
- Digital gene expression data for T. brucei procyclic and blood form stages provided by Annette Macleod.
Digital gene expression analysis of two life cycle stages of the human-infective parasite, Trypanosoma brucei gambiense reveals differentially expressed clusters of co-regulated genes. Veitch NJ, Johnson PC, Trivedi U, Terry S, Wildridge D, MacLeod A. BMC Genomics. 2010 Feb 22;11:124.
- Links have been added to the TcSNP database of genetic variation in T. cruzi from T. cruzi gene record pages.
15 July 2010
We are pleased to
announce the release of MicrosporidiaDB 1.1
This release of MicrosporidiaDB includes the genome sequence and
annotation of Enterocytozoon bieneusi
- Sequence and annotation of E.
bieneusi are described in
the publication "Genomic
survey of the non-cultivatable opportunistic human pathogen, Enterocytozoon bieneusi" and
were obtained from GenBank.
- Genome sequence of E. bieneusi can be compared to the
sequences of E. cuniculi and
E. intestinalis using
synteny.
- E. bieneusi genes may be identified
based on text terms, ID #, GO terms, chromosomal location, BLAST,
sequence features (length/MW, exons, signal sequences, Pfam domains,
user-defined motifs), orthology, etc.
In addition, this release features the new MicrosporidiaDB logo.
The original image was provided by Dr. Louis Weiss, Albert Einstein
College of Medicine.
15 July 2010
We are pleased to
announce the release of AmoebaDB 1.1
This release of AmoebaDB includes the following new data sets:
- Gene expression in Entamoeba
histolytica cysts and trophozoites. This study is
described by Ehrenkaufer
et. al.
- Impact of intestinal colonization and invasion on the Entamoeba histolytica transcriptome
This study is described by Gilchrist et. al.
Genes can be identified based on the fold change in their expression in
either of the above experiments using the microarray search available
in AmoebaDB (under the heading "Transcript Expression).
In addition, AmoebaDB now has a logo! Template images used in
AmoebaDB were kindly provided by Graham Clark, Fiona Henriquez, William
Petri and Greg Roberts.
15 July 2010
We are pleased to
announce the release of PlasmoDB 6.5
This release of PlasmoDB contains the following new datasets:
- A proteomics dataset for P. falciparum from 42 and 48 hrs post infection kindly provided prepublication by Paul Bowyer and Matthew Bogyo, Stanford University School of Medicine.
- P. falciparum SNP calls for 186 culture adapted strains provided by Jianbing Mu and Xinzhuan Su, NIH.
Plasmodium falciparum genome-wide scans for positive selection, recombination hot spots and resistance to antimalarial drugs. Nat. Genet. 2010 Mar;42(3):268-71 Mu et al.
15 July 2010
We are pleased to announce a new release of TrichDB. While this release does not include new data, there are some new features and improvements.
- The Basket and Favorites functionality has been extended to include all record classes.
- Queries have been optimized to increase site performance.
- A number of bugs have been fixed.
15 July 2010
We are pleased to announce a new release of CryptoDB. While this release does not include new data, there are some new features and improvements.
- The Basket and Favorites functionality has been extended to include all record classes.
- Queries have been optimized to increase site performance.
- A number of bugs have been fixed.
15 July 2010
VectorBase is pleased to announce the VB-2010-06 release.
Highlights of this release include:
* New genome browser look and feel
* Human body louse Pediculus humanus genome added
* IDOMAL malaria ontology released
Additional genome data added from RefSeq & GenBank, as well as, annotations through RAST
show more...
This release includes additional genome data from RefSeq, GenBank, annotations through RAST, minor data corrections, and minor bug fixes/improvements to the web functionality. The following table summarizes the data loaded in the PATRIC database and available through May 27, 2010 PATRIC Website Release.
|
PATRIC |
Legacy BRC |
RefSeq |
| Number of genomes |
2,135 |
410 |
2,577 |
| Number of genomic features |
15,495,850 |
2,324,786 |
16,647,467 |
We are pleased to
announce the release of TriTrypDB 2.2
This release of TriTrypDB contains the following new datasets and
features:
New Data:
Microarray analysis of
three L.
major developmental stages graciously provided
pre-publication by Stephen Beverley's group. Searching this data can be
performed via the microarray
expression search page in TriTrypDB.
Features introduced this year:
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" (new in this release) allows you to bookmark
and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
User comments matter:
There
are over 400 gene-page comments from TriTrypDB users, all of which have
been reviewed by annotators at GeneDB, the Seattle Biomedical Research
Institute and EuPathDB. Updated genes can be accessed via
prominent linkouts to GeneDB from TriTrypDB
gene pages. Comments from you are instrumental to guiding our
annotators. To learn how you can add a user comment please
view
this tutorial.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluator, click
here.
13 May 2010
We are pleased to
announce new features in MicrosporidiaDB 1.0
In an effort to continue improving the utility of MicrosporidiaDB, we have
implemented several new features in recent releases.
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" allows you to bookmark and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluator, click
here.
13 May 2010
We are pleased to
announce new features in CryptoDB 4.3
In an effort to continue improving the utility of CryptoDB, we have
implemented several new features in recent releases.
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
Here is an example of the
same strategy looking for putative secreted enzymes in C. parvum
sporozoites, once
without weighting and then with weighting. Notice that you
have a larger number of results with weighting and those can be sorted
based on the sum of the weights, giving a ranked list of results.
Putative
secreted enzymes (unweighted)
Putative
secreted enzymes (Weighted)
- "My
Favorites" allows you to bookmark and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluator, click
here.
13 May 2010
We are pleased to
announce the release of PlasmoDB 6.4
This release of PlasmoDB contains the following new datasets and
features:
New Data:
- PiggyBac
insertion sites in P.
falciparum (NF54) from Balu et. al.
are available via the genome browser. To view insertion sites
turn on the track called "P. falciparum piggyBac Transposable Elements"
under the heading "Insertions". Here is a quick link to the
genome browser with a region with insertions: piggyBac insertions
in chromosome 11.
- ChIP-chip data using histone H4 antibodies from Westenberger et. al.
are available via the genome browser under the ChIP on chip heading.
You may view tracks for different RBC life stages as in this example.
- Nucleosome positioning data from Ponts et. al. are
available via the genome browser under the ChIP-seq heading.
You
may view tracks for different time points during the RBC life cycle as
in this example.
Features introduced this year:
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" (new in this release) allows you to bookmark
and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
- We now provide prominent linkouts to GeneDB from PlasmoDB
gene pages when there is an update in GeneDB not yet available in
PlasmoDB. This feature enables PlasmoDB users to have easy
access to the most up-to-date information. It is important to
highlight that your comments on gene pages in PlasmoDB constitute one
of the tools made available to the annotators at GeneDB to improve and
enhance the genome annotation.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluator, click
here.
13 May 2010
We are pleased to
announce new features in ToxoDB 6.0
In an effort to continue improving the utility of ToxoDB we have
implemented several new features in recent releases.
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" allows you to bookmark and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluator, click
here.
13 May 2010
We are pleased to
announce new features in AmoebaDB 1.0
In an effort to continue improving the utility of AmoebaDB we have
implemented several new features in recent releases.
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" allows you to bookmark and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluator tester, click
here.
13 May 2010
We are pleased to
announce the release of TrichDB 1.2
This release of TrichDB contains the following new datasets and
features:
New Data:
T. vaginalis
proteomic data graciously provided
pre-publication by the Patricia Johnson laboratory. Searching
this data can be performed via the Mass
spec. evidence search page in TrichDB.
Features introduced this year:
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" (new in this release) allows you to bookmark
and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a a feature evaluator, click
here.
13 May 2010
We are pleased to
announce new features in Giardia 2.1
In an effort to continue improving the utility of GiardiaDB we have
implemented several new features in recent releases.
- "My
Basket" allows one to cherry pick genes and add them to
the basket for subsequent conversion into a step in a strategy.
The basket is accessible from the gray menu bar and the
number of items in your basket is indicated.
- Weighted
searches allows you to assign an arbitrary weight
to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
- "My
Favorites" allows you to bookmark and organize genes you
access frequently. A tutorial on how to use this feature is
available here.
- A community
annotation field has been added to the text
search page. This makes it possible for the Giardia research
community to provide expert bulk annotations that would ultimately
results in changes in the GenBank record and official annotation.
Currently, there are 164 community annotations provided by
Rod Adam concerning VSPs. To view community annotations go to
the text search page in GiardiaDB and limit your search to the
"Community annotation". To view all genes that have a
community annotation associated with them simply enter the wild card
"*" in the text search field. The annotation itself is
prominently displayed on the gene pages right under the product name.
You can also click on the comment ID to retrieve additional
information.
- A file including assignment
of contigs to chromosomes has been made available by
Hilary Morrison pre-publication in the community file section of
GiardiaDB. Links
to the file are available on all contig pages.
Feature evaluators still needed:
We welcome you to *sign
up* to be an early evaluator of new features on the EuPathDB sites.
This will be an opportunity for you to help us refine and improve
features early in the development process. In fact, the "My
Favorites" feature was the subject of our first survey and feature
evaluator's comments and suggestions contributed to enhancing this
feature.
Signing on is easy and will
require minimal effort on your part in the form of answering questions
or rendering your opinion.
To become a feature evaluatora feature evaluatora feature
evaluatora feature evaluator, click
here.
13 May 2010
This release allows users to compare annotations from three different annotation sources, and includes a suite of website enhancements including Genome Browser, Protein Family Sorter, KEGG Pathways, Phylogentic Trees and a Pathway Search.
show more...
Genomes and Annotations
- The current PATRIC release includes RAST annotations for approximately 1,850 of the 2,000 complete bacterial genomes currently available in PATRIC. RAST (Rapid Annotation using Subsystem Technology, Aziz et al., 2008) is a fully-automated service for annotating bacterial and archaeal genomes. It provides consistent and high quality genome annotations across the whole phylogenetic tree. Using a single annotation system across all bacterial genomes not only improves the consistency of annotations but also limits the noise and discrepancies that are difficult to rule out when multiple annotation systems are used.
- This release also allows users to compare annotations from three different sources, including RAST, RefSeq, and legacy BRCs. When available, genomes have annotations from three different sources displayed side-by-side for easy comparison.
- All genomes annotations are available through various pages on PATRIC website, including Taxon Overview, Taxonomy, Genome List, Feature Table, Genome Overview. All of these pages are available as tabs, seen directly under the Taxonomy Overview.
|
PATRIC |
Legacy BRC |
RefSeq |
| Number of genomes |
2,135 |
410 |
2,577 |
| Number of genomic features |
15,495,850 |
2,324,786 |
16,647,467 |
Website Enhancements
Genome Browser
- PATRIC now offers Ajax-based genome browser implemented using JBrowse (Skinner et al, 2009). Access to the browser, via tabs, is available at both the genome and feature levels. Links to the genome browser are also available from the Genome List and the Feature Table.
- The new browser has tracks for genes, CDSs, RNAs and other miscellaneous features. The browser also allows users to compare the three annotation systems (RefSeq, PATRIC/RAST and Legacy BRC) simultaneously. Each is available by a separate track, allowing the user to do a comparative analysis.
Protein Family Sorter
- The current PATRIC release provides functional protein families. The families are created using FIGfams, generated from RAST annotations, and allow the user to look at protein conservation among diverse bacterial orders.
- Protein Family Sorter allows the users to include or exclude genomes, and also to put genomes into a “don’t care” category, offering more diversity in sorting ability. Users can also filter by family description or FIGfam number.
- Information on all families, or on individual members of a specific family, is available by download to either excel or text.
- Multiple sequence alignments are generated when requested. They open in a new window.
Metabolic Pathways
- PATRIC now offers metabolic pathways using KEGG pathway maps. EC numbers that are annotated by RefSeq, PATRIC/RAST or the legacy BRCs are mapped to KEGG pathways. The user can see which annotation system has identified a specific EC on any pathway map allowing a comparative approach.
- Pathways are available at both genome and feature levels.
Phylogenetic Trees
- In this release PATRIC provides Order-level phylogenetic trees for all pathogenic bacteria. These trees show members of the order, and also show details of the parts of the tree that contain the pathogenic genera, which are highlighted in red.
Searches and Tools
- A new Pathway Search allows user to search pathway data using EC numbers, pathway IDs, or pathway names. Search can be narrowed to a group of genomes using the taxonomy tree, or can be open to all bacteria. Users can also filter by annotation source (RefSeq, PATRIC/RAST, legacy BRCs, or all).
- An improved BLAST Search now provides analysis by all flavors (blastn, blastp, blastx, tblastn, tblastx) against the genes and protein sequences from RefSeq, PATRIC/RAST and the legacy BRC annotations. BLAST can also be performed against genome sequences.
We are pleased to announced the full release of Microsporidia 1.0
( http://MicrosporidiaDB.org).
Leveraging
bioinformatics infrastructure developed in the context of the EuPathDB
project (including PlasmoDB, etc), this resource incorporates genome
sequence and annotation from Encephalitozoon cuniculi and E. intestinalis.
Highlights include:
- Sequence and annotation for Encephalitozoon cuniculi and E. intestinalis (graciously
provided prepublication by Patrick Keeling's group)
- Genome
sequence alignments, permitting comparative genomic analysis of synteny
between E. cuniculi and E. intestinalis.
- Expressed
sequence tag data for E.
cuniculi.
- 'User
Comments' may be added to any gene page, enhancing available
annotation. Such comments become immediately visible and searchable by
others.
- Identification
of genes based on text terms, ID #, GO terms, chromosomal
location, BLAST, sequence features (length/MW, exons, signal sequences,
Pfam domains, user-defined motifs), orthology, etc.
Support
for integrated searches, via the 'search strategy builder'.
Strategies can be stored and shared with others by logging
into MicrosporidiaDB.
Example strategies combining multiple searches can be viewed here:
http://microsporidiadb.org/micro/im.do?s=ea825c840a708a03(This
strategy looks for genes whose products contain a transmembrane domain
or a secretion signal peptide that are conserved in Fungi but
absent form animals. The strategy starts by first running an
orthology search followed by asking for any of these that also contain
signal peptides or transmembrane
domains).
http://microsporidiadb.org/micro/im.do?s=fe87e063500d52b2
(The above strategy identifies any gene in E.
cuniculi and E. intestinalis that are annotated as hypothetical (or at
least have the word hypothetical in the product name) that also have GO associations)
***Sign up to be a EuPathDB feature evaluator:
***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, using the feature(s) and rendering your opinion.
We look forward to working with you to develop and enhance MicrosporidiaDB in
the coming years.
1 April 2010
We are pleased to announce the release of AmoebaDB 1.0
We are pleased to announced the full release of AmoebaDB 1.0
( http://amoebadb.org or http://amebadb.org).
Leveraging bioinformatics infrastructure developed in the context of the EuPathDB project (including PlasmoDB, etc), this resource
incorporates genome sequence and annotation from the Pathema project.
Highlights include:
- Sequence and annotation for Entamoeba histolytica, E. dispar and E. invadens.
- Genome sequence alignments, permitting comparative genomic analysis of synteny between Entamoeba species.
- Expressed sequence tag data for E. histolytica and E. dispar.
- 'User Comments' may be added to any gene page, enhancing available annotation.
Such comments become immediately visible and searchable by others.
- Identification of genes based on text terms, ID #, GO terms, EC #, chromosomal location, BLAST,
sequence features (length/MW, exons, signal sequences, Pfam domains, user-defined motifs), orthology, etc.
- Support for integrated searches, via the 'search strategy builder'. Strategies can be stored and shared
with others by logging into AmoebaDB. For an example, open this sample strategy
that combines multiple searches, in your browser.
***Sign up to be a EuPathDB feature evaluator:
***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, testing and rendering your opinion.
15 March 2010
We are pleased to
announce the release of CryptoDB 4.3
This release contains the following new datasets and features:
New Data:
This release of CryptoDB contains new expressed sequence tag (EST) data
for Cryptosporidium parvum KSU-1, normalized sporozoite and pooled
normalized enriched 24-48hr post-infection. EST data can be
searched in many ways, such as by BLAST,
library, extent of gene overlap, etc.
Additionally, genes can be searched based on EST evidence.
New Features:
- The gray tool bar at the top of all CryptoDB web pages has
been enhanced to include a link to the "My Basket" feature including an
indication of how may items are in the basket.
- The genome browser has been updated to version 1.7, which
offers enhanced features and stability. Now you can select a span and send the sequence to ncbi for blasting!
- Weighted searches are now available. This feature
allows you to assign an arbitrary weight to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
***Sign up to be a EuPathDB feature evaluator:
***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, using the feature(s) and rendering your opinion.
15 March 2010
We are pleased to
announce the release of TriTrypDB 2.1
This release contains the following new datasets and features:
New Data:
- Microarray data from of Minning et. al. of T. cruzi life cycle
stages (trypomastigotes, amastigotes, epimastigotes, and metacyclic
trypomastigotes). For additional details please see the publication
http://www.ncbi.nlm.nih.gov/pubmed/19664227
- Microarray data from Queiroz et. al. of T. brucei expression
profile during in vitro differentiation from slender bloodstream forms
to stumpy procyclic forms. For additional details please see the
publication
http://www.ncbi.nlm.nih.gov/pubmed/19857263
- Microarray data from Kabani et. al. of
rodent-derived T.
brucei
pleomorphic slender forms, stumpy forms and at 1h, 6h, 18h, and 48h
through synchronous in vitro differentiation to procyclic forms.
For additional details please see the publication
http://www.ncbi.nlm.nih.gov/pubmed/19747379
- Microarray data from Jensen et. al. of five different T. brucei life
cycle stages. For additional details please see the
publication http://www.ncbi.nlm.nih.gov/pubmed/19840382
- Microarray data from Estevez of T. brucei depleted
of the RNA-binding protein TbDRBD3. For additional details
please see the publication
http://www.ncbi.nlm.nih.gov/pubmed/18611951
- RNA sequence data provided pre-publication by George Cross
of blood and procyclic forms of T.
brucei. Expression profiles based on RNA-seq
data can be viewed in the genome browser, on gene pages or searched
in the
Genes by RNA-Seq page.
New Features:
- The gray tool bar at the top of all TriTrypDB web pages has
been enhanced to include a link to the "My Basket" feature including an
indication of how may items are in the basket.
- The genome browser has been updated to version 1.7, which
offers enhanced features and stability.
- Weighted searches are now available. This feature
allows you to assign an arbitrary weight to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
***Sign up to be a EuPathDB feature evaluator:***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, testing and rendering your opinion.
15 March 2010
We are pleased to
announce new features in GiardiaDB 2.1
New Features:
- The gray tool bar at the top of all GiardiaDB web pages has
been enhanced to include a link to the "My Basket" feature including an
indication of how may items are in the basket.
- The genome browser has been updated to version 1.7, which
offers enhanced features and stability.
- Weighted searches are now available. This feature
allows you to assign an arbitrary weight to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
***Sign up to be a EuPathDB feature evaluator:***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, using the feature(s) and rendering your opinion.
15 March 2010
This December 2009 release includes additional genome data from RefSeq and incumbent bacterial BRCs and minor updates to PATRIC data and website functionality.
show more...
This release includes additional genome data from RefSeq and incumbent bacterial BRCs, minor data corrections, and minor bug fixes/improvements to the web functionality.
Data Enhancements
The following table summarizes the data loaded in the PATRIC database and available through December 30, 2009 PATRIC Website Release.
|
From incumbent BRCs |
From RefSeq |
| Number of genomes |
409 |
2,329 |
| Number of genomic features |
2,297,834 |
14,944,546 |
Website Enhancements
RAST Interface
- PATRIC includes a collaboration with the University of Chicago to provide an end-user genome annotation service using the Rapid Annotation using Subsystem Technology (RAST) system. We have created a lightweight RAST Interface that allows PATRIC users to access RAST directly from the PATRIC website.
We are pleased to
announce the release of ToxoDB 6.0
New Data Content:
This release contains the addition of a new genome assembly and
annotation of Neospora
caninum generated by Adam Reid at the Wellcome
Trust Sanger Institute. The number of annotated genes for N. caninum in this
release increased from 5761 to 7227.
New Features:
This release features the addition of a "My
Basket" tool. This feature allows you to select various genes
from
your searches and add them to your basket (much like a shopping cart).
To add genes to your basket simply click on the basket icon
in a
result list or on a gene page. A green basket icon indicates
that a
genes is already in your basket. You can easily convert your
basket to
a strategy to save it and perform additional searches or combine it
with one of your already existing strategies.
Please note that using the basket feature requires you to
login first.
22 December 2009
We are pleased to
announce the release of TriTrypDB 2.0
This release contains the following new datasets and features:
New Data:
- T. brucei chromatin
immunoprecipitation
sequencing data presented as Genome Browser tracks. Data from
four histone variants (H2AZ, H2BV, H3V, and H4V) can be viewed.
To see an example of this, please click here.
To read more about this data please examine the following publication: Siegel
TN, Hekstra DR, Kemp LE, Figueiredo LM, Lowell JE, Fenyo D, Wang X,
Dewell S, Cross GA. Four histone variants mark the boundaries of
polycistronic transcription units in Trypanosoma brucei.
Genes Dev.
2009 May 1;23(9):1063-76.
- Updated Sequence and assembly for T. brucei 927 and
updated annotation for all other organisms in TriTrypDB.
- The
following new genome sequence and annotation have been incorporated
into TriTrypDB and can be searched and compared to each other:
- Trypanosoma brucei gambiense
- Trypanosoma
congolense
- Trypanosoma
vivax
- Leishmania
mexicana (available as sequence and assembly only,
annotation will be incorporated in the February release of TriTrypDB)
New Features:
- This release features the addition of a "My
Basket" tool. This feature allows you to select various genes
from
your searches and add them to your basket (much like a shopping cart).
To add genes to your basket simply click on the basket icon
in a
result list or on a gene page. A green basket icon indicates
that a
genes is already in your basket. You can easily convert your
basket to
a strategy to save it and perform additional searches or combine it
with one of your already existing strategies.
Please note that using the basket feature requires you to
login first.
- The
multiple sequence alignment tool available from gene pages has been
updated to allow the selection of the organisms to be aligned.
The multiple sequence aligner is available on gene pages
right
under the genomic context view.
- Web services have been implemented to receive weekly updates from GeneDB to provide a more rapid account for updated genes.
22 December 2009
We are pleased to announce the implementation a new tool in
GiardiaDB. The "My
Basket" tool allows you to select various genes from
your searches and add them to your basket (much like a shopping cart).
To add genes to your basket simply click on the basket icon
in a
result list or on a gene page. A green basket icon indicates
that a
genes is already in your basket. You can easily convert your
basket to
a strategy to save it and perform additional searches or combine it
with one of your already existing strategies.
Please note that using the basket feature requires you to
login first.
22 December 2009
We are pleased to
announce the release of PlasmoDB 6.3
This release of PlasmoDB contains the following new
datasets and features:
New Data:
- P.
falciparum transcriptome sequencing (RNA-seq) profiles.
This data was graciously provided to PlasmoDB prepublication
and will soon appear in press: New insights into the blood stage
transcriptome of Plasmodium falciparum using RNA-Seq. Thomas
D. Otto, Daniel Wilinski, Sammy Assefa, Thomas M. Keane, Louis R.
Sarry, Ulrike Bohme, Jacob Lemieux, Bart Barrell, Arnab Pain, Matthew
Berriman, Chris Newbold, Manuel Llinas. Molecular
Microbiology (in press). You can search for genes based on
this data by going to the "P. falciparum transcriptome
sequencing (RNA-seq) profiles" search under "Transcript
Expression" in PlasmoDB.
- P. falciparum
transcription profile of wild-type and Sir2a and Sir2b knock-out
parasites. Using this search you can find genes that are
differentially expressed between wild-type and knock-out parasites
during the intraerythrocytic life-cycle. You can search for
genes based on this data by going to the "Microarray
Evidence" search under "Transcript Expression" in PlasmoDB.
Details of this study can be found here: Tonkin
CJ, Carret CK, Duraisingh MT, Voss TS, Ralph SA, Hommel M, Duffy MF,
Silva LM, Scherf A, Ivens A, Speed TP, Beeson JG, Cowman AF. Sir2
paralogues cooperate to regulate virulence genes and antigenic
variation in Plasmodium falciparum. PLoS Biol. 2009 Apr 14;7(4):e84.
- P.
falicparum HP1 chromatin immunoprecipitation data.
You can search for genes based on this data by going to the "P.f ChIP on chip
evidence" search under "Transcript Expression" in PlasmoDB.
Details of this study can be found here: Flueck C,
Bartfai R, Volz J, Niederwieser I, Salcedo-Amaya AM, Alako BT, Ehlgen
F, Ralph SA, Cowman AF, Bozdech Z, Stunnenberg HG, Voss TS.
Plasmodium falciparum heterochromatin protein 1 marks genomic loci
linked to phenotypic variation of exported virulence factors. PLoS
Pathog. 2009 Sep;5(9):e1000569. Epub 2009 Sep 4.
New Features:
This release features the addition of a "My
Basket" tool. This feature allows you to select various genes
from
your searches and add them to your basket (much like a shopping cart).
To add genes to your basket simply click on the basket icon
in a
result list or on a gene page. A green basket icon indicates
that a
genes is already in your basket. You can easily convert your
basket to a strategy to save it and perform additional searches or
combine it with one of your already existing strategies.
Please note that using the basket feature requires you to
login first.
22 December 2009
In September 2009, the National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), awarded a five-year contract to develop a multi-faceted web-based bioinformatics resource that provides rich data and analysis tools for all bacterial species with an emphasis on the bacterial Orders that include NIAID category A-C priority pathogens.
show more...
In September 2009, the National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), awarded a five-year contract to Virginia Tech’s CyberInfrastructure Group (CIG) to support the biomedical research community’s work on infectious diseases. The funding is being used to develop a multi-faceted web-based bioinformatics resource that provides rich data and analysis tools for all bacterial species with an emphasis on the bacterial Orders that include NIAID category A-C priority pathogens.
Data Enhancements
December 9, 2009 marks our initial release of the PATRIC Website. By design, this release is data-centric, providing the first compiled set of all bacterial genomes, along with their annotations, available from RefSeq and GFF3 files provided by incumbent BRCs from the previous BRC awards. The following table summarizes the data loaded in the PATRIC database and currently available through this December 9, 2009 PATRIC Website release. At this time, we are missing data from a small number of GFF3 files, but are working to make these data available by December 31, 2009.
|
From incumbent BRCs |
From RefSeq |
| Number of genomes |
394 |
2,317 |
| Number of genomic features |
2,157,973 |
14,766,475 |
Website Enhancements
This December 9, 2009 PATRIC website release provides the following functionality:
- Basic Website Navigation: Including Taxonomy Browser, Taxon Overview page, Genome Overview page, Genome/Sequence List, Genomic Feature Table, and Feature Overview page.
- Searches and Tools: Genome Finder and Feature Search Tools allow users to quickly find genomes or features of interest. BLAST Search allows users to quickly search genomic sequences and protein coding genes based on sequence similarity.
- Feature Cart: Allows users to collect the features of interest from multiple pages across the PATRIC website. Once collected, these features can be exported as FASTA DNA or FASTA Protein sequences, or as a Feature Table.
- PubMed Integration: A simple but effective real-time literature retrieval system that quickly identifies publications relevant to a taxon, genome, or gene of interest using PubMed and Entrez Programming Utilities (eUtils) from NCBI and search terms derived from genome metadata and/or functional annotation of a gene/protein. Users can filter results by area of interest (i.e., Countermeasures, Diagnosis, Disease, Epidemiology, or Gene Expression).
- Google Search: Provides an automated list of related web resources as determined by Google search engine; resources are grouped by content category including Google Web, News, Images, Books, Patents, and Video. Google search results are summarized on Taxon, Genome, and Fetaure Overview pages, and direct links are provided to the detailed result pages.
- File Download: Allows users to download genome sequences and annotations as GenBank or GFF3 files.
We are pleased to
announce the release of GiardiaDB 2.0
This is a major release of GiardiaDB that includes several
new data types:
- Sequence and annotation of Giardia assemblage
E isolate P15 was provided by J. Jerlstrom-Hultqvist. O. Franzen, E.
Castro, J. Ankarklev, D. Palm, J. O. Andersson, S.G. Svard and B.
Andersson (Karolinska Institutet, Stockholm, Sweden and Uppsala
University, Uppsala, Sweden). The genome sequence and annotation was
graciously provided to GiardiaDB prepublication.
note: genomic data may be
searched in many ways such as identifying all genes from any of the
assemblages, searching for genes using the text search or identifying
genes from one assemblage based on functional data in another using
orthology. In addition, all gene pages include a genomic
context view highlighting
synteny
between the three assemblages in GiardiaDB.
- Additional stress induction microarray data graciously
provided by Cornelia Spycher and Adrian Hehl prepublication (University
of Z�rich, Switzerland). Genes may be identified based on
their expression profile by running one of the microarray
searches and results may be combined with other searches in
GiardiaDB.
Additional features and Improvements:
- 1012 genes from Giardia lamblia assemblage A isolate WB
have been undeprecated based on synteny to assemblage B and E genomes.
- Representation of SAGE tag data on gene pages has been
improved providing graphical representation of tag counts over the life
cycle of the parasite.
16 November 2009
We are pleased to
announce the release of ToxoDB 5.3
This release contains the addition of Bradyzoite differentiation
expression data from four experiments, most of which have not yet been
published and were graciously provided to the Toxoplasma research
community via ToxoDB.
- 2-day alkaline differentiation time course of T. gondii type II
(Prugniaud) strain provided by Matt Anderson and John Boothroyd.
- 3-day alkaline, CO2 starvation or sodium nitroprusside
differentiation of T.
gondii types I (RH) or II (Prugniaud) strains provided by
Paul Davis and David Roos.
- 14-day CO2 starvation differentiation of T. gondii type II
(Prugniaud) or III (VEG) strains provided by Paul Davis, Florence
Dzierszinski and David Roos.
Genes may be identified based on their expression profile by running
one of the microarray
searches and results may be combined with other searches in
ToxoDB.
16 November 2009
We are pleased to announce the release of TriTrypDB 1.3
This release of TriTrypDB includes a new ChIP-chip experiment identifying regions of active transcription in L. major from Thomas et. al. "Histone acetylations mark origins of polycistronic transcription in Leishmania major". The data are available as Genome Browser tracks. For an example click here.
We anticipate a more significant release for TriTrypDB in late December which will include updated annotations for all the TriTryp genomes as well as three additional kinetoplastid genomes from GeneDB.
14 November 2009
We are pleased toannounce
the release of PlasmoDB 6.2
New
Data:
- P.
falciparum and P. vivax proteomics
from clinical samples graciously provided prior to publication by Dr.
Utpal Tatu (Department of Biochemistry, Indian Institute of Science,
Bangalore, India). This data may be accessed using the "Identify
Genes based on Mass Spec. Evidence" query and combined with
other searches in PlasmoDB.
13 November 2009
We are pleased to
announce the release of EuPathDB 2.1
This is a major release of the EuPathDB resource and includes the
strategy builder, which allows the construction of complex searches
using a graphical interface.
Strategy builder
highlights:
- Transform a search into its orthologs from other species in
the EuPathDB family of databases.
- Edit any step in a strategy and see results change
dynamically.
- Share strategies with your colleagues via email.
- Login to store your history of strategies and browse them
at any time in the future.
A tutorial describing how to build a strategy can be viewed here.
And an example sample strategy can be accessed here.
Sincerely,
The EuPathDB team.
25 September 2009
We are pleased to
announce the release of CryptoDB 4.2
New Data:
- C. parvum
mitochondrial fraction proteomics provided prepublication by Lorenza
putignani.
New/Updated
Features:
- Now after you perform a BlastN against genomic sequence you
can
link from the blast output to the correct coordinates in the genome
browser. This allows you to explore annotation and other
functional
data in relation to your BLAST hits.
- The type ahead feature has been implemented for GO term, EC
number and Interpro domain searches.
- You can now edit or delete your user comments.
- You can now modify your user profile, change your password
or your email address.
Please also note that the parent portal site EuPathDB has just been
updated, offering you the ability to explore results across organisms.
Sincerely,
The EuPathDB team.
25 September 2009
We are pleased to
announce the implementation of new features in TrichDB:
- Identify Genes based on Orthology Phylogenetic
Profile: For example you may ask for genes in T. vaginalis that
do not have orthologs in mammals. Click here to explore this
feature.
- A table of paralogs (if any) is now available on gene pages
allowing you to view expanded gene families -- See TVAG_386080
for example (scroll down to the Orthologs and Paralogs Table).
- Now after you perform a BlastN against genomic sequence you
can link from the blast output to the correct coordinates in the
genome browser. This allows you to explore annotation and
other functional data in relation to your BLAST hits.
- The type ahead feature has been implemented for GO term, EC
number and Interpro domain searches.
- You can now edit or delete your user comments.
- You can now modify your user profile, change your password
or your email address.
We are still collecting data sets for an upcoming release of
TrichDB. This release will include a large expansion in data
content and we look forward to receiving more data sets from you.
Also, check out the new version of EuPathDB.
EuPathDB permits searches across all organism-specific sites and now
includes the strategy system. Build multi-step searches using
a graphical interface that is feature-rich. You can save your
strategies, modify any step in a strategy or even share your strategy
with other scientists.
Sincerely,
The EuPathDB team.
25 September 2009
We are pleased to
announce the release of GiardiaDB 1.3
New Data:
- G.
intestinalis mitosome enriched fraction proteomics
provided by Jan Tachezy and described in Rada et. al.
- Giardia
isolate data from GenBank are now included -- search for isolates based
on many criteria such as geographic location, submitter name, isolation
source and/or host. Also, generate nucleotide sequence
alignments of isolates you select, and BLAST your sequence against the
isolate database.
New/Updated
Features:
- Now after you perform a BlastN against genomic sequence you
can
link from the blast output to the correct coordinates in the genome
browser. This allows you to explore annotation and other
functional
data in relation to your BLAST hits.
- A table of paralogs (if any) is now available on gene pages
allowing you to view expanded gene families -- See GL50803_15158 for
example (scroll down to the Orthologs and Paralogs Table).
- The type ahead feature has been implemented for GO term, EC
number and Interpro domain searches.
- You can now edit or delete your user comments.
- You can now modify your user profile, change your password
or your email address.
Please also note that the parent portal site EuPathDB has just been
updated, offering you the ability to explore results across organisms.
Sincerely,
The EuPathDB team.
25 September 2009
We are pleased to
announce the release of TriTrypDB 1.2
New Data:
- New T. cruzi
genome assembly as described in the publication by Weatherly
and colleagues.
- T. cruzi reservosome
proteomics data (Sant'Anna et. al.).
- T. cruzi insect
form proteomics data (Cordero et. al.).
- L. infantum
proteomics data set from Dr. Marc Ouellette's group (provided
prepublication).
- Genome-wide analysis of mRNA abundance in two life-cycle
stages of T. brucei and
identification of trans-splicing and polyadenylation sites. RNA-seq (T.
Nicolai Siegel, Doeke R. Hekstra, Xuning Wang, Scott Dewell and George
A. M. Cross, provided prepublication) -- this data can be viewed as
tracks in the genome browser, click here for an example.
New/Updated
Features:
- Now after you perform a BlastN against genomic sequence you
can
link from the blast output to the correct coordinates in the genome
browser. This allows you to explore annotation and other
functional
data in relation to your BLAST hits.
- The type ahead feature has been implemented for GO term, EC
number and Interpro domain searches.
- You can now edit or delete your user comments.
- You can now modify your user profile, change your password
or your email address.
Please also note that the parent portal site EuPathDB has just been
updated, offering you the ability to explore results across organisms.
Sincerely,
The EuPathDB and GeneDB teams.
25 September 2009
We are pleased to
announce the release of PlasmoDB 6.1
New Data:
- P.
falciparum intraerythrocytic expression profiles
from infected patients (Lemieux et. al.)
-- data available on gene pages, for example, click here
and scroll down to the expression section.
New/Updated Features:
- Now after you perform a BlastN against genomic sequence you
can
link from the blast output to the correct coordinates in the genome
browser. This allows you to explore annotation and other
functional
data in relation to your BLAST hits.
- Linkouts to Gene Ontology annotation predictions at Plasmodraft
have been updated. (Br�h�lin
et. al.).
- Linkouts to the Malaria Parasite
Metabolic Pathways resource (Hagai Ginsburg).
- The type ahead feature has been implemented for GO term, EC
number and Interpro domain searches.
- You can now edit or delete your user comments.
- You can now modify your user profile, change your password
or your email address.
Please also note that the parent portal site EuPathDB has just been
updated, offering you the ability to explore results across organisms.
Sincerely,
The EuPathDB team.
25 September 2009
We are please to announce the release of CryptoDB 4.1
This release includes new and updated data sets, a new user interface with added functionality and a graphical strategy system for building complex searches.
The new user interface contains many new features including:
- A new search strategy interface that enables users to more easily construct complex searches.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation.
- Files can now be uploaded and provided to the community via the file repository section.
New and updated Data:
- C. parvum KSU-1 normalized sporozoite EST sequences (provided prepublication by Jessica Kissinger).
15 July 2009
We are pleased to announce the release of GiardiaDB 1.2
This release features a new user interface with added functionality and a graphical strategy system for building complex searches.
The new user interface contains many new features including:
- A new search strategy interface that enables users to more easily construct complex searches.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation.
- Files can now be uploaded and provided to the community via the file repository section.
The previous version of GiardiaDB may still be accessed here.
14 July 2009
We are pleased to announce the release of ToxoDB 5.2
This release includes several new and updated data sets (many prepublication), a new user interface with added functionality and a graphical strategy system for building complex searches.
The new user interface contains many new features including:
- A new search strategy interface that enables users to more easily construct complex searches.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation.
- Files can now be uploaded and provided to the community via the file repository section.
New and updated Data:
- ChIP-chip centromere identification (Prepublication from Mathieu Gissot, Carrie F. Brooks, Boris Striepen, Kami Kim).
- Isolate data typed based on sequence (from Genbank and David Sibley) or RFLPs (from Chunlei Su).
- Whole genome sequence of the natural recombinant Toxoplasma gondii strain TgCkUg2 and SNP calls (Bontell et. al.).
The previous version of ToxoDB may still be accessed here.
14 July 2009
We are pleased to announce the release of TrichDB 1.1
This release features a new user interface with added functionality and a graphical strategy system for building complex searches.
The new user interface contains many new features including:
- A new search strategy interface that enables users to more easily construct complex searches.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation.
- Files can now be uploaded and provided to the community via the file repository section.
The previous version of TrichDB may still be accessed here.
14 July 2009
We are pleased to announce the release TriTrypDB 1.1
Highlights include:
- The L. tarentolae genome has been provided by the CIHR Group on host pathogen interactions and is available for similarity searches and browsing for open reading frames and BLASTX alignments against the NR protein database.
- A microarray experiment examining the axenic promastigote-to-amastigote differentiation of L. donovani was provided by researchers at the Seattle Biomedical Research Institute and Technion Institute and is available for query and display on record pages.
- A new search strategy interface that enables users to more easily construct complex searches is released for the first time in TriTrypDB 1.1.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation. Such comments are visible and searchable by others, and are automatically forwarded to the annotation team for incorporation into the official annotation
- Files can now be uploaded and provided to the community via TriTrypDB.
Please click on the 'Contact Us' link at the bottom of each page to indicate anything that might help to make the site better: bugs, help requests, features that are unclear or take too long to run, suggestions for additional tools or datasets that might be useful to incorporate, etc. Where appropriate, please also add to the 'User Comments' on each gene page, noting inaccurate (or accurate) gene models, phenotype information, expression data, clone availability, etc.
The previous version of TriTrypDB may still be accessed here.
14 July 2009
The EuPathDB team is happy to announce the release of PlasmoDB 6.0.
This release includes several new and updated data sets (many prepublication), a new user interface with added functionality and a graphical strategy system for building complex searches.
The new user interface contains many new features including:
- A new search strategy interface that enables users to more easily construct complex searches.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation.
- Files can now be uploaded and provided to the community via the file repository section.
New and updated Data:
- P. falciparum reannotation integrated as the official annotation in PlasmoDB (See news item dated 1 February 2008 for details on the reannotation effort).
- P. falciparum High density genotyping array (HD Array) from the Broad Institute (Unpublished).
- P. vivax intraerythrocytic microarray life-cycle (Bozdech et. al.).
- P. vivax microarray from patient samples (Westenberger and Winzeler, unpublished).
- P. chabaudi new 8X assembly and reannotation (Pathogen Sequencing Unit at the Wellcome Trust Sanger Institute, unpublished).
- P. yoelii published gene calls appear as provisional genes in the genome browser (Vaughan et. al.).
- P. yoelii oocyst and salivary gland microarray data (Zhou et. al.).
- Isolate data from GenBank is now integrated and searchable.
Additional features:
14 July 2009
We are pleased to announce the release of ToxoDB 5.1.
This release features:
- New Chip-ChIP data performed on RH parasites (Bougdour, Maubon, Baldacci, Ortet, Bastien, Bouillon, Barale, Pelloux, Menard, Hakimi. Drug-inhibition of HDAC3 and epigenetic control of differentiation in Apicomplexa parasites. J. Exp. Med. 2009;206(4):953-66).
- Updated proteomics data from the Einstein Biodefense Proteomics Research Center.
- Enhanced user comment form that among other things allows linking PubMed records, linking multiple genes to the same comment and uploading images.
24 April 2009
We are pleased to announce the availability of a beta-release version of TriTrypDB 1.1, accessible at http://beta.TriTrypDB.org. Feel free to forward this link to others. Highlights include:
- The L. tarentolae genome has been provided by the CIHR Group on host pathogen interactions and is available for similarity searches and browsing for open reading frames and BLASTX alignments against the NR protein database.
- A microarray experiment examining the axenic promastigote-to-amastigote differentiation of L. donovani was provided by researchers at the Seattle Biomedical Research Institute and Technion Institute and is available for query and display on record pages.
- A new search strategy interface that enables users to more easily construct complex searches is released for the first time in TriTrypDB 1.1.
- 'User Comments' have been updated to enable submission of supporting evidence including pubmed identifiers and images (or other files). Comments may be added to any gene or sequence page, enhancing available annotation. Such comments are visible and searchable by others, and are automatically forwarded to the annotation team for incorporation into the official annotation
- Files can now be uploaded and provided to the community via TriTrypDB.
By providing access to TriTrypDB 1.1 prior to its official release, we hope to obtain valuable feedback to facilitate the debugging process, in addition to making new data and functionality available to the community as early as possible. Please click on the 'Contact Us' link at the bottom of each page to indicate anything that might help to make the site better: bugs, help requests, features that are unclear or take too long to run, suggestions for additional tools or datasets that might be useful to incorporate, etc. Where appropriate, please also add to the 'User Comments' on each gene page, noting inaccurate (or accurate) gene models, phenotype information, expression data, clone availability, etc.
24 April 2009
We are please to announce the release of CryptoDB 4.0, which features
several new data sets (including many that were graciously provided
prepublication to CryptoDB) and enhancements.
New data data sets:
- The fully sequenced and annotated genome of Cryptosporidium muris.
- More Isolate data -- CryptoDB now includes 2290 isolate records.
- New proteomics data for C. parvum.
New features and enhancements:
- Improved text search capability providing scored results.
- Improved result summary pages that also include results filtered against all Cryptosporidium in CryptoDB.
- A dramatically revamped isolate query section. In particular, this includes a vastly improved organization of isolates to allow more useful queries. Our Geographic location query now includes a map to allow visualization of isolate location by country or continent.
- Blast can now be performed against reference isolates only which should facilitate typing.
- Reference RFLP gel images are now made available via CryptoDB (a link from the isolate section).
- We now provide a phylogenetic tree view of multiple sequence alignments from isolate sequence that you choose.
- Several additional queries have been implemented including queries for secondary structure (based on sequences in the Protein Data Bank or PSIPRED), isoelectric point and available antibodies (in the reagents section).
*We are grateful to several investigators who have provided data and advice for this release of CryptoDB including:
- Belinda Ferrari and Quach Truong at the Department of Biological Sciences, Macquarie University, Sydney, Australia.
- Joseph M. Dybas, Carlos J. Madrid-Aliste, Fa-Yun Che, Edward Nieves, Louis M. Weiss, Kami Kim, Andr�s Fiser and Ruth Hogue Angeletti
at the Einstein Biodefense Proteomics Research Center, Albert Einstein College of Medicine, Bronx, NY, USA
- Lihua Xiao at the Centers for Disease Control in Atlanta, Georgia, USA
- Lis Caler and Hernan A. Lorenzi at the J. Craig Venter Institute, Rockville, MD, USA
- Guy Robinson at the UK Cryptosporidium Reference Unit, Swansea, UK
6 February 2009
We are pleased to announce the first release of TriTrypDB 1.0, a new bioinformatics resource for kinetoplastid parasites.
Leveraging bioinformatics infrastructure developed in the context of the EuPathDB project (including PlasmoDB, etc), this resource incorporates genome sequence and annotation from the GeneDB project, along with additional datasets generously provided by many research laboratories.
Highlights include:
- Sequence and annotation for Leishmania braziliensis, L. infantum, L. major, Trypanosoma brucei and Trypanosoma cruzi.
- Genome sequence alignments, permitting comparative genomic analysis of synteny (T. cruzi scaffolds assembled into virtual chromosomes, distinguishing 'Esmeraldo' and 'non-Esmeraldo' haplotypes).
- Expression data, including ESTs and proteomics evidence for L. infantum, T. brucei and T. cruzi (from several laboratories, much of it made available prior to publication).
- Phenotypic information, including manually curated RNAi knock-down phenotypes for T. brucei from TrypanoFAN and the published literature, aiding functional analysis and the validation of essential targets.
- 'User Comments' may be added to any gene page, enhancing available annotation. Such comments are visible and searchable by others, and will be automatically forwarded to the annotation team.
- Identification of genes based on text terms, ID #, GO terms, EC #, chromosomal location, BLAST, sequence features (length/MW, introns, signal sequences, Pfam domains, user-defined motifs), structure, epitopes, orthology, functional information, etc.
- Support for integrated searches, via 'Query History', such as:
- "Find all T. brucei genes for which siRNA-mediated knock-downs yield a lethal phenotype"
- "Find L. major kinases without human orthologs that are supported by expression evidence in any kinetoplastid parasite"
Look for TriTrypDB at the Kinetoplastid Molecular Cell Biology Meeting in Woods Hole, MA (26-29 April 2009; registration deadline 6 March).
This integrated kinetoplastid bioinformatics resource has been made possible by support from NIAID (EuPathDB), the Wellcome Trust (GeneDB), and the Bill & Melinda Gates Foundation (TriTrypDB).
6 February 2009
We are pleased to announce the availability of a beta-release version of TriTrypDB, accessible at http://beta.TriTrypDB.org. Feel free to forward this link to others. Highlights include:
‣ Sequence and annotation for Leishmania braziliensis, L. infantum, L. major, Trypanosoma brucei and T. cruzi. All current annotations from GeneDB have been incorporated into TriTrypDB 1.0.
‣ Genes may be queried based on gene IDs, text terms, EC numbers, GO terms, sequence features (length, introns, signal sequences, Pfam domains), etc.
‣ Genes and sequences may also be interrogated based on BLAST similarity, user-defined motifs, etc.
‣ T. cruzi scaffolds have been assembled into virtual chromosomes, distinguishing the 'Esmeraldo' and 'non-Esmeraldo' haplotypes.
‣ Alignments between all available genome sequences, permitting comparative genomic analysis of synteny and convenient downloads of aligned sequences.
‣ Expression data, including Expressed Sequence Tag (EST) evidence, and proteomics evidence for L. infantum, T. brucei and T. cruzi (from several laboratories, much of it kindly made available prior to publication).
‣ Phenotypic information from GeneDB, including manually curated RNAi knock-down phenotypes for T. brucei from TrypanoFAN and the published literature, aiding functional analysis and the validation of essential targets.
‣ Identification of orthologs, permiting inferences for one species based on evidence from others. For example you might want to identify L. major genes that are orthologs of essential T. brucei kinases, or T. cruzi proteins supported by expression evidence in any kinetoplastid species.
‣ 'User Comments' may be added to any gene page, enhancing available annotation. Such comments are visible and searchable by others, and will be automatically forwarded to the annotation team.
By providing access to TriTrypDB prior to its official release, we hope to obtain valuable feedback to facilitate the debugging process, in addition to making this resource available to the community as early as possible. Please click on the 'Contact Us' link at the bottom of each page to indicate anything that might help to make the site better: bugs, help requests, features that are unclear or take too long to run, suggestions for additional tools or datasets that might be useful to incorporate, etc. Where appropriate, please also add to the 'User Comments' on each gene page, noting inaccurate (or accurate) gene models, expression data, clone availability, etc.
23 January 2009
We are pleased to announce the release of the fifth snapshot of the reannotation of the P. falciparum 3D7 genome.
The effort to systematically update the annotation for P. falciparum 3D7 genome started in October 2007 with a weeklong workshop co-organized by staff from the Wellcome Trust Sanger Institute (WTSI) and the EuPathDB team. The Wellcome Trust and Burroughs Wellcome Fund-sponsored workshop was held in Hinxton, UK and was attended by nearly 30 Plasmodium researchers contributing expertise in various aspects of Plasmodium biology, plus more than 10 bioinformatics support staff from the Artemis, GeneDB and EuPathDB teams. Ongoing annotation and error checking is being carried out by the GeneDB group from WTSI.
For further information please refer to the February 1st, 2008 news item (below).
18 November 2008
We are pleased to announce an update to GiardiaDB 1.1, which mainly affects the number of deprecated genes available through GiardiaDB and how query results are displayed.
1. In this release, all genes that were available through the genome project database (MBL GiardiaDB) are now included in GiardiaDB. Hence, there are 4778 deprecated genes in this release.
2. When performing a query, you no longer select whether to query genes only, deprecated genes only, or both. Instead, when a query is performed, all results will be returned with the option to view (or filter) all results, genes only or deprecated genes only (?genes only? is the default view of the results).
As a reminder, deprecated genes were labeled as such because they appear unlikely to represent true genes, based on incompatibility with longer gene models or alternative models for which functional evidence is available. A number of deprecated genes may in fact be true genes and as evidence becomes available to support this, we will elevate a deprecated gene to a "full" gene status. We encourage users to notify us of such evidence by email or by adding specific user comments gene pages.
18 November 2008
The EuPathDB team is happy to announce the release of PlasmoDB 5.5.
New Data:
- Promoter Prediction:
- Prediction of transcription start sites in P. falciparum provided prepublication and available on gbrowse (Brick K, Watanabe J, and Pizzi E).
Improvements:
- Documentation related to data types has been improved.
- Gbrowse tracks have been organized and help associated with gbrowse items have been improved.
16 September 2008
PlasmoDB has released the fourth snapshot of the reannotation of the P. falciparum 3D7 genome. The reannotation presented in PlasmoDB is a work in progress and intended to inform you about the new annotation and elicit any feedback that you may have. Please add user comments to any genes for which you have evidence that supports or refutes the new annotation. In the previous snapshot, many product names for re-annotated genes were accidently reverted back to the original name. This new snapshot has the correct re-annotated product names.
15 July 2008
PlasmoDB has released the third snapshot of the reannotation of the P. falciparum 3D7 genome. The reannotation presented in PlasmoDB is a work in progress and intended to inform you about the new annotation and elicit any feedback that you may have. Please add user comments to any genes for which you have evidence that supports or refutes the new annotation. In the previous snapshot, many product names for re-annotated genes were accidently reverted back to the original name. This new snapshot has the correct re-annotated product names.
6 June 2008
The effort to systematically update the annotation for P. falciparum 3D7 genome started in October 2007 with a weeklong workshop co-organized by staff from the Wellcome Trust Sanger Institute (WTSI) and the ApiDB team. The Wellcome Trust and Burroughs Wellcome Fund-sponsored workshop was held in Hinxton, UK and was attended by nearly 30 Plasmodium researchers contributing expertise in various aspects of Plasmodium biology, plus more than 10 bioinformatics support staff from the Artemis, GeneDB and ApiDB teams. During the workshop 2700 genes were edited using Artemis and ACT software, mainly with the addition of new or alternative functional descriptions (e.g. product names, GO terms, EC numbers and miscellaneous textual comments).
After the workshop, intensive manual reannotation of the genome has continued at WTSI with the help of several members of the community. First, the structures of more than 700 gene models have been changed and new gene models have been added. In addition, curation staff at GeneDB have reviewed all User Comments submitted to PlasmoDB. Over the coming months, the emphasis will switch to a careful genome-wide review of functional annotation.
The annotation presented here in PlasmoDB is the second snapshot of a work in progress and intended to inform you about the new annotation and elicit any feedback that you may have. Please add a User Comment to any gene about which you have additional information. This information is forwarded to the GeneDB curators on a weekly basis and will be included in the next snapshot. In order to expedite the annotation, the following are of particular value to the manual curators:
- New product names with the PubMed ID of the paper in which they were first cited,
- Additional PubMed IDs of any papers from which the function of the gene/protein can be ascribed
- EC number
- Alternative/updated gene names and aliases.
- Evidence that a gene structure has been incorrectly called. Please include a description of the evidence that you have used (e.g. full length cDNA) and the new sequence.
The ongoing reannotation is being systematically carried out by Ulrike Böhme and Bart Barrell (Wellcome Trust Sanger Institute) with extensive help from several community volunteers, in particular Chris Newbold (Oxford, UK), Tom Templeton (Cornell, US), Eric Maréchal (Grenoble, France). For the annotation, the Artemis software was redesigned by Tim Carver and changes are recorded within a Chado database (GeneDB) designed and maintained by Adrian Tivey and Chinmay Patel.
Workshop participants included
- Oliver Bilker (Imperial Coll, London)
- Serge Bonnefoy (Inst Pasteur, Paris)
- Pete Bull (Kenya Med Res Inst, Kilifi)
- Jane Carlton (New York Univ)
- Brendan Crabb (Walter & Eliza Hall Inst, Melbourne)
- Christian Doerig (Univ Glasgow)
- Malcolm Gardner (Seattle Biomedical Res Inst)
- Hagai Ginsburg (Hebrew Univ, Jerusalem)
- George Githinji (MOMS, Tanzania)
- Taco Kooij (Heidelberg Univ)
- Dominic Kwiatkowski (Oxford Univ & WTSI)
- Sue Kyes (Oxford Univ)
- Thomas Lavstsen (Copenhagen)
- Manuel Llinas (Princeton Univ)
- Eric Marechal (CNRS, Grenoble)
- Dan Milner (Harvard Univ)
- Fingani Mphande (Karolinska Inst)
- Dan Neafsey (Broad Inst)
- Chris Newbold (Oxford Univ)
- Stuart Ralph (Melbourne Univ)
- Gowthaman Ramasamy (Seattle Biomedical Res Inst)
- Robert Sinden (Imperial Coll, London)
- Worachart Sirawaraporn (Mahidol Univ, Bangkok)
- Dominique Soldati (Univ Geneva)
- Tim Stedman (MR4 / ATCC)
- Tom Templeton (Cornell Univ, New York)
- Akhil Vaidya (Drexel Univ, Philadelphia)
- Scott Westenberger (Scripps Inst, La Jolla)
- Jennifer Wortman (Univ Maryland, Baltimore)
Sponsors and Facilitators included:
- Andy Berry, Ulrike Boehme, Celine Carret, Tim Carver, Al Ivens, Arnab Pain, Adrian Tivey & Matt Berriman (GeneDB / Wellcome Trust Sanger Inst)
- Brian Brunk, Zhongqiang Chen, Mark Heiges, Lucia Peixoto, Dhanasekaran Shanmugam & David Roos (ApiDB / Univ Georgia & Univ Pennsylvania)
19 May 2008
We are pleased to announce that GiardiaDB is now available at www.giardiadb.org. GiardiaDB is now part of ApiDB/EuPathDB ( www.eupathdb.org) and funded by NIH's BRC program.
Please send a message to the support team from the link at the bottom of the GiardiaDB web page ( Contact Us!).
GiardiaDB 1.1 provides the genome and annotation for the the eukaryotic parasite Giardia lamblia ATCC50803, WBC6 isolate, generated at the MBL and just reported in Science (Science 28 September 2007, Volume 317, pp. 1921-1926).
New data available in this release includes:
- Genes which were subsumed or largely overlapping with longer genes or genes with more evidence were marked as deprecated and all gene queries allow users to choose whether to query for these genes or not.
- Ten SAGE tag libraries are now available with queries and record pages.
- Microarry data from experiments examining stress induction in trophozoites.
- Proteomics data from trophozoites.
- All locations are reported on scaffolds but also available in contig coordinates.
GiardiaDB, like the other ApiDB/EuPathDB sites, offers researchers a plethora of tools for genomic data mining. In addition to BLAST and pattern/motif similarity searches, users can identify G. lamblia genes by genomic position; common name or keyword; gene attributes such as gene type and exon number; transcript expression evidence (ESTs and SAGE tags); physical characteristics of predicted proteins such as molecular weight and secondary structure; predicted function from Gene Ontology and EC enzyme number annotation; and cellular location. In addition, GiardiaDB contains pre-computed phylogenetic trees for proteins with homologues in other eukaryotic genomes.
Regards,
The GiardiaDB team
12 May 2008
The ApiDB team is pleased to announce the release of CryptoDB 3.7
New features include:
- records for 756 Cryptosporidium isolates. Queries are available for searching isolate records by ID,
host of origin, isolate species, and several other characteristics.
For the full set of search options, see the "Identify Isolates by:"
category on the web site's front page. Multiple isolate sequences can
be readily aligned with ClustalW from the query result page.
- also included are 27,498 C. muris oocyst ESTs aligned to the C. muris
draft genome.
- alignments of contigs of C. parvum, C. muris and C. hominis have been
remade using the Mercator orthology mapping program and the MAVID
multiple global alignment program. These alignments are available in
the Genome Browser.
19 February 2008
The ApiDB team is pleased to announce the release of ApiDB 3.2
This release accesses the latest releases of CryptoDB and PlasmoDB, offering the new Isolate queries in CryptoDB and updates after the on-going re-annotation of Plasmodium falciparum.
19 February 2008
The effort to re-annotate the falciparum genome was started in October at a weeklong workshop co-organized by ApiDB and GeneDB staff. The workshop was hosted at the Welcome Trust Sanger Institute in Hinxton, UK and was attended by nearly 30 Plasmodium researchers contributing expertise in various aspects of Plasmodium biology and more than 10 bioinformatics support staff from the GeneDB and ApiDB teams. More than 2,700 genes have been reviewed to date and most modified in some fashion; primarily refining exon boundaries, assigning product names, assigning GO functions and EC numbers and making comments / notes. The annotation process is continuing in the attempt to review all genes at least twice (by different annotators). The annotation presented here in PlasmoDB is thus a work in progress and intended to inform you, the community, about the new annotation and elicit any feedback that you may have. Please add a user comment to any gene that you have additional information about. This includes supporting data corroborating gene annotation as well as data that indicates the gene model, product name, functional assignments etc are incorrect. These comments will be included in the annotation which we hope will be concluded in mid-march.
Workshop participants included
- Oliver Bilker (Imperial Coll, London)
- Serge Bonnefoy (Inst Pasteur, Paris)
- Pete Bull (Kenya Med Res Inst, Kilifi)
- Jane Carlton (New York Univ)
- Brendan Crabb (Walter & Eliza Hall Inst, Melbourne)
- Christian Doerig (Univ Glasgow)
- Malcolm Gardner (Seattle Biomedical Res Inst)
- Hagai Ginsburg (Hebrew Univ, Jerusalem)
- George Githinji (MOMS, Tanzania)
- Taco Kooij (Heidelberg Univ)
- Dominic Kwiatkowski (Oxford Univ & WTSI)
- Sue Kyes (Oxford Univ)
- Thomas Lavstsen (Copenhagen)
- Manuel Llinas (Princeton Univ)
- Eric Marechal (CNRS, Grenoble)
- Dan Milner (Harvard Univ)
- Fingani Mphande (Karolinska Inst)
- Dan Neafsey (Broad Inst)
- Chris Newbold (Oxford Univ)
- Stuart Ralph (Melbourne Univ)
- Gowthaman Ramasamy (Seattle Biomedical Res Inst)
- Robert Sinden (Imperial Coll, London)
- Worachart Sirawaraporn (Mahidol Univ, Bangkok)
- Dominique Soldati (Univ Geneva)
- Tim Stedman (MR4 / ATCC)
- Tom Templeton (Cornell Univ, New York)
- Akhil Vaidya (Drexel Univ, Philadelphia)
- Scott Westenberger (Scripps Inst, La Jolla)
- Jennifer Wortman (Univ Maryland, Baltimore)
Sponsors and Facilitators included:
- Andy Berry, Ulrike Boehme, Celine Carret, Tim Carver, Al Ivens, Arnab Pain, Adrian Tivey & Matt Berriman (GeneDB / Wellcome Trust Sanger Inst)
- Brian Brunk, Zhongqiang Chen, Mark Heiges, Lucia Peixoto, Dhanasekaran Shanmugam & David Roos (ApiDB / Univ Georgia & Univ Pennsylvania)
1 February 2008
We are pleased to announce this new release of the ApiDB.org Website and Database, Version 3.1.
New Features/Data
- All our queries offer the same functionality as our component sites: Crypto 3.6, Plasmo 5.4 and Toxo 4.3 recently released with new data.
- All BLAST queries have been consolidated to a single interface, the same as in the component sites.
- More convenient access to help ... please click the "Ask us a Question" link on the front page sidebar, or the "Contact Us" at the top and bottom of every page to report problems or suggest improvements to the database.
Please notice that our BLAST tool has been consolidated to a single interface. We provide the same (updated) datasets as in the previous release: all Crypto, Plasmo and Toxo datasets plus the Theileria annulata and Theileria parva genomes, all ESTs from all apicomplexa, and the available clustered ESTs from apidots4.0 and the most recent Crypto and Toxo assemblies. Please provide your feedback so we can continue improving ApiDB.
ApiDB release 3.1 keeps improving its current web services architecture that allows us to offer all queries available at the component sites ( CryptoDB.org, PlasmoDB.org and ToxoDB.org). Our front page includes our updated Genome Statistics Table, Gene Metrics table and all available queries; thus, your favorite query is only one click away.
What is ApiDB?
ApiDB is a portal that allows you to query simultaneously different apicomplexan organisms in the Cryptosporidium, Plasmodium and Toxoplasma genera. ApiDB's added value is:
- The ability to blast all ApiDB organisms, plus Theileria annulata and parva genomes, all apicomplexan ESTs and their available clustered ESTs
- The ability to combine query results from different ApiDB organisms
- The ability to search for orthologous/paralogous genes across all ApiDB organisms
12 December 2007
The ApiDB team is pleased to announce the release of ToxoDB 4.3
New features include:
- Gene ID queries allow users to query with the identifiers from the predicted gene models.
- Graphic of the alternate gene predictions is now on the gene record page.
- View and download multiple sequence alignments between the sequenced strains with polymorphisms highlighted.
- Clustered and assembled ESTs are now available from the EST queries / pages and aligned to the genome (GBrowse).
- Additional Mass Spec data has been loaded.
- All BLAST queries have been consolidated to a single interface.
- Faster rendering of gene pages and GBrowse.
- Querying with a single identifier goes directly to the gene page rather than the summary results page.
- More convenient access to help ... please click the "Ask us a Question" link on the left of every page, or the "Contact Us" at bottom to report problems or suggest improvements to the database.
Best regards,
Your ApiDB project team
6 December 2007
The ApiDB team is pleased to announce the release of CryptoDB 3.6
New features include:
- SNP data from C. parvum IOWA, MD, TU114 strains
- View and download multiple sequence alignments between parvum, hominis and muris with polymorphisms highlighted.
- More than 10,000 ESTs from the Full-Cryptosporidium cDNA Database have been added
- Additional Mass Spec data has been loaded (will have better descriptions of samples before release).
- All BLAST queries consolidated to a single interface.
- Querying with a single identifier goes directly to the gene page rather than the summary results page.
6 December 2007
The ApiDB team is pleased to announce the release of PlasmoDB_5.4.
New data include:
- A slightly modified reference genome for P. falciparum (several gaps have been closed on chr10 & 11).
- P. berghei gametocyte proteomics data (Khan et al, Cell 121:675-87).
- Many additional P. falciparum SNPs (unpublished data from the Broad Inst).
- Additional ESTs (Watanabe et al, Nucl Acids Res 32:D334-8; Lu et al, BMC Genomics 8:255; Florent et al, Mol.Biochem.Parasitol. 135:143-8)
- Expression profiling data for antigenic and adherent variants of P. falciparum 3D7 (Mok et al, Molec Biochem Parasitol 151:184-92).
- User comments submitted prior to June 2007 have now been incorporated into the official annotation.
- Sequence assemblies for 11 falciparum strains from the Broad Institute have just been released and are available on the PlasmoDB download site.
New features include:
- Faster loading of Gene and Genome Browser pages.
- Improved synteny views in the Genome Browser.
- Browser views of rodent malaria genomes colored to indicate chromosomes (Kooij et al, PLoS Pathog 1:e44).
- Gene page links to various external data sources (including PlasmoMAP, TDRtargets, UCSC P. falciparum genome browser, Ontology-based Pattern Identification and literature databases).
- More convenient access to help ... please click the "Ask us a Question" link on the left of every page, or the "Contact Us" at bottom to report problems or suggest improvements to the database.
Note that a mechanism has now been established to ensure that User Comments on genes or sequences are incorporated into the official annotation in a timely manner. In response to a banner posted on PlasmoDB and e-mail to malaria researchers, several hundred new User Comments were entered by the community in advance of a P. falciparum genome reannotation workshop jointly sponsored by the Wellcome Trust Sanger Inst and the PlasmoDB team during the week of Oct 21!
There is still time to enter additional User Comments while the updated annotation is being reviewed in preparation for deposition in GenBank and release on PlasmoDB and GeneDB in early 2008. Please use the User Comments feature as a repository for valuable information on:
- Naming, e.g. "Apical membrane antigen 1 is also known as AMA1, AMA-1, Pf83, Rhoptry Membrane Antigen, RMA1" or "this gene is now recognizable as XXX based on Pfam domain ###."
- Gene models, e.g. "Corrected cDNA sequence for gene XXX: <sequence>, based on RACE and primer extension for 5' UTR and polyA-tailed cDNA clone for 3' UTR" or "RT-PCR confirms annotated CDS sequence as correct." Note that confirmation of existing annotation can be as valuable as corrections!
- Function, e.g. "Enzymatic activity for gene XXX has been confirmed" or "Protein XXX is predicted to target to the apicoplast, but an epitope-tagged construct shows cytoplasmic localization." Be sure to provide name and/or PubMedID if available.
For data that is not readily accommodated in this form, or datasets too extensive to enter as individual User Comments (expression profiling studies, systematic evaluation of gene models by RT-PCR, etc), please contact support@PlasmoDB.org.
Best regards,
Your ApiDB project team
31 October 2007
We are pleased to announce that GiardiaDB is now available at www.giardiadb.org. GiardiaDB is now part of ApiDB ( www.apidb.org) and funded by NIH's BRC program.
The previous GiardiaDB, hosted by the MBL's Bay Paul Center, will remain available until the end of 2007. During this transition period, we ask that you use www.giardiadb.org and notify us if there is a feature, tool, or query that you need that is not available at the new site. Please send a message to the support team from the link at the bottom of the GiardiaDB web page ( Contact Us!).
GiardiaDB 1.0 provides the genome and annotation for the the eukaryotic parasite Giardia lamblia ATCC50803, WBC6 isolate, generated at the MBL and just reported in Science (Science 28 September 2007, Volume 317, pp. 1921-1926). Also provided are expression data from EST and SAGE surveys.
GiardiaDB, like the other ApiDB sites, offers researchers a plethora of tools for genomic data mining. In addition to BLAST and pattern/motif similarity searches, users can identify G. lamblia genes by genomic position; common name or keyword; gene attributes such as gene type and exon number; transcript expression evidence (ESTs and SAGE tags); physical characteristics of predicted proteins such as molecular weight and secondary structure; predicted function from Gene Ontology and EC enzyme number annotation; and cellular location. In addition, GiardiaDB contains pre-computed phylogenetic trees for proteins with homologues in other eukaryotic genomes.
Regards,
The GiardiaDB team
1 October 2007
PlasmoDB 5.3 provides the genome and annotation for the primate malarial parasite Plasmodium knowlesi, provided by the Wellcome Trust Sanger Institute Pathogen Sequencing Unit. This brings the number of Plasmodium genomes available for queries at PlasmoDB up to six. Also provided are expression data from recent papers using gene knockouts to study molecular mechanisms in human ( P. falciparum) and rodent ( P. berghei) malarial parasites. Queries have been developed for the new expression studies and a summary graphic (P.f.) or table (P.b.) added to the gene pages.
The PlasmoDB 5.3 web site also has significant improvements to the query and results pages. Many of the queries have been reorganized. For example, the Predicted Functional Interaction query now allows input of a list of genes and a clearer explanation of what to get from the query (e.g., find high confidence predicted interactions between members of the input gene list). The results from this and all queries can be sorted. It is also now possible to add columns of interest (e.g., protein features, GO annotation, even expression characteristics) and sort on them.
New data and queries in PlasmoDB 5.3:
- Genome sequence and annotation for Plasmodium knowlesi from the Wellcome Trust Sanger Institute Pathogen Sequencing Unit. P. knowlesi is included in many of the queries to identify genes, genomics sequences, ESTs, and ORFs by various features.
- Microarray studies have been incorporated from a series of 3 papers on gene silencing (Duraisingh et al. Cell 2005) and invasion pathways (Baum et al. PLoS Pathogens 2005; Stubbs et al. Science 2005) in P. falciparum.
- A microarray study in P. berghei from a paper on translational repression in gametocytes (Mair et al. Science 2006).
- A refined mapping of epitopes to Plasmodium provided by the Immune Epitope Database and Analysis Resource. A query is now provided to identify genes whose encoding protein has an epitope at different levels of confidence.
- Updated E.C. number and metabolic pathway assignments, Interpro protein domains and associated GO predictions, MR4 reagent links, homologies to entries in the NRDB protein database and links to stuctures in PDB.
- A new build of ortholog groups from Orthomcl for use in PlasmoDB. The list of genomes represented has increased from 55 to 87 and includes all the organisms covered by the different NIH/NIAID funded Bioinformatics Resource Centers.
New features in PlasmoDB 5.3:
- Most of the queries have been reorganized to make them more intuitive. Some include a new "Advanced Parameters" section. See for example "Identify Genes based on EST evidence."
- The columns displayed in query result pages are improved. The result pages also offer options for sorting, adding new columns, and moving columns around.
- Downloading results is improved. A choice of formats is provided including tab-delimited, text, GFF, and FASTA to meet different needs.
- Tab delimited provides an expanded list of annotations that can be directly exported as an Excel spreadsheet.
- Text provides a detailed summary for each gene including SNPs, Y2H interactions, metabolic pathways, MR4 reagents, protein structures, notes, sequence, and many other choices.
- GFF is a format recognized by many applications such as the genome browser, gbrowse.
- FASTA also provides a common format used for BLAST and other applications. The FASTA option provides the functionality of the SRT (sequence retrieval tool) found in earlier versions of PlasmoDB.
19 June 2007
New data and queries in CryptoDB 3.5:
- Proteomics data provided by two labs (Wastling, et al. (updated from CryptoDB 3.4) and Snelling, et al. (new data)) have been loaded, and
these are visible on the Genome Sequence Browser. Queries to find genes, or ORFs, that show evidence of protein expression based on Mass Spec analyses are also available.
- C. muris contigs from first draft assembly and translated open reading frames.
- Interproscan analysis of Cryptosporidium proteins.
New features in CryptoDB 3.5:
- We have added tutorial movies to help you learn about the functionality of our site.
If there are additional tutorials you would like to see added please submit a request via our support page.
- Most of the queries have been reorganized to make them more intuitive. Some include a new "Advanced Parameters" section. See for example "Identify Genes based on EST evidence."
- The columns displayed in query result pages are improved. The result pages also offer options for sorting, adding new columns, and moving columns around.
- Downloading results is improved. A choice of formats is provided including tab-delimited, text, GFF, and FASTA to meet different needs.
- Tab delimited provides an expanded list of annotations that can be directly exported as an Excel spreadsheet.
- Text provides a detailed summary for each gene including protein structures, notes, sequence, and many other choices.
- GFF is a format recognized by many applications such as the genome browser, gbrowse.
- FASTA also provides a common format used for BLAST and other applications. The FASTA option provides the functionality of the SRT (sequence retrieval tool).
18 June 2007
We are pleased to announce this new release of the ApiDB.org Website and Database, Version 3.0.
ApiDB is a portal that allows you to query simultaneously different apicomplexan organisms in the Cryptosporidium, Plasmodium and Toxoplasma genera. ApiDB's added value is:
- The ability to blast all ApiDB organisms
- The ability to combine query results from different ApiDB organisms
- The ability to search for orthologous/paralogous genes across all ApiDB organisms
ApiDB release 3.0 uses a new architecture based on web services that allows us to offer all queries available at the component sites (cryptodb.org, plasmodb.org and toxodb.org). Our new front page, with our updated Genome Statistics Table, now includes all available queries; thus, your favorite query is only one click away.
ApiDB 3.0 also includes improvements in the following areas:
New Query Features
- Question Page: Advanced Parameters are hidden to the novice user
- Question Page: New dataset parameter for uploading a set of IDs for ID based queries
- Summary Page: Users can add / remove columns and sort by any column
- Download Report: New report type: FASTA file
- Summary Page and Download Report: All attributes available in the component sites are now available in ApiDB now (in the previous release we were only providing organism name, product description and sequence.)
New/Updated Tools
- ApiCyc: another tool to search metabolic pathways for all ApiDB organisms (P. berghei and P chabauldi are not available for the time being).
- SRT Sequence Retrieval Tool is now an integrated component when downloading reports in FASTA format
- BLAST adds new data sets: T. gondii ESTs, C. parvum proteins, C. muris genomic sequence and orfs, and P. knowlesi gene models.
Community Resources
- New links to the Tetrahymena Genome and The Arabidopsis Information Resource
1 June 2007
ToxoDB 4.1 delivers the apicoplast genome and has incorporated the genomic sequence of T. gondii
VEG and RH (Chr Ia and Chr Ib) strains. The Genome Browser offers tracks for the ESTs from dbEST, ORFs
and the Affymetrix Probe Data (provided by Amit Bahl, Roos Lab, University of Pennsylvania). We also provide
Nucmer alignment of GT1, VEG and RH strains against the reference ME49 strain and SNPs generated from this
alignment. The Gene page now includes matches to Interpro protein domains and ESTs.
The ToxoDB 4.1 site also includes new features such as allowing you to keep your
Query History for later use, an improved keyword search and a significantly
upgraded Query & Tools page.
New data and queries in ToxoDB 4.1:
- Apicoplast genome and annotation
- Affymetrix probes mapped to genome, available in GBrowse
- Nucmer Alignments SNPs of VEG and RH (Chr Ia and Chr Ib) strains
- Toxoplasma ESTs mapped to genome, from the Oct. 2, 2006 release of dbEST
- Interproscan analysis of Toxoplasma proteins
- ORFs greater than 50aa
New features in ToxoDB 4.1:
- A recently designed User Login process. This will allow for:
- making use of the new Persistent Query History functionality, i.e. retrieve your past searches for
current use, as well as save further searches for the future.
- using the "Add a Comment" facility, for commenting on genes and chromosomes/scaffolds.
- adding yourself to mailing list of ToxoDB, and other ApiDB member sites.
- There are now queries available on SNP, EST and ORF datatypes, along with those on genes and sequences.
- Query History page provides additional functionality. For example, user can now rename performed
queries; or delete all queries with press of a button, in addition to doing boolean operations on
query results as well as viewing or downloading these.
- Reorganized Query and Tools page
- Improved text search
17 November 2006
New data and queries in CryptoDB 3.4:
- Preview release of Cryptosporidium sporozoite proteomic data from MS/MS analysis.
A significantly more comprehensive study is in preparation.
- BLAST and protein motif searches are now saved in your query history,
permitting the results to be combined with other queries.
- A new User Login process. This will allow for:
- making use of the new Persistent Query History functionality, i.e. retrieve your past searches for
current use, as well as save further searches for the future.
- using the "Add a Comment" facility, for commenting on genes and contigs.
- adding yourself to mailing list of CryptoDB, and other ApiDB member sites.
RNA trancripts are now available on gene record pages for non-protein-coding genes
17 November 2006
PlasmoDB 5.2 brings a new emphasis on genetic markers, in the form of four new datasets relating to Plasmodium polymorphisms. This data is available in the Genome Browser, on the Gene page and in queries. We continue to move key data from the PlasmoDB 4.4 site to the current site: this release re-introduces gametocyte expression data and expression profile similarity search. The Gene page now includes matches to Interpro protein domains and the Genome Browser offers tracks ESTs from dbEST.
The PlasmoDB 5.2 site also includes new features such as allowing you to keep your Query History for later use, an improved keyword search and a significantly upgraded Query & Tools page.
New data and queries in PlasmoDB 5.2:
- Array data from Kidgell et al. PLoS Pathogens 2006 A systematic map of genetic variation in Plasmodium falciparum.
- Polymorphism data based on resequencing and array analyses.
- SNPs generated by the NIH: Based on CDS sequences from strains 3D7, 7G8, D10, Dd2, HB3 (kindly provided in advance of publication by X. Su; reference: J. Mu et al, Nature Genetics, in press)
- SNPs generated by the Broad Institute: Based on whole genome sequence for strains Dd2 and HB3, and partial genome sequence for multiple strains (kindly provided in advance of publication by D. Decaprio, S.K. Volkman & D.F. Wirth; reference: S.K. Volkman et al, Nature Genetics, in press)
- SNPs generated by the WTSI: Based on whole genome sequence for strains Ghana1 and IT and a comparison with P. reichenowi (kindly provided in advance of publication by D.C. Jeffares & M. Berriman; reference: D.C. Jeffares et al, Nature Genetics, in press).
- Array CGH data on a variety of Plasmodium strains kindly provided by E. Winzeler.
- A query to generate gene lists based on SNP characteristics such as non-synonymous / synonymous substitutions and SNP density.
- Plasmodium ESTs from the Oct. 2, 2006 release of dbEST.
- Interproscan analysis of Plasmodium proteins.
- PlasmoCyc pathways, available from gene pages
New features in PlasmoDB 5.2:
- Save past searches (Persistent Query History)
- Reorganized Query and Tools page
- Improved text search
- New tutorial movies
Features and data moved from PlasmoDB 4.4 to 5.2:
- Gametocyte data from Young et al. Mol Biochem parasitol 2005.
- Expression profile similarity for erythrocytic stage data from P. falciparum 3D7, HB3, and DD2 (Llinas et al., Nucl. Acids. Res. 2006; Bozdech et al PLos Biol 2003). New to 5.2 is the inclusion of P. berghei data (Hall et al. Science 2005).
17 November 2006
We are pleased to announce this new release of the ApiDB.org Website and Database, Version 2.1. It includes improvements in different areas:
New Features, as in Crypto.org, Plasmo.org and Toxo.org
- User support page for you to send us feedback.
- User registration/login. This will allow you two things: to receive emails from us with announcements and to keep a persistent query history that stays between sessions.
New/updated Queries
- New Query&Tools page with the "Query Grid". It summarizes at once which queries are avilable in each site.
- Consistent query and parameter names accross sites.
- Genes by Pfam Domain: easier selecting a pfam term with Ajax.
- Ortholog relationships spanning all supported organisms.
- ApiDB site is catching up with Crypto, Plasmo and Toxo sites; ten new gene queries are offered in this release.
- New queries that return Genomic Sequences, ESTs and ORFs.
New/updated Data
- All apicomplexan ESTs from NCBI dbEST
- Crypto, Plasmo and Toxo have added Transcripts information, plus ESTs and ORFs that ApiDB uses in its BLASTs.
Community Resources
- New links to GeneDB and ModBase/UCSF.
17 November 2006
The PlasmoDB 5.1 release continues our effort to add new data and features while migrating PlasmoDB 4.4 functionality to the PlasmoDB 5 architecture. This release includes new rodent malaria parasite data, improved mappings to external sites, improvements to the Genome Browser, more 4.4 functionality (we're almost done with the migration) and some important behind the scenes improvements.
Here are the details:
- Rodent malaria parasite (RMP) data
- For P. berghei and P. chabaudi
- Genome Browser tracks showing synteny to P. falciparum
- TIGR Gene Index (TGI) tracks in the Genome Browser
- Pfam domains mapped to predicted protein products
- A Genome Browser track showing glass slide array oligos mapped to P. yoelii
- Updated mappings from PlasmoDB genes to
- The Protein Data Bank (PDB)
- Genbank, RefSeq and Swissprot
- Reagents from the MR4 Consortium
- tRNA gene predictions for P. vivax, P. berghei, P. chabaudi and P. yoelii.
- Migrated functionality PlasmoDB 4.4
- Array element pages that describe the details of oligos and their expression.
- A protein secondary structure query and track on the gene page
- Links from PlasmoDB genes to predicted protein structure models
- The "Add a Comment" facility (for commenting on genes and chromosomes/contigs).
- A newly designed User Login process. (In upcoming releases logging in to the site will let you store your query history and set preferences. For now, it just lets you Add a Comment).
- Genome Browser features
- Save your favorite track configurations and regions of interest.
- Download the sequence of any feature in any track
- Improved citations on the Data Sources page
- Behind the scenes, a new issue tracking system so we can better respond to your support issues.
16 August 2006
We are pleased to announce Version 4.0 of the Toxoplasma Genome Database.
ToxoDB provides access to the genome sequence and annotation of the apicomplexan parasite T. gondii (ME49 strain). We have also incorporated into this release the genomic sequence of T. gondii GT1 strain, provided by The Institute for Genome Research (TIGR).
ToxoDB 4.0 includes:
- A new interactive genome browser, based on the GBrowse module of the GMOD project, providing rapid visualization of the parasite genome and gene models, custom restriction site identification, open reading frame identification, and downloads in various formats.
- The genome sequence for T. gondii GT1 strain (from TIGR).
- T. gondii ME49 strain annotation, including manual annotation from Toxoplasma Genome Sequencing Project; updated GO term curation.
- TIGR Gene Indices for T. gondii.
- New ortholog analyses, using the OrthoMCL algorithm to group genes from T. gondii with other complete eukaryotic and prokaryotic genomes (Chen et al., Nucl. Acids Res. 34:D363; 2006).
The release represents a complete restructuring of this database resource.
We have eliminated the practice of using flat files as a means of data storage, upgraded to use the Genomics Unified Schema (GUS) database system and populated an underlying Oracle database in a systematic fashion.
Additionally, we have employed the new Web Development Kit (WDK), dramatically changing the way the web site operates under the hood.
This transformation conforms to the model used by all ApiDB projects; it will henceforth not only make it easier to generate future releases in short cycles, but these changes also ensure a consistent look and feel for all ApiDB projects, namely ToxoDB, CryptoDB, and PlasmoDB (supported by a Bioinformatics Resource Center contract from the US NIH / NIAID).
Further, note that not only can the users ask questions (by issuing queries), but the website also provides the improved functionality of combining queries together and/or downloading the results in a customizable manner.
27 July 2006
We are pleased to announce the first full release of the Plasmodium Genome Database, Version 5.0, now as www.plasmodb.org. (Many of you have had a chance to explore the beta-release version as v5-0.plasmodb.org).
PlasmoDB 5.0 implements significant 'under-the-hood' changes, making it much easier to generate future releases in short cycles. In addition, these changes to ensure a consistent look for all ApiDB projects, including PlasmoDB, CryptoDB, and ToxoDB (supported by a Bioinformatics Resource Center contract from the US NIH / NIAID).
Because it will take some time to implement all of the PlasmoDB 4.4 functionality in the new architecture, PlasmoDB 4.4 remains active, and the results of queries run under it may be exported to PlasmoDB 5.0 (see PlasmoDB 4.4 Query History).
New Data in PlasmoDB 5.0 (relative to version 4.4) include:
- the complete genome sequence for P. vivax Salvador, along with a complete set of annotated gene models
- the complete genome sequence for P. berghei, along with a complete set of annotated gene models
- the complete genome sequence for P. chabaudi, along with a complete set of annotated gene models
- TIGR Gene Indices for P. vivax, P. falciparum, and P. yoelii
- protein-protein interaction data, based on a genome-wide yeast 2-hybrid analysis (LaCount et al. Nature 438:103; 2005)
- new ortholog analyses, using the OrthoMCL algorithm to group genes from several Plasmodium species with other complete eukaryotic and prokaryotic genomes (Chen et al., Nucl. Acids Res. 34:D363; 2006)
- Computational predictions for functional interactions in P. falciparum from the PlasmoMAP project are provided based on co-evolution and co-expression of genes (Date & Stoeckert Genome Res. 16:542; 2006). This data is now available as a PlasmoDB query.
- Updated mappings for pathways and Enzyme Classifications (EC).
New Features in PlasmoDB 5.0 Include:
- an interactive genome browser, based on the GBrowse module of the GMOD project
- syntenic relationships between P. falciparum, P. vivax, and P. yoelii (Carlton et al., Nature 419:512; 2002).
Finally, we are pleased to report a new CD-ROM version of the Plasmodium Genome Database. PlasmoCD provides users lacking reliable high-speed internet connections with full access to the complete genomes of P. falciparum, P. vivax, and P. yoelii, along with gene models, annotation, BLAST hits, ortholog results, expression data, and search functions for querying and integrating these results across species.
25 May 2006
New in this release:
- New Gene Annotations
-
411 additional gene annotations have been made by CryptoDB for the C. parvum genome. These correspond to the genes annotated in the Genbank record simply as "introns and exons likely".
- PFam Predictions
-
interproscan was used to query C. parvum and C. hominis annotated proteins against the PFam component of InterPro. A query to find genes by PFam family name has been added and predicted domains are documented on the gene record pages.
- Gene Ontology Terms
-
GO terms were collected from the interproscan analysis and are available for gene searches and are documented on the gene record pages.
- EC Numbers
-
EC number annotations have been added. Sources of EC numbers include Genbank, KEGG and the metabolic pathway database CryptoCyc. Genes may be searched by EC number.
- Gene Orthologs
-
C. parvum and C. hominis orthologs computed at OrthoMCL DB have been added to our database.
- We have added query to find Cryptosporidium genes that have a specified phyletic pattern, based on ortholog groups.
- Analysis for MRC-LMB's chromosome 6
-
C. parvum chromosome 6 completed by Bankier, et al., MRC Laboratory of Molecular Biology has undergone our in-house analysis. This includes DNA and protein sequences searched against NCBI's non-redundant protein database and protein queries against the PFam database.
16 April 2006
New in this release:
-
Our graphical genome viewer
can now highlight syntenic gene comparisons. Check the box for 'Syntenic Genes (C. hominis vs. C. parvum)',
then 'Update Image'
25 January 2006
Highlights of Release 3.1 include:
-
The addition of C. parvum chromosome 6 completed by Bankier, et al., MRC Laboratory of Molecular Biology.
Bankier AT, Spriggs HF, Fartmann B, Konfortov BA, Madera M, Vogel C, Teichmann SA, Ivens A, Dear PH. Integrated mapping, chromosomal sequencing and sequence analysis of Cryptosporidium parvum. Genome Res. 2004 Feb;14(2):327. [PubMed]
-
The site now offers a query history (accessible via the link in the toolbar) through which you may build on simple queries to create more complex combinations. There is also an option to download query results as plain text. This option is available on the search results and query history pages.
-
New queries for ESTs and genes by protein molecular weight have been added.
-
CryptoCyc, a database of computationally predicted metabolic pathways for Cryptosporidium, is available online and is cross-referenced on the gene and contig pages of CryptoDB.
7 November 2005
CryptoDB is now part of an NIH/NIAID funded Bioinformatics Resource Center
to provide Apicomplexan Database Resources. This release represents a major
upgrade of the database software and as a result we can provide greater
search power. We now offer searches by gene and contig name in addition
to key word and protein features. Results are displayed graphically and
provide additional useful information for each gene or contig. We have
added a GBrowse display that allows you to zoom in to the nucleotide level
and we have added a comparative GBrowse view comparing C. parvum
and C. hominis contigs. Please explore the site and provide feedback
and suggestions to help@cryptodb.org.
1 August 2005
| | |
|
|