Select Data Source(s):








<>

News and Events Announcements

Click for story highlights and resource links.
show more...
Release 5.10 March 2012 Issue

The Benjamin Franklin Award for Open Access in the Life Sciences is a humanitarian/bioethics award presented annually by Bioinformatics.org.
show more...
Workshop entitled “PATRIC: A resource for infectious disease research. Real-life examples used to drive software development”.
show more...
To improve the annotation of the Mycobacterium tuberculosis genome sequence and increase the value of this community resource.
show more...
Release 5.9 January 2012 Issue

A first-pass assembly and annotation of P. yoelii yoelii YM is now publicly available on GeneDB. The assembly was made using short-fragment Illumina reads and scaffolded with 454 paired-reads from a 3kb library. The assembly was computationally improved using PAGIT (http://www.sanger.ac.uk/resources/software/pagit/) and annotated by transfer from the P. berghei ANKA reference. Genes in subtelomeric regions were predicted ab initio. The annotation has been manually re-inspected genome-wide.

P. yoelii yoelii YM can be viewed and searched on GeneDB. EMBL and GFF format files can be downloaded from the following FTP site:

ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/yoeliiYM/version_1/January_2012

A publication describing the new sequence is in preparation and permission should therefore be obtained before undertaking genome-scale or whole chromosome wide analyses.

This work was funded by the Wellcome Trust Sanger Institute and the European Union's 7th Framework Programme (FP7/2007-2013) under grant agreement no 242095. The DNA was provided by Tony Holder (Division of Parasitology, MRC National Institute for Medical Research, Mill Hill, London).

Dear all,

VectorBase is pleased to announce that Nucleic Acid Research has accepted its paper for publication in the 2012 Database issue. The article describes improvements since the last NAR publication, in 2009.

Check it out here.

From now on, please use this paper to cite us!

Release 2.8 November 2011 Issue

Release 5.8 November 2011 Issue

A new assembly of the P. falciparum IT genome is now available, replacing the version 1 sequence that has been available via PlasmoDB (since release 8.0) and the Sanger FTP site. The version 1 sequence was produced by "correcting" (or morphing) the reference 3D7 sequence, using aligned Illumina reads from IT.   However, in version 2, the sequence data have been assembled de novo. A key aim for this genome project is to fully assemble the subtelomeric regions. Although we were able to correctly assemble some subtelomeric sequences onto the ?core? of the genome (i.e. the interstitial or housekeeping region), many subtelomeric sequences remain problematic and are being actively refined. The annotation is automated and no manual curation has taken place.

The new version contains new systematic identifiers compliant with those now in use for 3D7.  However, old identifiers remain in the database as fully searchable synonyms.  The new version can be viewed and searched on GeneDB and will be loaded into PlasmoDB in early 2012.  EMBL and GFF format files can be downloaded from the following FTP site:

ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/falciparum/IT_strain/version_2

This work was funded by the Wellcome Trust [grant number WT085775/Z/08/Z] and the European Union's 7th Framework Programme (FP7/2007-2013) under grant agreement no 242095.


A preliminary 0.5 data set for Rhodnius prolixus is included in the VB-2011-10 release:

At the present time, the following data are available:
  • repeats (Dust, TRF, known repeated elements publicly available)
  • alignments to 9,665 Rhodnius prolixus Expressed Sequence Tags (ESTs) from dbEST mapped to the genome
  • alignments to 792,023 sequences from 2 Rhodnius prolixus RNA-seq datasets mapped to the genome
  • alignments to 143,631 proteins from UniProt, grouped into 7 taxa, mapped to the genome
  • 18,426 Genscan and 65,946 SNAP ab intio gene models
  • 15,878 gene models from the Lagerblad lab
  • consensus gene set with 20,976 genes and 32,624 transcripts built with the Ensembl gene build pipeline


Be aware that the gene, transcript and peptide identifiers are not stable and there is no plan to maintain them.

They have not been indexed as part of the site-wide search.

Note that VectorBase provides a BLAST server to compare your sequence against the Rhodnius prolixus supercontigs.

Gene annotation is available in GFF3 and GTF for download, as are the transcripts, peptides, supercontigs and ESTs.

An enhanced genebuild will follow in a future VectorBase release. We aim to reduce the high number of transcripts.

The Glossina morsitans 0.5 gene set is now available!

This is our first gene set was generated by MAKER. Input gene models are issues from species specific EST and RNA-seq alignments, Metazoa protein alignments, and ab initio programs. The community is currently assessing it and we hope to come up with a new, improved, gene set soon!

While this gene set is not downloadable, it is accessible on the genome browser or for blasting.

Note that this is a TEMPORARY gene set, it is not searchable and won't be archived. We are not planning on maintaining identifiers and recommend using locations as well as identifiers to identify genes

Happy Glossina browsing!

The Rapid Annotation using Subsystems Technology (RAST) annotation program was developed in 2007 and is currently utilized and funded via PATRIC.
show more...
VectorBase is pleased to announce the VB-2011-10 release. Highlights of this release include:
  • Several major new microarray experiments added for Anopheles gambiae and Aedes aegypti
  • Flat file expression downloads and an expression map are now available for Aedes aegypti.
  • Improvement to the Rhodnius prolixus gene set and presentation of Pedro Lagerblad/Jose Ribeiro's prediction set.
More detailed release notes are available on the Wiki.

Held at Virginia Bioinformatics Institute in conjunction with the Genetics, Bioinformatics, and Computational Biology (GBCB) Seminar Series.
show more...



Since the publication of the genome in 2002, work has continued at the Sanger Institute to produce a single base-perfect P. falciparum reference genome. This new sequence version is an upgrade from version 2.1.4 to version 3 and includes some substantial changes. Barring some highly AT-rich tracts that remain to be fully verified, we expect this to be the last major sequence update.

Differences include the correction of major mis-assemblies on chromosome 7 and 8 – one end of chromosome 7 now replaces one end of chromosome 8 (and vice versa), the replacement of all ambiguous bases (N’s) in the genome with the correct sequence and the correction of hundreds of sequencing errors. The original P. falciparum 3D7 genome project was started by three different genome centers – Sanger, TIGR and Stanford – all of whom used different naming schemes (e.g MAL7P1.10, PF11_0344, PFE0010c). For the submission in 2002 all three naming schemes were used. In October 2007, the systematic re-annotation of P. falciparum 3D7 started in earnest with a community annotation workshop and work has continued at Sanger ever since. During that time, new genes have been created and other genes have been split. When a new ID needed to be assigned but a logical numerical identifier was not available, a suffix of ‘a’ and ‘b’ was added to distinguish from the neighbouring gene. In addition, now that part of chromosome 7 is located on chr8, and vice versa, it would be more confusing to have MAL7 identifiers on chr8 and MAL8 identifiers on chr7.

We have therefore decided to adopt a consistent new scheme to identify genes. The old identifiers will be maintained as fully searchable synonyms but will be made significantly less prominent in PlasmoDB and GeneDB. An important additional change is that both protein-coding and non-protein coding genes will now use the same consistent nomenclature. The new naming scheme complies with the standards set by the International Nucleotide Sequence Database Collaboration (Genbank, EMBL & DDBJ). All new identifiers now start with a two-part prefix: PF3D7_ to identify the genome and two numerical digits to identify the chromosome, e.g. the first gene on chr1 is now called PF3D7_0100100. The new naming scheme increments intervals of 100 to allow significant numbers of genes to be added (although we believe that this is unlikely). This new naming scheme will also be adopted to other genomes that are being worked on at the Sanger Institute, e.g. P. falciparum IT genes will start with PFIT_. The new sequence version can be viewed and searched on GeneDB and will be loaded into PlasmoDB in early 2012. EMBL and GFF format files, together with a conversion table of old:new identifiers, can be downloaded from the following FTP site:

ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/falciparum/3D7/3D7.latest_version/version3/October_2011/

Dr. Rebecca Wattam and Maulik Shukla highlight data and analysis tools for bacterial pathogens via Brucella use cases.
show more...
VectorBase is pleased to announce the VB-2011-08 release. Highlights of this release include:
  • First announcement of the new population biology resource prototype web interface
  • Updated Ensembl genome browser (to schema 63)
More details are available in the Release notes.

Release 2.6 July 2011 Issue

Release 5.6 July 2011 Issue

PATRIC has updated its annotation and analysis to include nine strains from the recent outbreak, including a complete assembly of E. coli TY-2482.
show more...
The University of Notre Dame (www.nd.edu) is seeking a biologist with training and research experience in molecular biology and genomics, to join a large team working on highly visible web resources for the genomes and genome-related data of arthropod vectors of human pathogens (VectorBase). The successful applicant is expected to work closely with VectorBase developers and the external community of scientists who use the resource. The incumbent must be able to effectively communicate with researchers using VectorBase in order to define their needs, and convey that information to VectorBase software engineers and database specialists with the goal of optimizing the user experience. Major responsibilities of this position include providing for an intuitive and informative interface to access and interrogate genomic-scale datasets, and identifying/soliciting additional genomic-scale datasets from the external community. Position requires active participation in scientific meetings, both within the US and abroad, including demonstrations, workshops, and other instructional activities. Leadership in the preparation of tutorials, technical documentation, and reports is also expected. Opportunity for participation in bioinformatics / computational biology research for suitably motivated individual.

Additional information can be found here.

VectorBase is pleased to announce the VB-2011-06 release. Highlights of this release include:
  • Pre-site for Glossina moritans. Features mapped to the genome and preliminary gene sets available.
  • New microarray data from Christian et al. and Cassone et al.
More details are available in the Release notes.

The latest VectorBase release (VB-2011-06) contains preliminary data for the Glossina morsitans genome:
  • assembly, ESTs and traces are available for blasting,
  • contigs, supercontigs, agp files, ESTs and traces are available for download,
  • EST-based genes, ab initio SNAP genes, MAKER gene models, and repeats are available via the genome browser.
Glossina needs you! Manual annotation provide an invaluable addition to the automatic annotation - there is already a small community in place that will be helping with manual annotation but if you are interested, please contact Serap Aksoy (Serap.Aksoy -at- yale.edu) or Geoff Attardo (geoffrey.attardo -at- yale.edu).

Conformity scores and associated data are available.
show more...
Release 2.5 May 2011 Issue

Release 5.5 May 2011 Issue

PATRIC has collected metadata associated with genomes from sources, such as NCBI’s BioProject, GenBank, and the Human Microbiome Project.
show more...
We anticipate making a minimum of two awards of up to $600K each.
show more...
VectorBase is pleased to announce the VB-2011-04 release. Highlights of this release include:
  • New microarray expression data for Culex quinquefasciatus and Aedes aegypti
  • Updated mass spec. peptides for Anopheles gambiae available via DAS and simple browsing interface
  • Anopheles funestus RNA-seq contigs available for BLAST and browsing and search
  • GTF file format for use in tophat/cufflinks analysis of RNA-seq data available for download from all species.
More detailed release notes are available on the Wiki.

The NIAID-funded VectorBase invites research investigations called Driving Biological Projects (DBPs) from organizations housing experimental laboratories having expertise and knowledge in working with arthropod vectors of human pathogens. We anticipate making two awards of up to $600K each by April 1, 2012. A two page Letter of Intent (LOI) is due August 1, 2011. Full proposals will be solicited by invitation only, with a deadline of November 1, 2011.

See the full solicitation here. Send questions to vector@nd.edu

Free PATRIC bioinformatics resource center workshop featuring data and analysis tools for bacterial pathogens.
show more...
Samples are now in sequencing at the Broad Institute for the NHGRI/NIAID-supported project, "Genome Analysis of Vectorial Capacity in Major Anopheles Vectors of Malaria Parasites". Please visit the web site http://www.broadinstitute.org/annotation/genome/anopheles/ for information, updates, and data as they become available.

Following extensive discussion with the malaria research community, the list of Plasmodium genome sequencing projects supported by the US National Institutes of Allergy and Infectious Diseases (NIAID) has been updated, replacing entries that have been difficult to source with additional isolates and species of interest. 

The initial white paper is available here; click here for the revised list of approved projects.  The Plasmodium sequencing contract has been awarded to the Broad Institute, and data is made publicly available to the broader research community via deposition in GenBank and through relevant Bioinformatics Resource Centers, including PlasmoDB.  Further information on the 'white paper' process by which key pathogen species are approved for sequencing is available here.

Release 2.4 March 2011 Issue

Release 5.4 March 2011 Issue

Dear Cryptosporidium Researcher,

We would like to inform you of a proposal being prepared on behalf of the research community to generate genomic and RNA-seq data for a number of Cryptosporidium species, including laboratory propagated and geographic/clinical isolates. �The proposal will be submitted for funding to the National Institutes of Allergy and Infectious Diseases (NIAID) for consideration by the Eukaryotic Pathogen and Disease Vector Sequencing Project working group and coordinating committee.

http://www.niaid.nih.gov/labsandresources/resources/gsc/pathogen/Pages/default.aspx

The species we are suggesting include C. parvum, C. hominis, C. meleagridis, C. bovis, C. bailey, C. cuniculus and C. ubiquitum. This sequencing effort will facilitate comparative analyses to identify genes which control host specificity and other clinically and biologically important traits, as well as to extend our understanding of Cryptosporidium populations. All data will be available to the public through the National Center for Biotechnology Information (NCBI) and CryptoDB.org websites.

We would like to solicit your input and request your support. �At this time an e-mail expressing your interest in this project or proposing changes would be much appreciated. Please indicate whether you may have access to Cryptosporidium positive clinical or animal samples which could be made available for this project.

Please send emails of inquiry and/or support to Giovanni Widmer (giovanni.widmer@tufts.edu) or Jessie Kissinger (jkissing@uga.edu)

Release 2.10 March 2012 Issue

Release 2.9 January 2012 Issue

VectorBase is pleased to announce the VB-2011-02 release. Highlights of this release include: Preliminary Rhodnius prolixus genome browsers with transcript and protein similarities, repeats and preliminary gene predictions. Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes. Update to the Xrefs for Aedes, Anopheles. More detailed release notes are available.

A preliminary data set for Rhodnius prolixus is included in the VB-2011-02 release: http://rprolixus.vectorbase.org/

The annotation of Rhodnius prolixus is in process. At the present time, the following data are available:
    - repeats (Dust, TRF, known repeated elements publicly available);
    - alignments to 9,665 Rhodnius prolixus Expressed Sequence Tags (ESTs) from dbEST mapped to the genome;
    - alignments to 792,023 sequences from 2 Rhodnius prolixus RNA-seq datasets mapped to the genome;
    - EST and RNA-seq derived gene set
    - alignments to 143,631 proteins from Uniprot, grouped into 7 taxa, mapped to the genome
    - 18,426 (Genscan http://genes.mit.edu/GENSCAN.html) ab initio gene models


Be aware that the EST and RNA-seq derived genes are not part of the official gene set, their identifiers are not stable and there is no plan to maintain them.

They have not been indexed as part of the site-wide search.

Note that VectorBase provides a (Blast server http://www.vectorbase.org/Tools/BLAST/) to compare your sequence against the Rhodnius prolixus supercontigs.

Raw and processed versions of the ESTs and RNA-seq datasets are available for (download http://www.vectorbase.org/GetData/Downloads/), as are the supercontigs.

A full genebuild will follow in a future VectorBase release.

Free PATRIC bioinformatics resource center workshop featuring data and analysis tools for bacterial pathogens.
show more...
Release 2.3 January 2011 Issue

Release 5.3 January 2011 Issue

The Pathogen Functional Genomics Resource Center (PFGRC) Pathogen DNA Microarray oligo and printing designs have been submitted and are publicly available from GEO Datasets for investigators who would like to use these designs to order microarrays from a microarray supplier. The GEO Accession Numbers are available from the Pathogen Portal here.
PFGRC Microarray Laboratory Protocols are also now available from the Pathogen Portal here.
The PFGRC has also made program source files, executables, and documentation for bioinformatics data management and analysis tools available for download through the SourceForge website and the PathogenPortal here.

VectorBase is proud to announce its December release, codenamed VB-2010-12. You can find all the details in the release notes. Comments, questions and bug reports are welcome via the usual channels (help desk email, forum, Facebook and Twitter).

We've incorporated another 500 community annotations to the Anopheles gambiae gene set! More precisely, the update has lead to: - 65 new genes, - 156 new isoform predictions, - 443 modified transcript predictions. This new gene set AgamP3.6 is available with the new VectorBase release VB-2010-12. More details are available on our Help Wiki, including the list of modifications between both gene sets.


Cell Symposia | Influenza: Translating basic insights ? Dec. 2-4, 2010 This symposium was held on December 2nd ? 4th at the Washington Marriott Hotel in Washington DC. IRD team members were in attendance and were available at a booth. A poster presentation was also be given during an assigned poster session.

Options for the Control of Influenza VII ? Sept. 3-7, 2010 IRD team members were in attendance throughout the course of this conference and were available at a booth in the exhibit hall. Team member 3 poster presentations are as follows:

















Release 2.2 November 2010 Issue

Release 5.2 November 2010 Issue

We are pleased to announce the publication of a paper in Nucleic Acid Research describing AmoebaDB.
Please cite this manuscript when referencing the AmoebaDB resource in your publications.

AmoebaDB and MicrosporidiaDB: functional genomic resources for Amoebozoa and Microsporidia species. Nucleic Acids Res. 2010 Oct 24. Aurrecoechea C, et al.


Two-year awards focusing on infectious disease research related to human bacterial pathogens were granted.
show more...
The 10th issue of the VectorBase newsletter is out! Lots of new data and tools to present this time. The most important might be the new Anopheles gambiae M and S genome browsers. But we also present the selected Driving Biological Projects, the Culex genome papers, link to additional data, and new tools, mainly to reach our community (contact form, RSS feeds, Facebook). Have a break and read our gazette! It is accessible on-line and in PDF from the Newsletter Help Page. You can also subscribe to the Newsletter mailing list to receive it directly in your mail box.

Genome sequencing of the M & S molecular forms of An. gambiae were published today showing that speciation is more advanced than previously thought. This greater divergence between the genomes has highlights the need to identify those genes critical for initiating this process. A companion paper describes the development of a SNP genotyping platform for investigating gene flow between the incipient species. Genome browsers for the M & S molecular forms are available through VectorBase:

We are pleased to report that the following Driving Biological Projects (DBPs) have been awarded in connection with the EuPathDB Bioinformatics Resource Center (BRC):

  • The Toxoplasma oocyst transcriptome and proteome
    John Boothroyd, Matt Bogyo (Stanford) & Pat Conrad (UC Davis)
    This project is expected to yield expression data from an important, but inaccessible, life cycle stage (1 yr).

  • A comprehensive catalog of the T. gondii & N. caninum parasite & infected host cell transcriptome & proteome
    Brian Gregory (Univ. Pennsylvania) & Jonathan Wastling (Univ. Liverpool)
    This project will produce an immense volume of sequence data, driving database improvements for handling 'next-gen' sequencing, non-coding RNAs, and interrogating host & pathogen datasets in parallel (2 yr).

  • Dissection of Plasmodium falciparum genetic crosses by mass spectrometry-based metabolite QTL
    Manuel Llinás (Princeton) & Michael Ferdig (Notre Dame)
    This project is expected to improve handling of genetic mapping data, incorporation of metabolomics results (integrated with pathway maps), and linking of specific genetic loci with metabolite abundance (2 yr).

A biennial competition for DBP awards is mandated by the BRC contract from NIH/NIAID, and supported by a total of $1.2M in total costs per award cycle. The goal of this program is to produce high value genomic-scale datasets for the EuPathDB user community, and drive the further development of bioinformatics resources for data analysis and mining.

Letters of intent (LOI) were solicited in February (see Feb 4 News item), and reviewed by the EuPathDB Scientific Working Group (SWG; see News, March 31). 12 full applications were solicited from among the ~50 LOIs received, and awards were based on further scientific review by the SWG, and in consultation with NIAID program staff. Cost negotiations and 'in kind' contributions from other sources made it possible to support 3 applications, from 6 investigators (in some cases, more than one LOI was merged into a single award).

We received far more high quality proposals than could be supported, testifying to the need for such mechanisms for data production bioinformatics analysis. The EuPathDB team looks forward to working with members of our user community who are seeking funding for such projects from external sources. Inquiries, including requests for support letters, should be directed to the EuPathDB staff (click on the 'Contact Us' link).

A new solicitation for DBPs is anticipated in 2011, and will be announced on the EuPathDB web site.

VectorBase is pleased to announce our first two Driving Biological Projects (DBP) funded by NIH/NIAID on the recommendation by the VectorBase Scientific Working Group (SWG). All data generated by these funded DBPs will be made publicly available on VectorBase:

  • Comparative Neurotranscriptome of Aedes aegypti (PDF)
  • Physical Reference Maps for Vector Genome Assembly (PDF)

Please contact us at info@vectorbase.org if you have any questions. We expect the next call for proposals to be announced with the next 18 months.

VectorBase is pleased to announce the VB-2010-10 release. Highlights of this release include:
  • Preliminary Anopheles gambiae M and S genome browsers with EST mappings and EST-based gene predictions
  • Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
  • Update to the Xrefs for Aedes, Anopheles, Culex and Ixodes.
More detailed release notes are available on the Wiki.

Come bring your data and participate in free hands-on tutorials featuring the annotation and modeling services offered by the RAST/SEED servers.
show more...
The genome paper presenting the annotation and the analysis of the Culex quinquefasciatus genome, the third mosquito to be sequenced, has been published today in the journal Science. Check it out!

A companion paper detailing the analysis of the Culex pathogenomics and infection response to diverse pathogens is published in the same issue.

In our latest efforts to reach out to the bioinformatics community surrounding VectorBase, we will be providing an RSS feed providing updates on the latest employment opportunities. Please add this feed to your reader to catch the latest jobs news!

VectorBase is pleased to announce the VB-2010-08 release. Highlights of this release include:
  • New genome browser look and feel.
  • Three Aedes aegypti expression studies added.
  • Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
  • Update to the Xrefs for Aedes, Anopheles, Culex and Ixodes.
  • Update to the EST data for Aedes and Anopheles.
More detailed release notes are available on the Wiki. We are aware of a number of outstanding issues with this release, documented in the release notes, and are actively addressing those. Please report bugs or other issues to the VectorBase helpdesk

Release 2.0 July 2010 Issue

Release 5.0 July 2010 Issue

Welcome to the 9th issue of the VectorBase newsletter!

We announce the new Ensembl Genome Browser and an updated Search facility. The newsletter also features new data:Pediculus humanus genome browser, IDOMAL data as well as RNA-seq expression studies; and much more information ... so come and have a look!

It is accessible on-line and in PDF from the Newsletter Help Page. You can also subscribe to the Newsletter mailing list to receive it directly in your mail box.


VectorBase is pleased to announce the VB-2010-06 release.

Highlights of this release include:

  • New genome browser look and feel
  • Human body louse Pediculus humanus genome added
  • IDOMAL malaria ontology released

More detailed release notes are available on the Wiki.

We are aware of a number of outstanding issues with this release, documented in the release notes, and are actively addressing those. Please report bugs or other issues to the VectorBase helpdesk


Release 4.15 May 2010 Issue

Release 1.3 May 2010 Issue

VectorBase will soon be updating its Genome Browser to stay in line with the Ensembl code, and benefit from their constant improvements. You can familiarise yourself with the new genome browser via the Ensembl Genome links below. These changes are scheduled for release in early June. Please note that, from June, the VectorBase-hosted genome browser will have enhanced functionality (e.g. expression data, useful DAS tracks configured by default). The hope that the new presentation is more intuitive and accessible. We will help you go through the change with a new tutorial and will be answering questions on our HELP mailing list. The web addresses won’t change so any links you bookmarked will still be valid.

You can have a peak at the new presentation for using these links:


In its continuing effort to address Grand Challenges in Global Health, the Foundation for the National Institutes of Health is providing $6 Million in grants that seek approaches to discover new chemical entities with novel mechanisms of action for insecticides to control malaria.

Application instructions are here. More information in the forum.

Letters of Inquiry are due by June 25, 2010

PATRIC team members attend this bacterial genome annotation workshop whose purpose is to explore the utility of a competition process to advancing the science of gene and genome annotation.
show more...
Drs. Williams and Sobral's conatruct a new gamma-proteobacteria "tree of life" using PATRIC's data resources.
show more...
PATRIC team members attend this workshop to help develop PATRIC requirements and understand implications of emerging standards on PATRIC infrastructure.
show more...
Release 4.14 April 2010 Issue

Release 1.2 April 2010 Issue

EuPathDB is pleased to announce the members of the scientific working group (SWG). The SWG provides EuPathDB with scientific oversight and guidance. Members of the SWG also review the driving biological projects initiative.

Michael Gottlieb, Chair
Foundation for National Institutes of Health (FNIH)
Broad expertise in parasitology, large integrative projects (chair)
Matt Berriman
Wellcome Trust Sanger Institute
Pathogen genomics & functional annotation
John Boothroyd
Stanford University
Toxoplasma biology & host-pathogen interactions
Greg Buck
Virginia Commonwealth University
Pathogenomics of apicomplexan and kinetoplastid parasites
Patrick Duffy
National Institutes of Health (NIAID)
Laboratory & clinical aspects of malaria biology
Pascale Gaudet
Northwestern University
Biocuration, ontologies, DIctyostelium
Adrian Hehl
University of Zurich
Cellular and molecular biology of Giardia
Patrick Keeling
University of British Columbia
Comparative genomics
Bill Petri
University of Virginia
Laboratory & clinical aspects of Entamoeba and diarrheal diseases
Lincoln Stein
Ontario Cancer Center
Bioinformatics tools & analysis
Lu Wang
National Institutes of Health (NHGRI)
Large-scale sequencing
Jonathan Wastling
University of Liverpool
Proteomics of apicomplexan parasites
Louis Weiss
Albert Einstein College of Medicine
Microsporidia, Toxoplasma, and clinical microbiology
Dyann Wirth
Harvard School of Public Health
Plasmodium population biology, drug discovery

VectorBase is pleased to announce the members of the Scientific Working Group (SWG). The SWG provides advice about the management, performance, and the needs of the scientific community using VectorBase. Members of the SWG also review the VectorBase Driving Biological Projects (DBP) proposals.

Group details can be viewed here.


VectorBase's BioMart services are now in sync with the gene sets on the rest of the web site. Please see the forum item for further details. Please contact the VectorBase help desk with any queries.

We are pleased to announce new features in TrichDB 1.1

New Features:

  • The gray tool bar at the top of all TrichDB web pages has been enhanced to include a link to the "My Basket" feature including an indication of how may items are in the basket.
  • The genome browser has been updated to version 1.7, which offers enhanced features and stability. 
  • Weighted searches are now available.  This feature allows you to assign an arbitrary weight to the searches (steps) in your search strategy making it possible for you to view a ranked list of your results. 

***Sign up to be a EuPathDB feature evaluator:***

We welcome you to *sign up* to be an early evaluator of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process. Signing up is easy. Evaluating will require minimal effort on your part, typically in the form of answering specific questions, using the feature(s) and rendering your opinion.

We are pleased to announce new features in ToxoDB 6.0

Community Annotation:

As many of you know, a community driven annotation effort has been underway.  The response has been great (350 genes have been annotated by the community so far) and we are happy to now include these annotations in ToxoDB.  The ultimate goal is to translate this effort into changes in product names and gene models.  As a first step, since this process is still ongoing, community annotations can be viewed by searching using the text search tool in ToxoDB and limiting your search to the Community annotation.  To view all genes that have a community annotation associated with them simply enter the wild card "*" in the text search field.  The annotation itself is prominently displayed on the gene pages right under the product name.  

As noted, this is an ongoing process.  We have several more annotations to incorporate and many more promised.  

New Features:

  • The gray tool bar at the top of all ToxoDB web pages has been enhanced to include a link to the "My Basket" feature including an indication of how may items are in the basket.
  • The genome browser has been updated to version 1.7, which offers enhanced features and stability. 
  • Weighted searches are now available.  This feature allows you to assign an arbitrary weight to the searches (steps) in your search strategy making it possible for you to view a ranked list of your results. 

***Sign up to be a EuPathDB feature evaluator:***

We welcome you to *sign up* to be an early evaluator of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process. Signing up is easy. Evaluating will require minimal effort on your part, typically in the form of answering specific questions, testing and rendering your opinion.

We are pleased to announce new features in PlasmoDB 6.3

New Features:

  • The gray tool bar at the top of all PlasmoDB web pages has been enhanced to include a link to the "My Basket" feature including an indication of how may items are in the basket.
  • The genome browser has been updated to version 1.7, which offers enhanced features and stability. 
  • Weighted searches are now available.  This feature allows you to assign an arbitrary weight to the searches (steps) in your search strategy making it possible for you to view a ranked list of your results. 

***Sign up to be a EuPathDB feature evaluator:***

We welcome you to *sign up* to be an early evaluator of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process. Signing up is easy. Evaluating will require minimal effort on your part, typically in the form of answering specific questions, using the feature(s) and rendering your opinion.

Dr. Mao attends a meeting in UK to work with the OBO ontology community in the context of PATRIC data and resources.
show more...
PlasmoDB 4.4 has been discontinued. If there are tools or other functionality that you used on PlasmoDB 4.4 that you need, please contact us to let us know and we will help you regain access to the missing functionality.

Culex pipiens quinquefasciatus as been promoted from sub-species to species, and is now called Culex quinquefasciatus .

In order to reflect this change in the taxonomy, VectorBase has updated all the Culex pipiens (quinquefasciatus) instances to its new taxonomic name, Culex quinquefasciatus.

The name only has been changed, the underlying data remain the same.

The NIAID-funded Eukaryotic Pathogen Genome Database (EuPathDB.org) invites proposals for Driving Biological Projects (DBPs) focused on one or more of the eukaryotic pathogens supported by this Bioinformatics Resource Center (BRC). We anticipate making two awards of up to $600K each by July 1, 2010 (a second round of solicitations is expected in 2012). A two page Letter of Intent (LOI) is due Feb 28, following which full applications will be solicited by invitation only, with a deadline of March 31. Click here to download the RFP.

The NIAID-funded Bioinformatics Resource Center VectorBase invites proposals for Driving Biological Projects (*s) that focus on infectious diseases research related to human microbial pathogens and invertebrate vectors of human diseases.

DBPs must include research that uses high-throughput experimental technologies to functionally characterize the genome, proteome or metabolome to help elucidate how genes, proteins and metabolites may be involved in pathogenesis, antimicrobial resistance or other biological processes of interest in the study of infectious diseases. HTP approaches to functional characterization of genes and proteins may include, for example, microarrays, mass spectrometry, two-hybrid screenings, RNAi experiments, Chip-Seq and RNA-Seq studies and bioassays.

Download the full RFP

Frequently Asked Questions


Release 4.13 February 2010 Issue

Release 1.1 February 2010 Issue

We are pleased to announce that the E-RNAi RNAi reagent design tool has been updated with the latest Anopheles gambiae and Aedes aegypti genomes. Please note, this service is provided by a third party with minor assistance from VectorBase.

We are pleased to announce the implementation a new tool in CryptoDBDB.  The "My Basket" tool allows you to select various genes from your searches and add them to your basket (much like a shopping cart).  To add genes to your basket simply click on the basket icon in a result list or on a gene page.  A green basket icon indicates that a genes is already in your basket.  You can easily convert your basket to a strategy to save it and perform additional searches or combine it with one of your already existing strategies.  Please note that using the basket feature requires you to login first.

We are pleased to announce the implementation a new tool in TrichDB.  The "My Basket" tool allows you to select various genes from your searches and add them to your basket (much like a shopping cart).  To add genes to your basket simply click on the basket icon in a result list or on a gene page.  A green basket icon indicates that a genes is already in your basket.  You can easily convert your basket to a strategy to save it and perform additional searches or combine it with one of your already existing strategies.  Please note that using the basket feature requires you to login first.

The VectorBase developers finally heard about Twitter and set up a @vectorbase account. We'll use it to let you know about any planned or unplanned system maintenance and any new features and datasets at VectorBase. We look forward to your @vectorbase tweets too!

Release 4.12 December 2009 Issue


We are pleased to announce a workshop taught as part of a collaboration between the Pathogen Genomics Group (Wellcome Trust Sanger Institute, UK), the Eukaryotic Pathogen Bioinformatics Resource Center (University of Georgia and University of Pennsylvania, US), and the Seattle Biomedical Research Institute (US). This residential workshop aims to provide experimental biologists with hands-on experience in genomic-scale data analysis, including genome browsers and comparison tools, methods for data integration, and resources for sophisticated data mining. Many examples and exercises will be drawn from the kinetoplastida, including Leishmania spp., Trypanosoma brucei, and Trypanosoma cruzi, although applicants working on other protozoan parasites will also benefit.

!!!!!Application Deadline is December 18th 2009!!!!!!

For registration and additional information (including scholarships) please click here (Note: you will be diverted to and external site).

As usual, if you have any questions please send us an Email.

EuPathDB published in Nucleic Acid Research

We are pleased to announce the publication of a paper providing an update on the EuPathDB genomic resourse.  The paper entitled "EuPathDB: a portal to eukaryotic pathogen databases" is available here as an advance access publication.


This article may be used to cite the TriTrypDB resource in your publications.  To download this reference to your citation manager please click here.

Detoxification genes were assayed for expression differences between two strains of Aedes aegypti: Vauclin, from Martinique, which is resistant to temephos and deltamethrin insecticides, and Bora-bora, which is susceptible to insecticides, in both larvae and adults.

The open access article can be found here, and many of the genes mentioned in the paper now have official VectorBase gene symbols, so you can locate their expression pages easily with the following style of URL: http://funcgen.vectorbase.org/ExpressionData/gene/CYP6M6.

This work was a collaborative effort between the Laboratoire d'Ecologie Alpine (LECA) of Grenoble, the Institut de Recherche pour le Développement (IRD) Montpellier, and the LSTM, Liverpool.

The AnoXcel sheet has been updated with the VectorBase-style Gene Identifiers - it is available from the download page. Please note that we only provide the raw file, there is no possibility to search for information within this file with our search tool.

We are aware the there are some issues with our BLAST tool at the moment.

Since the installation of a newer, faster, application server, the TomCat sotfware (for running Java applications) is not behaving as expected and will sometimes stops completely, preventing any BLAST jobs to be submitted. We are working on this and are trying to figure out what is happening.

Restarting the server is enough to solve the problem (temporarily) so please let us know if you can't run any BLAST jobs by contacting info@vectorbase.org


Dear all,

The PlasmoDB team would like to alert you to two important issues concerning the presentation of P. falciparum microarray data in PlasmoDB. 

1.  Beginning with PlasmoDB 6.0 all profiles using the Winzeler Affymetrix scrMalaria array were updated to reflect the latest 3D7 genome annotation. Previously, probes were mapped to the P. falciparum 3D7 genome annotation released in July 2001.  The array was originally designed with a tiling of probes, so the new annotation has resulted in many genes gaining new probes which did not previously map to the gene, and dropping old probes which no longer map (Note: The new mapping allowed up to two mismatches per probe, and only probes which mapped uniquely, up to two mismatches, were retained).  Previous gene expression profiles were generated using the MOID algorithm with the 2001 annotation, while the current profiles were generated using RMA with the new annotation.  Many of the expression profiles exhibit minor differences and a few changed significantly.  Such differences are expected and should result in a more accurate representation of the data.

2.  An internal error with one the files concerning the P. falciparum 3D7 gametocyte expression data was discovered.  This error appeared with the release of PlasmoDB 6.0 in July of 2009 and was fixed on October 20th, 2009.  Also, please note that probe remapping and MOID vs. RMA analysis differences described above also apply to this data set. 

We regret any disruption this may cause you.  Please do not hesitate to contact us with any questions or concerns.

Sincerely,
The PlasmoDB Team


We are pleased to announce the publication of a paper introducing TriTrypDB.  The paper entitled "TriTrypDB: a functional genomic resource for the Trypanosomatidae" is available here as an advance access publication.


This article may be used to cite the TriTrypDB resource in your publications.  To download this reference to your citation manager please click here.

Community Annotation in Progress

The Toxoplasma research community has been involved in improving the genome annotation.  With guidance from the current ToxoDB community advisers (John Boothroyd, Vern Carruthers, Marc-Jan Gubbles and Michael White), members of the research community (see below) have been asked to take the lead in providing high quality annotation on subsets of cellular and biochemical pathways. The goal is to incorporate the results of this effort in ToxoDB by the end of the year.  

If you have not been contacted yet and feel that you can contribute to one of the pathways mentioned below, please contact us.

  • Micronemes (Vern Carruthers)
  • Rhoptries - Necks and Bulbs (Peter Bradley)
  • Dense Granules (Marie-France Cesbron-Delauw)
  • Chromatin-associated (Kami Kim)
  • Proteases (Dominique Soldati-Favre)
  • Apicoplast (Stuart Ralph)
  • Mitochondrion (David Roos)
  • IMC (Con Beckers)
  • Surface Proteins (Michael Grigg)
  • Bradyzoite (Louis Weiss)
  • Golgi/ER (Markus Meissner)
  • Acidocalcisome (Silvia Moreno)
  • Fatty Acid Metabolism (Frank Seeber)
  • Nucleotide Metabolism (David Bzik)
  • Sugar Metabolism (Stan Tomavo)
  • Glycolipid Metabolism (Boris Striepen)
  • Transcription (Michael White)
  • Translation (Bill Sullivan)
  • Cell Cycle (Marc-Jan Gubbels)
  • Parasitophorous Vacuole and Membrane (Tony Sinai)

The National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), has awarded a 5-year, $27,670,448 contract to the CyberInfrastructure Group (CIG) of the Virginia Bioinformatics Institute (VBI)
show more...
The 7th issue of the VectorBase newsletter is now available. It is accessible on-line and in PDF from the Newsletter Help Page. You can also subscribe to the Newsletter mailing list to receive it directly in your mail box. We announce the release of the Anopheles gambiae and Aedes aegypti gene set updates, as well as two additional expression data set. We also have redesigned the Blast Interface - come and check it out!

We would like to announce to the current and future VectorBase user community that NIAID/NIH has formally renewed the VectorBase contract for another 5 year cycle starting on 1 October 2009. We would especially like to thank the many members of the genomics and vector biology communities who provided letters and email messages endorsing the VectorBase renewal proposal.

Posted by Bob on behalf of the entire VectorBase Team.

We are pleased to announce the implementation of new features in ToxoDB:
  • Now after you perform a BlastN against genomic sequence you can link from the blast output to the correct coordinates in the genome browser.  This allows you to explore annotation and other functional data in relation to your BLAST hits.
  • Upadated product names for the Surface Antigen Glycoprotein (SAG) Related Sequences (SRSs) (provided by Michael Grigg).
  • The type ahead feature has been implemented for GO term, EC number and Interpro domain searches.
  • You can now edit or delete your user comments.
  • You can now modify your user profile, change your password or your email address.

Please also note that the parent portal site EuPathDB has just been updated, offering you the ability to explore results across organisms.

Sincerely,

The EuPathDB team.

VectorBase is happy to present a new BLAST interface. We have changed the way you select datasets for searching to simplify selection. Please, have a look and send us any comments or suggestions using the Help system.

Data updates for ESTs and trace files for all species have been included.


A new Anopheles gambiae gene set, AgamP3.5, is available at VectorBase The update includes the manual appraisal of existing gene models, including the majority of models from chromosome arms 3L and X. More information about the new gene build can be found here. The new gene set also includes an update of the expression and comparative data.

A new Aedes aegypti gene set, AaegL1.2, is now accessible at VectorBase The update is includes gene models which were deemed supplemental to the initial set and models submitted by the community (to learn how to submit your gene models to VectorBase, click here). The new gene set also includes an update of the expression and comparative data.

Anopheles gambiae expression data from two recent publications has been added to VectorBase. The work by Marco Neira Oviedo and colleagues has compared larval salivary gland gene expression to the whole organism. Yury Goltsev and colleagues have measured gene expression during embryonic development and also profiled the cells which form the embryonic serosa (egg shell). See also the forum topic for links and discussion.

We received many user comments during the KCMB meeting and following. Many of these have already been examined by TriTrypDB curators (at GeneDB, SBRI and UGA) and incorporated into the official annotation. 253 genes have been modified since the meeting resulting in a total of 710 genes that have been updated since this current build of TriTrypDB. Click here to see the complete list of genes that have been re-annotated since December 1, 2008.

Annotation of the Kinetoplastida is ramping up again as part of the collaborative efforts of EuPathDB and GeneDB. Annotations are being carried out at the Welcome Trust Sanger Center, the Seattle Biomedical Research Institute and the University of Georgia. Click here to see the list of genes that have been re-annotated since December 1, 2008 when this current build of TriTrypDB was started.

A new experiment profiling the response of detoxification genes to 72h exposure to 2 pesticides and 3 insecticides in Aedes aegypti larvae has been released in VectorBase. The publication is in press and we'll let you have the details as soon as we know them.

Here's a link to Table 2 from the forthcoming paper (Google Docs) with hyperlinks to the VectorBase expression reports.

Just added to VectorBase's download section: tab delimited text files containing gene-averaged expression data for all Anopheles gambiae genes. This data is identical to that shown on the gene expression report pages.

Click on the Get Data menu item to visit the download page. The more human-friendly expression reports can be found via the Expression Data menu link.

Coming soon: more powerful expression data retrieval with BioMart... stay tuned!


We have recently completed Stage 1 of this population genomics database

Stage 2 of this system is being designed to store data for multiple taxa, and we are actively looking for users willing to submit vector population genetics data to VectorBase. To help facilitate this process, we are also developing a Workbench feature that will allow users to manage their own data. A small group of users have already been enlisted, but additional beta users are welcome. If you are interested in submitting data and/or in more information on participating as a beta user for the Workbench please contact us at: popgen at vectorbase dot org

Three new experiments are now available in the VectorBase Gene Expression resource: These are all for Anopheles gambiae but there are experiments for Aedes aegypti in the pipeline.

The 3rd kinetoplastid molecular cell biology meeting will be held from April 26-29th, 2009 in Woods Hole, Massachusetts, USA. For more information click here.

In October 2008 VectorBase initiated a plan for the development of a web-based data repository for information about the population genomics of invertebrate vectors of human disease. This endeavor is being pursued via a subcontract with Dr. Gregory C. Lanzaro, University of California-Davis and Dr. Charles E. Taylor, University of California-Los Angeles. Development of this new functionality of VectorBase will begin with a population genomics database for Anopheles gambiae, now available at VectorBase.

We have recently completed Stage 1 of this A. gambiae database. This first stage includes data for 5,870 individual A. gambiae s.l. collected from 54 sites in Mali and for 3,245 individuals from 51 sites in Cameroon. The data includes species identification (A. gambiae and A. arabiensis), molecular form (M vs. S), chromosomal form (Forest, Bamako, Mopti and Savanna) and karyotype. A Google Maps API and GoogleEarth provide visualization of these data on maps. Full details on this new component of VectorBase are available here.

We are also developing a Workbench feature that will allow users to manage and manipulate their own data for ongoing studies. We have enlisted a small group of beta users for this feature and are currently inviting additional users to participate. If you are interested in participating as a beta user for the Workbench please contact us at: popgen at vectorbase dot org.

Trace files for G. morsitans and R. prolixus are available for BLAST and Download.

The IscaW1.1 gene set for Ixodes scapularis has now been released by VectorBase and GenBank It can be downloaded and browsed from the Ixodes genomic data page. At GenBank, the genes are annotated as features on scaffold entries DS611849-DS981340.

The Insecticide Resistance Data section of VectorBase has been revamped. It can be found here.

We are pleased to report an award from the Bill & Melinda Gates Foundation in support of a TrypDB component of the EuPathDB family of databases, encompassing Leishmania species, Trypanosoma brucei, and Trypanosoma cruzi. A coordinated award from the Wellcome Trust will permit enhanced annotation of kinetoplastid parasites. These projects represent a collaborative effort between the GeneDB team (at the Wellcome Trust Sanger Institute), colleagues at the Seattle Biomedical Research Institute, and the EuPathDB team -- an NIAID-supported Bioinformatics Resource Center. Look for the first release of TrypDB in Jan 2009!

The fifth issue of the VectorBase newsletter is now available (a PDF version is also available). New data include Insecticide Resistance data and traces for Glossina morsitans and Rhodnius prolixus (also available for blasting). Check out our new forum and try our top tip!

Sign up to receive the Newsletter when it comes out, or just browse it from the website. For more details, see the Newsletter Help Page.


We have now integrated a tabular text summary of microarray-based gene expression data on every gene report page (also known as GeneView). Scroll down to the "Gene DAS Report" panel, check the box next to "Gene Expression Summary" and click on "Update". After the page has refreshed you will see a list of experiments where significant differential expression has been observed. Click on the experiment names to see more details of the expression profiles and statistics.

Note that in Aedes aegypti there are many genes without microarray reporters, and therefore data, at the moment, so your mileage may vary. (More data is coming soon, however.) We have no microarray data at all for Culex pipiens yet.


Free microarrays: Please note that the Pathogen Functional Genomics Resource Center (PFGRC) at the J. Craig Venter Institute (JCVI) provides several resources for investigators including microarrays for Giardia lamblia (WB strain, clone C6).

Click here to view a list of available microarrays at PFGRC.

We are pleased to announce the publication of a paper in Nucleic Acid Research describing PlasmoDB.
Please cite this manuscript when referencing the PlasmoDB resource in your publications.

PlasmoDB: a functional genomic database for malaria parasites. Nucleic Acids Res. 2008 Oct 31. Aurrecoechea C, et al.


We are pleased to report an award from the Bill & Melinda Gates Foundation in support of a TriTrypDB component of the EuPathDB family of databases, encompassing Leishmania species, Trypanosoma brucei, and Trypanosoma cruzi. A coordinated award from the Wellcome Trust will permit enhanced annotation of kinetoplastid parasites. These projects represent a collaborative effort between the GeneDB team (at the Wellcome Trust Sanger Institute), colleagues at the Seattle Biomedical Research Institute, and the EuPathDB team -- an NIAID-supported Bioinformatics Resource Center. Look for the first release of TriTrypDB in Jan 2009!

Cassone et al. have performed expression profiling of several field strains of M and S form Anopheles gambiae mosquitoes. Considered across all three developmental periods (presented as separate experiments in VectorBase), differentially expressed genes represented around 1–2% of all expressed genes. One prominent example is this cuticular protein with much higher expression in the M-form compared to S.

We are pleased to announce the publication of the database paper introducing both GiardiaDB and TrichDB in the database issue of Nucleic Acid Research.

GiardiaDB and TrichDB: integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis. Nucleic Acids Research Advance Access published on September 29, 2008, DOI 10.1093/nar/gkn631.


We've added a forum to the site to foster communication between site users and VectorBase associates. Feel free to comment and enquire on all things VectorBase!

A link to the forum is located in the gray section of the navigation menu at the top of every page or follow this link.

The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), of the Department of Health and Human Services (DHHS) supports research related to the basic understanding of microbiology and immunology leading to the development of vaccines, therapeutics, and medical diagnostics for the prevention, treatment, and diagnosis of infectious and immune-mediated diseases.
show more...
The Anopheles Genomes White Paper has been endorsed by the NHGRI and NIAID Human Eukaryotic Pathogens and Invertebrate Disease Vectors Sequencing Target Selection Working Group, and will be assessed later this month by the Coordinating Committee for Sequencing Target Selection. It is available on VectorBase.

The fourth issue of the VectorBase newsletter is now available. Check the new Culex data that are available, play with the improved version of the Gene Expression Resource and try our 'VectorBase Top Tip'! If you wish to receive the newsletter you can sign up or just browse it on the website

A listing of microarrays that are available to Plasmodium researchers has been posted at http://plasmodb.org/common/microarrays/. This list is meant to provide in a single location a summary of the publicly available microarrays targeting Plasmodium genomes (including P. falciparum, P. berghei, P. yoelii, and P. vivax). Also provided are details, including, for some of the arrays, an updated mapping to genes in PlasmoDB.

An update to the Culex pipiens gene set, CpipJ1.2, is now available at VectorBase. Changes from the CpipJ1.1 set involved rectifying some issues with start and stop codons for 87 predictions and the removal of approx. 1,500 predictions which are not thought to represent real gene predictions. These are mostly small 1-2 exon models with no orthologs to any other species. This process involved extensive manual appraisal of prediction in tandem with bioinformatic approaches looking at potential bacterial and transposon-related sequences.

VectorBase encourages community scientists to contribute to the Ixodes annotation effort through submission of their own gene models. Community-generated gene models will be displayed as a separate manual annotation track in the Ixodes Genome Browser. Community annotations will also be considered for incorporation into the final Ixodes gene set, which is targeted for release at Vectorbase and GenBank in a few months. Community scientists wishing to submit manual annotations may do so through the VectorBase Community Annotation Pipeline. An account is required to submit gene models. To sign-up for an account, click the “User” tab in the top-right corner of the VectorBase homepage. To submit a gene model, follow these instructions. For further information regarding submission of community annotations, please contact the Ixodes VectorBase Community Representative, Jason Meyer, at meyerjm@purdue.edu

VectorBase is proud to announce the completion of a comprehensive overhaul of the way we store, analyze and display gene expression data. We now handle Affymetrix data, as well as most other microarray platforms. Several key experiments have been added, including blood meal and adult tissue data from UC Irvine and a larval gut compartment analysis from the University of Florida. The database back end and expert analysis system has been upgraded to the powerful BASE2, while the general user web interface has been improved in all areas. Full documentation is provided on the help wiki, or dive straight in with an example gene expression summary: A. gambiae actin 5C.

The proposal for sequencing of additional Plasmodium isolates and species has been approved. The whitepaper is available here.

Can Ontologies help vector biology? Trends in Parasitology, Volume 24, Issue 6, pp 249-252 (June 2008), the latest 'Trends in Parisitology' opinion paper, is now available.

A Genome Browser is now available to view the preliminary annotation of the Ixodes scapularis genome.

The NCBI has decided to follow the near-unanimous consensus in the taxonomic literature and promoted the subspecies Culex pipiens quinquefasciatus to species level, as Culex quinquefasciatus. The NCBI taxid will remain the same (7176). VectorBase will reflect this taxonomy change some time in the future, but for the moment the web pages and file names at VectorBase will continue to use "Culex pipiens". Gene, transcript and protein identifiers (CPIJxxxxxx), and the assembly identifier (CpipJ), will not be modified.

The third issue of the VectorBase newsletter is now available. Check the new data that are available on the VectorBase site and try our 'VectorBase Top Tip'! If you wish to receive the newsletter you can sign up or just browse it on the website.

Preliminary annotation of the Ixodes scapularis genome has been released as part of a joint venture between VectorBase, The Broad Institute and JCVI. Data is available for download, or for use with our BLAST server. Please see the Ixodes scapularis genome page for more details.

The Ixodes scapularis Wikel strain genome sequence assembly is available from VectorBase. The assembly consists of 570,640 contigs arranged into 369,495 scaffolds yielding a combined assembly length of 1.4 Gb. Both contigs and scaffolds can be downloaded and the scaffolds are available for Blast searching. You can receive updates and get involved through the Ixodes mailing list.

The Culex pipiens gene set is now available in GenBank under the project ID AAWU00000000. The genes and the proteins are available as 'features' via the scaffolds entries - the proteins are also available through the protein section of NCBI. Want to see an entry? Try the accession DS231813 on the NCBI website.

VectorBase now hosts an electronic mailing list for the Ixodes scapularis genome sequencing project. You can find out more about the mailing lists hosted at VectorBase and subscribe to those you are interested in.

VectorBase BioMart has been relaunched for Aedes aegypti and Anopheles gambiae. The data is now up to date and provided in the standard "Ensembl BioMart" format (which you could previously find at biomart.org and ensembl.org). Rest assured that the data has been generated by the VectorBase team - we just use Ensembl technology.

You may now search and retrieve many types of data in VectorBase BioMart, including GO terms, InterPro domains and microarray reporters. Bulk sequence retrieval is also available.

BioMart is available through the Get Data menu.

Transcript datasets for all organisms have been updated as of December 2007 including 194 K Ixodes scapularis and 76 K Culex pipiens sequences. The datasets are available for download and BLAST searching.

By using the Aedes detox chip microarray, researchers from the French Laboratory of Alpine Ecology of Grenoble (R. Poupardin, S. Reynaud and J.P. David), the Liverpool School of Tropical Medicine (C. Strode and H. Ranson) and the Greek Laboratory of Pesticide Science (J. Vontas) compared the expression of candidate detoxification genes in Aedes aegypti larvae exposed to two insecticides and three pollutants commonly found in polluted environments. This study allowed the isolation of genes that are induced by distinct subsets of insecticides and pollutants (e.g. AAEL001807/CYP9M9 is induced by atrazin, permethrin and fluoranthen while AAEL009127/CYP6M11 responds to copper sulfate). Results of this research work will contribute to decode the molecular mechanisms of xenobiotic response in mosquitoes and the possible impact of pollutants on insecticide tolerance. This study will be published shortly in the journal Insect Biochemistry and Molecular Biology as (Poupardin et al., 2008). The expression data is available in VectorBase from today.

The second issue of the VectorBase newsletter is available. Come and check the new data and tools that are available through the VectorBase site! If you wish to receive the newsletter you can sign up or just browse it on the website.

BLAST in ApiDB allows you to blast against: not only any Crypto, Plasmo and Toxo data set available in their websites, but also against the Theileria Genomes, all available apicomplexan ESTs (from NCBI dbEST) and the available EST assemblies that include Cryptosporidium parvum and Toxoplasma gondii.

HMMER Build and Search are now available on VectorBase. This version of HMMER Search takes advantage of MPI code, allowing it to run its search on multiple machines at the same time. With our equipment, speed results are on the order of twenty times faster than the traditional version. All peptides and supercontigs databases are accessible for searching.

Only one VectorBase HMMER Search job will run at a time, additional jobs will be placed in a queue. Depending on it's popularity and the number of databases per search job, the actual amount of time it takes to complete a search may vary. In my tests, running a search against all databases finished after an hour and forty-five minutes.

The sequencing of the Aedes aegypti genome has equipped researchers at the Liverpool School of Tropical Medicine (Dr. Hilary Ranson and Dr. Clare Strode) to design and produce a ‘detox chip’. This chip houses a complete set of cytochrome P450s, glutathione transferases (GSTs) and carboxyl esterases for the yellow fever mosquito.

As insecticides play a critical role in controlling vector-borne diseases such as malaria and dengue, our interest lies in identifying the genes likely to confer metabolic resistance to insecticides. Using the detox chip we have identified a cohort of genes which were over expressed in two insecticide resistant populations originating from Thailand and Mexico. Of particular interest is the over expression of a large number of P450s which belong to the CYP9 (e.g. probe aaeg-detox-2D7) class and members of the epilson GST class (e.g. probe aaeg-detox-1J3) in resistant Aedes. The detox chip will be used to monitor detoxification gene expression in Aedes populations worldwide with the aim of understanding the dynamics of insecticide resistance.

Multiple sequence alignments generated by the ClustalW tool are now piped into the Jalview applet and may be viewed on the results page along with the raw results.

Please check the ApiDB tutorials offered now in three formats: QuickTime (.mov), Microsift Windows (.avi) and Flash Player (.flv). In ApiDB.org the link to these Website Tutorials appears in the home page, at the bottom of the sidebar, under "Information and Help".

Please check the PlasmoDB tutorials offered now in three formats: QuickTime (.mov), Microsoft Windows (.wmv) and Flash Player (.flv). There is a link to these tutorials in PlasmoDB sidebar, in all pages.

The inaugural VectorBase newsletter is available. The newsletter will be published every quarter bringing you an update on new data and tools available through the VectorBase site. If you wish to receive the newsletter you can sign up or just browse it on the website.

Wherever in ToxoDB you can type an Annotated Gene's ID you can now also type the ID of an electronically predicted gene (eg, TgGlmHMM_1045). You will be taken to the associated annotated gene (eg, 27.m00868).

Please check the new TrichDB.org trichdb and GiardiaDB.org giardiadb websites. These soon will be integrated into our portal.

~19 million genomic trace reads and ~370,000 BAC end sequences from the I. scapularis genome project are now available at VectorBase for use with BLAST and for download.

VectorBase has moved to the new AgamP3.4 gene build. It incorporates improved models (including manually annotated genes) and changes gene identifiers from the EnsEMBL naming convention (ENSANG*) to the VectorBase naming convention (AGAP*).

For more information about the new gene build, see this document.

For more information on the new naming scheme, and translating between old and new gene identifiers, see this document.

We are serving the tiling array expression data from Halasz et al. (2006) via DAS in the genome browser. This provides a genome wide view of expression levels in adult male and female mosquitoes. To enable the DAS track, go to your favourite genomic region (example), and click on the "DAS Sources" menu in the "Detailed view" (third image from top), then check the box next to "Tiling Expression". For more information, see this help document.

Please check the new Gene Metrics table with all the organisms with gene information available in ApiDB. A link to it has been added in the home pages of all ApiDB sites. In ApiDB.org the Gene Metrics link appears in the home page, at the bottom of the sidebar, under "Information and Help".

The Aedes aegypti Genome paper has been published in the June 22, 2007 issue of Science magazine. For more information, see Science Magazine's Website.

A preliminary assembly of C. muris was obtained at approximately 4.5 X coverage, and reveals a ~9Mb genome with average cytosine-guanine content (%GC) of ~28.5 %. This assembly is based on ~50,000 reads obtained mostly from a small-insert library and a fosmid library. This assembly is estimated to contain ~ 92% of the ~9Mb genome, and consists of 927 contigs arranged into 595 scaffolds.

Final assembly, to be released in late summer, will include an additional 2X coverage obtained from a medium insert library and possibly >5X additional coverage obtained with 454 pyrosequencing technology. The final assembly is expected to contain > 99% of the genome.

Sequencing from the cDNA library provided by Giovanni Widmer is underway. Average insert length is 1,600 bp. If the library is shown to be random, an expected >30,000 EST reads will be obtained.

More information is available on the project's website (http://msc.tigr.org/c_muris/index.shtml).


Trace reads for the A. gambiae Pimperena (S molecular form) and Mali-NIH (M molecular form) WGS projects are now available for download and for use with BLAST.

The P. humanus PhumU1.1 gene build is now available at VectorBase. For more details, see the P. humanus Genome Homepage for more details.

The C. pipiens quinquefasciatus genome, CpipJ1 assembly is now available at VectorBase. See the C. pipiens Genome Homepage for more details.

The preliminary C. pipiens quinquefasciatus gene build, CpipJ1.0_5 assembly is now available at VectorBase. See the C. pipiens Genome Homepage for more details.

ReAnoXcel is a database of the conceptually translated proteome of A. gambiae developed by Dr. Jose Ribeiro (NIAID) using gene models determined by Ken Vernick's (University of Minnesota) annotation pipeline.

The BioMart query builder and data retrival system has been updated to version 0.5.

The VectorBase Genome Browser has been updated to be concurrent with Ensembl version 43 Genome Browser.

BLAST sequence fetching from the Interactive BLAST Results pages has been rewritten to provide faster result retrieval.

A 0.5 Gene Build for P. humanus is now available for download and for use with BLAST. The VectorBase and TIGR 0.5 gene builds will be merged to form the P. humanus 1.0 Gene Build.

February, 2007 The PFGRC announces that it will conduct a four day course titled, "Introduction to Microarray Technology", April 16-19, 2007.

The NIAID sponsored Pathogen Functional Genomics Resource Center at TIGR announces that it will conduct a four day course titled, "Introduction to Microarray Technology." The workshop will be held on the TIGR campus on April 16-19, 2007. The course will provide an overview of the two-dye spotted microarray platform. The course is designed for hands on participation and attendees will actively learn to make the most of their microarrays through a combination of wet lab and bioinformatics sessions.

The wet lab portion will be taught by PFGRC staff who are directly involved in microarray production. Participants will be guided through probe preparation and hybridization protocols. The training will also consist of detailed discussions on dye coupling, slide pre-hybridization, hybridization, and scanning techniques. Other topics for discussion will include the importance of assessing source material and probe quality, comparisons of various slide surfaces, and printing techniques.

The bioinformatics portion of the course covers an introduction to the stages of microarray data analysis. Major topics include data management, image analysis, normalization, and data mining. Fundamental data analysis concepts will be taught using a mix of lectures and interactive hands-on sessions. Participants will analyze the microarrays they prepared, using the open-source TM4 software suite while working closely with the software developers. CDs containing microarray datasets and the TM4 suite will be provided.

The course will be provided free of charge. Attendees will be responsible for their travel and lodging. This introductory course is one of several offered by the PFGRC on variety of functional genomics topics. Attendance will be limited to 14 participants.

Interested participants will find the preliminary course agenda here, and registration information here.


End sequences of A. aegypti BAC Clones are available in the Downloads Section.

This updated home page for ApiDB.org provides the same functionality (queries and tools) but it adds a new Genomes table with key information.   Also a new link to Data Documentation has been added on the sidebar under "Help and Information".   Please send any comments and questions to ApiDB.

TIGR has released the preliminary P. humanus genome assembly and accompanying gene build. Data is available for download and for use with BLAST. For more details, see the P. humanus home page.

The A. gambiae Gene Tool has been updated to be current with Version 42 of the VectorBase / Ensembl Genome Browser. Access it from the A. gambiae home page.

EST datasets for all organisms have been updated to include all ESTs available in GenBank as of January, 2007.

A new homeage for the Body Louse, P. humanus is now available on VectorBase at http://phumanus.vectorbase.org.

An Invitation and Background Information regarding the Rhodnius Genome Sequencing Project

An international effort is underway that promises to expand and enrich research on all aspects of reduviid biology, from their role as vectors of Chagas disease to fundamental basic and evolutionary biology. We are sending out this email to welcome all scientists who have an interest in and/or are working on Rhodnius prolixus or other reduviids to participate in and contribute to this effort. As elaborated on below the total genome sequencing of Rhodnius prolixus is underway and the initial stages of planning for long term residence, curation and annotation are under consideration. VectorBase at the University of Notre Dame, where the mosquito genomes are being housed has kindly set up our list server to assist us in facilitating broad communication and interaction within the community. We are grateful to its director Frank Collins for his support and willingness to assist us.

Our invitation is for all researchers in the reduviid or related fields to be involved. This is in keeping with our intent to be inclusive, so we welcome any individuals who may have unintentionally been left off the current list to be added. You are receiving this email because your name was included in the initial list which was compiled from various inputs. Please inform us of any others you feel should be added. If you wish not to be included please let us know.

With this preamble, the following provides some general background and information on the Rhodnius Genome Project - how it arose, some of the activities up to now and some plans and hopes for the future.

In recent years it has been widely recognized that there was a need to have increased genome sequencing information on reduviids (for example the Triatomine Genome Initiative in 2003 of scientists from the MERCOSUR countries held in Montevideo). A concerted effort was undertaken with the submission of a "White Paper " proposal " The case for sequencing the genome of the blood-feeding Hemipteran insect Rhodnius prolixus " to the NIH Human Genome Research Institute in the fall of 2004. This proposal resulted from input from a number of researchers world-wide and was coordinated by Erwin Huebner (Canada). A copy of the White Paper can be found on the NIH Human Genome web site. This application was successful (selected spring of 2005) and the Genome Sequencing Center at Washington University Medical School , St. Louis was selected to carry out the large scale genome sequencing.

An organizational meeting was held Nov. 3 -5, 2005 in Rio de Janiero with participants from Canada, USA, Brazil, Argentina and Uruguay. At this meeting a Steering Committee to oversee and coordinate the Genome Project and work with John Spieth ( Washington U Genome Sequencing Center) to accomplish the goals of the White Paper. The Steering Committee consists of Erwin Huebner (Coordinator), Antonio Bernardo de Carvalho , Ellen Dotson, Carl Lowenberger, Fernando Monteiro, Pedro Oliveira, Francisco Panzera and Rolando Rivera Pomar. Other sub-committees were also established which will become more active once the genome project progresses further.

An important issue that arose in the initial consideration of which colony should be the source of DNA for the sequencing was that a survey of some of the longstanding lab colonies showed that they were in fact a hybrid of prolixus and a closely related species Rhodnius robustus. It became important to genotype as many as Rhodnius colonies as possible in order to select a strain that was not of hybrid origin or too heterogeneous. Ellen Dotson and Fernando Monteiro played a key role in this and what has emerged is that the colony at the CDC in Atlanta is indeed prolixus and the best option for the sequencing. The Steering committee in discussion with John Spieth has selected that colony as the source for the full scale sequencing.

The plan is also to sequence as many ESTs as possible. Rolando Rivera Pomar has been conducting a survey to catalogue what EST libraries and the vectors used exist within the reduviid research community. If you have not already responded to him or been contact yet please email him and John Spieth any information you may have. Francisco Panzera has provide valuable information on genome size also important at this stage.

The Steering Committee has also discussed the possibilities for the permanent residence, curation and updating of the data base once the genome is sequenced and assembled. Frank Collins at VectorBase has indicated they are willing to assist the Rhodnius community, provide advice and help and will provide a home for the Rhodnius Genome data base. It has been agreed by the Steering Committee in discussion with Frank that the genome will reside at VectorBase. In addition to and in concert with the assembly and automated annotation that will be done at the Washington U Genome Center VectorBase will run automated annotations as was done with the mosquito genomes. VectorBase will provide long term curation and maintenance once the Genome Center has released it and submitted sequence data to Gen Bank. The research community will have access to the data base at VectorBase and will play an important roll in the essential mechanical annotation to fully identify genes correctly. It is important that as many researchers as possible be prepared to contribute to the annotation process. We invite everyone who has an interest and wishes to please plan to participate in the annotation of genes in your area of interest, gene families etc. Once the genome assembly has been completed, auto annotation done and the subsequent community wide annotation is needed it will be important to identify one or two persons to serve as annotation coordinators to work with the community and VectorBase.

The establishment of the List Server at VectorBase provides us an excellent mechanism to foster communication within our community, disseminate information and stimulate new interactions and research in reduviid genomics and proteomics. There are excellent opportunities ahead and we hope you will join us in these exciting times as we bring this project to fruition. Your input, ideas, suggestions and participation are welcomed.

We are planning a meeting in Salvador, Brazil, May 19 - 20, 2007 immediately before the SBBq/IUMB Meeting (May 21-25) to discuss Triatomine Biology and the Genome Project Progress and Plans. Please contact Pedro Oliveira, who is coordinating this conference for detail.


Submitted by Erwin Huebner (Coordinator of the Rhodnius Genome Project) on behalf of the Steering Committee.

The second annual ApiDB Workshop will be held June 3-6, 2007 at the University of Georgia, Athens, GA, USA.

The workshop will be a 3.5 day introduction to apicomplexan database resources that are part of the ApiDB Bioinformatics Resource Center ( ApiDB.org, PlasmoDB, ToxoDB, and CryptoDB ).

The application deadline is January 19, 2007 (Application). Please click here for further information and to register.


The second annual ApiDB Workshop will be held June 3-6, 2007 at the University of Georgia, Athens, GA, USA.

The workshop will be a 3.5 day introduction to apicomplexan database resources that are part of the ApiDB Bioinformatics Resource Center (ApiDB.org, PlasmoDB.org, ToxoDB.org, CryptoDB.org)

The application deadline is January 19, 2007 (Application). Please click here for further information and to register.


Dear Colleagues,

We are pleased to announce a batch of major data and functional updates at the VectorBase website, including gene expression data and manual annotations for the entire A. gambiae 2L chromosome arm. Please read on for more details.

Organism & Data Updates

New Culex pipiens Homepage
Culex pipiens quinquefasciatus has a new homepage at http://cpipiens.vectorbase.org. Genome data and assembly updates will be posted on this page as assembly nears completion.
Organism Datasets Updated
The first full gene set for Aedes aegypti (AaegL1.1) was released in June 2006. The latest gene set for Anopheles gambiae (AgamP3.3, February 2006) is also available. EST and Trace datasets for all organisms have been updated as of October 30th and are available for download and for use with BLAST. All current VectorBase gene sets are also indexed at NCBI.
A. gambiae Gene Tool Available
The AnoBase Gene Tool is now available for use at VectorBase and is accessible from the Anopheles gambiae homepage. The Gene Tool provides a full summary of gene meta-data on a single page.
A. gambiae BAC Hybridization Images Available
Images of BAC hybridizations in Anopheles gambiae are now accessible from the Anopheles gambiae homepage. Data is searchable by BAC name and Locus.
A. gambiae manual 2L Gene Models Available
Manual annotation of the 2L chromosome arm in A. gambiae is now complete! Pending incorporation into the next A. gambiae gene build, all new manually curated gene models have been made temporarily available in the genome browser via DAS. To activate, select 'Manual Annotation' under DAS sources when in the Contig View of the Genome Browser.
Gene Expression Data Submission Available
Several microarray experiments have been added to the VectorBase gene expression repository. To submit your data to ArrayExpress via VectorBase, please visit http://www.vectorbase.org/ExpressionData/. Two experiments are currently published and publicly available through VectorBase:
  * Anopheles gambiae: Vlachou et al. midgut invasion
  * Aedes aegypti: Sanders et al. SINV infection

Functionality Updates

VectorBase Genome Browser and Genome Data updated based on Ensembl V.41.Updates include:

Inter-species comparison data Available
» Ortholog mapping between Aedes aegypti, Anopheles gambiae, and Drosophila melanogaster genes, plus gene sets from two external organisms: Homo sapiens and C.elegans.
» DNA/DNA comparison (tBLAT) between Aedes aegypti, Anopheles gambiae, and Drosophila melanogaster genomic sequence.
Gene Ontology Associations
» GO Term mappings from UniProt to organism genes.
» GO Term mappings from Drosophila orthologs.
Gene expression (microarray) data integration
» Genome browser contigview (click on the dark green microarray features):
http://agambiae.vectorbase.org/Genome/ContigView/?l=2L:36678503-36682502
» VectorBase gene page with links to expression data in the "Similarity matches" section:
http://agambiae.vectorbase.org/Genome/GeneView/?gene=ENSANGG00000007634
http://aaegypti.vectorbase.org/Genome/GeneView/?gene=AAEL000231
BioMart Initial Testing Release Available
An initial version of the VectorBase BioMart database has been released to allow data mining of genomic data through the VectorBase website. BioMart is a query-building tool designed to make it possible to filter and retrieve data such as genes, exons, and proteins in a variety of formats.
BLAST Functionality Improved
Specialized displays of results are now available for specific target databases. Results of BLASTs against genomic sequence databases can now be viewed directly in the genome browser. Additionally, sequences hit identified by a BLAST query can be downloaded or used as input for the VectorBase ClustalW service. BLAST is available in the Tools section.
ClustalW Available
The multiple sequence alignment tool ClustalW is now available for use at VectorBase. Users may upload their own sequences or pass in input directly from VectorBase BLAST results. ClustalW is available in the Tools section.

As always, feedback is much appreciated! Please don't hesitate to contact us with questions or comments at help@vectorbase.org - we'd love to hear from you.

Thanks,
The VectorBase Team

The Cryptosporidium muris genome sequencing project has been funded by the National Institute of Allergy and Infections Diseases (NIAID), through the Microbial Sequencing Center program at the Institute for Genomic Research (TIGR). This project aims at obtaining a high-quality assembly of C. muris genome based on 8X sequence coverage, as well as the comparative automated and manual curation of gene structure predictions of all three Cryptosporidium genomes sequenced to date. More information is available on the project's website (http://msc.tigr.org/c_muris/index.shtml). All sequence reads and traces will be deposited with GenBank and the genome assemblies and annotations will be deposited with GenBank and CryptoDB.

Public EST data for all organisms has been updated and is available for use with BLAST and for download.

An initial release of the BioMart data mining tool is now available at VectorBase. BioMart allows you to build complex queries based on gene, transcript and protein characteristics. Access BioMart under Get Data->Data Mining.

Images of BAC In situ hybridizations are now online and searchable under Images->BAC Hybridization Images. You can find images by searching by BACs name or chromosomal region using the VectorBase search engine.

A new homepage for all Culex pipiens quinquefasciatus data can now be found at http://cpipiens.vectorbase.org. Please send bugs or errors to the VectorBase Webmaster.

All trace reads available at the NCBI Trace Archives as of September, 2006 are now available for download and for use with the VectorBase BLAST service.

All trace reads made available at NCBI as part of the C. pipiens Genome sequencing project are now available at VectorBase for download at VectorBase and for use with our BLAST service. Take a look at the C. pipiens homepage for more information.

The WU-BLAST programs supporting the CryptoDB BLAST tool were upgraded on 6 July 2006.

The current version is:
2.0MP-WashU [04-May-2006] [linux26-i786-ILP32F64 2006-05-09T12:19:58]

(The former version was 2.0MP-WashU [16-Mar-2005] [linux26-i686-ILP32F64 2005-03-16T08:14:50] )


An updated version of BLAST is now online at VectorBase! You can now sort your results by hit-specific properties, download result hits, and view alignments directly in the A. gambiae & A. aegypti genome browsers. Feedback / Suggestions are of course welcome.

The A. aegypti 1.0 Genebuild, created from a merge of the TIGR and VectorBase 0.5 annotation sets, is now available at VectorBase for download and for use with our BLAST server. These data are also available from NCBI and TIGR.

9+ million trace reads produced as part of the I. scapularis genome sequencing project are now available for download and for use with VectorBase's BLAST service.

Data pertaining to I. scapularis has been moved from the "Additional Organisms" section to it's own homepage, now available at http://iscapularis.vectorbase.org. Genomic trace reads will also be available shortly.

Jose Ribero's AnoXcel spreadsheet is now availble at Other->AnoXcel.

The offices and datacenter of CryptoDB.org are moving to new facilities and will need to temporarily suspend services. We apologize for the inconvenience.

Jose Ribero's AegyXcel spreadsheet is now availble at Other->AegyXcel.

The P. berghei and P. chabaudi genomes are now incorporated in PlasmoDB 5.0 Beta. The P. berghei genome is accompanied by transcript and protein expression data. We have also continued to make progress migrating functionality from PlasmoDB 4.4 to PlasmoDB 5.0 Beta

The PlasmoMAP project provides computationally generated functional interactions between P. falciparum proteins. This data is now available as a PlasmoDB query.

The Aedes aegypti home page and genome page have been updated to provide a more consistent look across the website. New genomic statistics are available as well.

The Anopheles gambiae genome assembly and gene build has been updated and is now available. The genome assembly, dubed assembly AgamP3, has undergone several changes to remove duplicate / haplotype regions from the genome. Additionally, 140+ additional scaffolds have been maped to chromosomal locations. Details about the new assembly can be found in A.gambiae:Docs->Current Assembly.


A new gene build to is also now available based on the new AgamP3 assembly that is viewable in the A. gambiae genome browser.

The Anopheles gambiae home page and genome page have been updated to provide a more consistent look across the website. New genomic statistics are available as well.

We are pleased to announce to the Plasmodium community Version 5.0 Beta of the Plasmodium Genome Database. PlasmoDB 5.0 Beta had an early release at the Vivax Malaria Research conference at TIGR on December 9, 2005, and is now available for community use. Please see PlasmoDB 5.0 Beta released

Clustering and alignment of all public Aedes aegypti ESTs has been completed and is availiable as a DAS Track in the Aedes genome browser or in the EST data section.

Basic BLAST services for use with VectorBase data have been put online. Try it out by clicking on Tools->BLAST or by clicking here. Coments about the interface or results formats are welcome.

Preliminary annotation of the Aedes aegypti genome has been released as part of a joint venture between VectorBase and TIGR. Data is viewable in the Aedes aegypti Genome Browser, available for download, or for use with our BLAST server.


Please see the Aedes aegypti genome page for more details.

GBrowse is an interactive generic genome browser developed as part of the GMOD project (www.gmod.org). GBrowse is used in PlasmoDB to display gene models, EST alignments, and synteny relatonships between Plasmodium species. With GBrowse, we now provide:
  • Custom restriction site identification
  • Open reading frame identification
  • Custom downloads in various formats:
    • FASTA
    • GFF (2)
    • GFF3
    • BSML (XML)
    • EMBL
    • GenBank
    • GAME
    • GCG

PlamoDB 5.0 Beta has been upgraded to use the latest Genomics Unified Schema (GUS) database system and the new Web Development Kit (WDK). GUS is an open source project (www.gusdb.org) providing a functional genomics database schema covering sequence annotation, expression data, regulatory interactions, and proteomics using controlled vocabularies and ontologies. GUS also provides software to load data (pugins) and set up a web site (WDK). GUS has been adopted by several groups covering organism ranging from bacteria, protozoa, plants, and mammals.

We are pleased to announce Version 5.0 Beta of the Plasmodium Genome Database. We have incorporated into PlasmoDB the new release of the Plasmodium vivax genome and annotation generated by Jane Carlton's group and the sequencing center at The Institute for Genome Research. We have also used this release to change the way we create the web site and underlying database that should make it easier for us to generate new PlasmoDB releases and be consistent with other ApiDB projects such as CryptoDB and ToxoDB. As a result, we have focused on providing access to P. vivax through the PlasmoDB 5.0 Beta site but have not completely reproduced all the functions in PlasmoDB 4.4. Therefore we have kept the PlasmoDB 4.4 site fully active for continued use. Furthermore we have provided a mechanism to take the results of a PlasmoDB 4.4 query and use it in PlasmoDB 5.0 Beta.

PlasmoDB 5.0 Beta features:

  • A new interactive genome browser, based on the GBrowse module of the GMOD project, providing rapid visualization of the parasite genome and gene models, custom restriction site identification, open reading frame identification, and downloads in various formats.
  • The complete genome sequence for P. vivax Salvador, along with a complete set of annotated gene models, and syntenic relationships between P. falciparum, P. vivax, and P. yoelii (from Jane Carlton, TIGR).
  • Updated P. falciparum annotation, including curated literature citations; updated GO term curation from Andy Berry, Matt Berriman, and the Sanger curation effort; updated EC/pathway mappings from Hagai Ginsburg.
  • TIGR Gene Indices for P. vivax, P. falciparum, and P.yoelii.
  • Corrected expression profiles from glass-slide arrays for P. falciparum 3D7, HB3 & Dd2 (from Manuel Llinas, Zybnek Bozdech & Joe DeRisi).
  • Updated structural data, including information from the structural genomics consortia.
  • Protein-protein interaction data for P. falciparum, based on a genome-wide yeast 2-hybrid analysis (LaCount et al. Nature 438:103; 2005).
  • New ortholog analyses, using the OrthoMCL algorithm to group genes from several Plasmodium species with other complete eukaryotic and prokaryotic genomes (Chen et al. Nucl Acids Res, in press).

Finally, we are pleased to report a new CD-ROM version of the Plasmodium Genome Database. PlasmoCD provides users lacking reliable high-speed internet connections with full access to the complete genomes of P. falciparum, P. vivax, and P. yoelii, along with gene models, annotation, BLAST hits, ortholog results, expression data, and search functions for querying and integrating these results across species. PlasmoCD 4.1 can be downladed from the PlasmoDB web site, or requested by sending e-mail to <help@PlasmoDB.org>.


Yeast two-hybrid data for P. falciparum described in LaCount et al. Nature 438:103; 2005 has been provided by Doug LaCount (Purdue University) for inclusion into PlasmoDB. Interaction data can be seen on individual gene pages and used to search for interactors of a specificed P. falciparum Gene ID.

OrthoMCL is an algorithm developed to identify orthologs and recent paralogs based on whole genome comparison of protein sequences (Li et al Genome Res. 13:2178; 2003). Previously, OrthoMCL groups were based on comparison of only 16 species in PlasmoDB 4.4. In the current release, OrthoMCL groups were generated with 61 species. These species include the 55 used to generate orthologs groups at the OrthoMCL DB site plus P. vivax, Leishmania major, T. brucei, T. cruzi, Apis mellifera (honeybee) and S. purpatus (sea urchin). The OrthoMCL DB site is described in Feng et al. Nucleic Acids Res. 2006 Database Issue.

We've updated the Aedes aegypti homepage to give it more structure, hopefully a little cleaner, and make it consistent with the Anopheles gambiae homepage.

A document describing the post sequencing roles of the Microbial Sequencing Centers and VectorBase has been posted in the Aedes aegypti documents section.

The Microbial Sequencing Centers of the Broad Institute and The Institute for Genomic Research are pleased to announce the public release of a new assembly for the genome sequence of Aedes aegypti. As the result of increased sequence coverage the overall quality of the sequence, both in terms of coverage and continuity, has been vastly improved. The total size of the contigs in this assembly is 1.3 Gb, and the average coverage within the assembly is now 7.6 fold.

For more information, look at the Aedes aegypti genome page.

C. hominis gene Chro.rrn021 is annotated as having two exons but in CryptoDB release 3.0 this gene is not returned in the query 'Genes With Exon Count'. This error will be fixed in the next release.

The expression profiles obtained from Joe DeRisi's group (UCSF) are processed by the PlasmoDB team for generating the images on the expression pages and for use in the expression profile queries. The orginal HB3 data were processed without background subtraction while the more recent 3D7 and DD2 data were processed with background subtraction. These have now all been processed consistently in the following manner.

Raw data were loess normalized using the default loess implementation in the R package for microarray analysis (marray). The non-background corrected foreground mean was used as input to the loess algorithm. Replicate arrays for the same time point were then averaged to give one value for each timepoint/gene. Data was then smoothed using a least squares fit to each data point and its two neighbors on each side.

The profiles seen in PlasmoDB 4.4 have also been corrected. The correlation table that gives correlations between the different studies (the three DeRisi and the two Winzeler) has been recalculated based on the new profiles. This new data is not yet displayed on 4.4 but will be on 5.0 Beta.


CryptoDB is one of NIAID's Bioinformatics Resource Centers (BRC) for Biodefense and Emerging or Re-Emerging Infectious Diseases. As part of our efforts for interoperability among the other BRCs and the research community in general we have released the published Cryptosporidium genome features in standardized GFF3 format. The files may be downloaded via FTP from ftp://ftp.tigr.org/pub/brc/ApiDB/Cryptosporidium/.

The home page for the Anopheles gambiae has been redone to provide quicker links to organism-specific data and news. Updates to contact information will also be made this week.

Javascript drop down menus in Ensembl have been fixed so they now work properly with most incarnations of Internet Explorer, Mozilla variants, and Safari. Older browser (Netscape 4, IE for Mac) support may be iffy...

On Wed., June 8, 2005, the National Human Genome Research Institute (NHGRI), one of the National Institutes of Health (NIH), announced today that the Large-Scale Sequencing Research Network will target 13 more organisms as part of its ongoing effort to produce genomic data that will expand biological knowledge and improve human health. Among those to be sequenced are the \"M\" and \"S\" strains of the Anopheles gambiae.

You can read the full press release at: http://www.genome.gov/15014493

A quick \"Getting Started\" page has been put up in the documents sections. Move the mouse over the red circles to look at some of the basic features of VectorBase. E-mail me (rbruggne@nd.edu) if there are any problems.

It's now possible to do some pretty basic searches of the Anopheles genome, including associated images. Much of the functionality is similar to Ensembl, although the proprietary Altavista syntax doesn't work.

Added a new web front end enabling administration of news items from the user homepage.

Support for detailed news entries has been added to the VectorBase Website.

New documentation detailing methods involved in creating the next assembly of the Anopheles gambiae genome have been posted. Look at the A. gambiae specific documentation for more details.

Images of in situ hybridizations of BAC clones previously stored at AnoBase have been imported into VectorBase and associated with genomic data.

Initial setup of the primary VectorBase infrastructure hardware has been completed.

The initial VectorBase website has been put online for testing purposes.

E-Mail services have been set up to provide mass e-mailing and automatic archiving of VectorBase related correspondance.

Representatives from each of the VectorBase subcontracting organizations met in Crete for role discussion.