|
|
|
|
| | | |
News and Events Announcements
Click for story highlights and resource links.
show more...
The Pathogen Portal team at the Virginia Bioinformatics Institute (VBI) has announced the release of Mouse Model Strain Selection Guide to help infectious disease researchers select good mouse models for their studies. The Pathogen Portal team has worked with Dr. Carol Bult at the Jackson Laboratory (JAX) to develop this summarized mouse model selection table for the NIAID category A-C priority pathogens. Highlights of this release include:
- The guide provides the list of resistant, susceptible and humanized mouse models used in the infection studies.
- Pathogens (bacteria, eukaryotic pathogens and viruses) are linked out directly to their corresponding BRC websites (PATRIC, EuPathDB, ViPR and IRD).
- Mouse strains selected for the pathogen infection studies are linked out to the JAX mice database for detailed mouse strain information and purchasing information (if any).
- The PubMed and Mouse Genome Informatics (MGI) references are provided.
The Benjamin Franklin Award for Open Access in the Life Sciences is a humanitarian/bioethics award presented annually by Bioinformatics.org.
show more...
“Bioinformatics.org has announced the finalists in the 2012 Benjamin Franklin Award.
The Benjamin Franklin Award for Open Access in the Life Sciences is a humanitarian/bioethics award presented annually by Bioinformatics.org an individual who has, in his or her practice, promoted free and open access to the materials and methods used in the life sciences. Winners are chosen by voting members of Bioinformatics.org.
The 2012 winner will be announced at the Tenth Anniversary Bio-IT World Conference and Expo on April 25 in Boston. For more details, visit http://www.bio-itworldexpo.com/. “
Workshop entitled “PATRIC: A resource for infectious disease research. Real-life examples used to drive software development”.
show more...
The workshop was hosted by Texas A&M Health Science Center . Topics included:
What PATRIC Offers:
- Consistent annotations across all sequenced bacterial species from GenBank.
- Searches based on taxonomy, gene name, locus tag, protein function/families, pathways, EC numbers, GO terms, and more.
- Data and analysis results are free and typically downloadable.
- Organism summaries; pre-sorted PubMed articles; interactive phylogenetic trees; virulence and disease information, and experimental data including, transcriptomics, proteomics, protein structure, and protein-protein interactions.
- Numerous interactive visualizations for genome browsing, multi-genome comparisons, pathway comparisons, protein family comparisons, phylogenetic trees with coupled MSAs, 3D protein structures, and feature group comparisons.
For event details see Texas PATRIC workshop flyer.
If you have a suggestion for a future workshop topic and/or location, please send it to us via the “contact us” link at the bottom of this page.
To improve the annotation of the Mycobacterium tuberculosis genome sequence and increase the value of this community resource.
show more...
A first-pass assembly and annotation of P. yoelii yoelii YM is now publicly available on GeneDB. The assembly was made
using short-fragment Illumina reads and scaffolded with 454 paired-reads from a 3kb library. The assembly was
computationally improved using PAGIT (http://www.sanger.ac.uk/resources/software/pagit/) and annotated by transfer
from the P. berghei ANKA reference. Genes in subtelomeric regions were predicted ab initio. The annotation has been
manually re-inspected genome-wide.
P. yoelii yoelii YM can be viewed and searched on GeneDB. EMBL and GFF format files can be downloaded from the
following FTP site:
ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/yoeliiYM/version_1/January_2012
A publication describing the new sequence is in preparation and permission should therefore be obtained before undertaking
genome-scale or whole chromosome wide analyses.
This work was funded by the Wellcome Trust Sanger Institute and the European Union's 7th Framework Programme (FP7/2007-2013)
under grant agreement no 242095. The DNA was provided by Tony Holder (Division of Parasitology, MRC National Institute for Medical
Research, Mill Hill, London).
Dear all,
VectorBase is pleased to announce that Nucleic Acid Research has accepted its paper for publication in the 2012 Database issue. The article describes improvements since the last NAR publication, in 2009.
Check it out here.
From now on, please use this paper to cite us!
A new assembly of the P. falciparum IT genome is now available,
replacing the version 1 sequence that has been available via PlasmoDB
(since release 8.0) and the Sanger FTP site. The version 1 sequence was
produced by "correcting" (or morphing) the reference 3D7 sequence,
using aligned Illumina reads from IT. However, in version
2, the sequence data have been assembled de novo. A key aim for this
genome project is to fully assemble the subtelomeric regions. Although
we were able to correctly assemble some subtelomeric sequences onto the
?core? of the genome (i.e. the interstitial or housekeeping region),
many subtelomeric sequences remain problematic and are being actively
refined. The annotation is automated and no manual curation has taken
place.
The new version contains new systematic identifiers compliant with
those now in use for 3D7. However, old identifiers remain in the
database as fully searchable synonyms. The new version can be
viewed and searched on GeneDB and will be loaded into PlasmoDB in early
2012. EMBL and GFF format files can be downloaded from the
following FTP site:
ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/falciparum/IT_strain/version_2
This work was funded by the Wellcome Trust [grant number
WT085775/Z/08/Z] and the European Union's 7th Framework Programme
(FP7/2007-2013) under grant agreement no 242095.
A preliminary 0.5 data set for Rhodnius prolixus is included in the VB-2011-10 release:
At the present time, the following data are available:
- repeats (Dust, TRF, known repeated elements publicly available)
- alignments to 9,665 Rhodnius prolixus Expressed Sequence Tags (ESTs) from dbEST mapped to the genome
- alignments to 792,023 sequences from 2 Rhodnius prolixus RNA-seq datasets mapped to the genome
- alignments to 143,631 proteins from UniProt, grouped into 7 taxa, mapped to the genome
- 18,426 Genscan and 65,946 SNAP ab intio gene models
- 15,878 gene models from the Lagerblad lab
- consensus gene set with 20,976 genes and 32,624 transcripts built with the Ensembl gene build pipeline
Be aware that the gene, transcript and peptide identifiers are not stable and there is no plan to maintain them.
They have not been indexed as part of the site-wide search.
Note that VectorBase provides a BLAST server to compare your sequence against the Rhodnius prolixus supercontigs.
Gene annotation is available in GFF3 and GTF for download, as are the transcripts, peptides, supercontigs and ESTs.
An enhanced genebuild will follow in a future VectorBase release. We aim to reduce the high number of transcripts.
The Glossina morsitans 0.5 gene set is now available!
This is our first gene set was generated by MAKER. Input gene models are issues from species specific EST and RNA-seq alignments, Metazoa protein alignments, and ab initio programs. The community is currently assessing it and we hope to come up with a new, improved, gene set soon!
While this gene set is not downloadable, it is accessible on the genome browser or for blasting.
Note that this is a TEMPORARY gene set, it is not searchable and won't be archived. We are not planning on maintaining identifiers and recommend using locations as well as identifiers to identify genes
Happy Glossina browsing!
The Rapid Annotation using Subsystems Technology (RAST) annotation program was developed in 2007 and is currently utilized and funded via PATRIC.
show more...
The Rapid Annotation using Subsystems Technology (RAST) annotation program was developed in 2007 and is currently utilized and funded via PATRIC.
Read full article on RAST German E. coli Strain Annotation Using Cloud Computing.
VectorBase is pleased to announce the VB-2011-10 release. Highlights of this release include:
- Several major new microarray experiments added for Anopheles gambiae and Aedes aegypti
- Flat file expression downloads and an expression map are now available for Aedes aegypti.
- Improvement to the Rhodnius prolixus gene set and presentation of Pedro Lagerblad/Jose Ribeiro's prediction set.
More detailed release notes are available on the Wiki.
Held at Virginia Bioinformatics Institute in conjunction with the Genetics, Bioinformatics, and Computational Biology (GBCB) Seminar Series.
show more...
What PATRIC Offers:
- Consistent annotations across all sequenced bacterial species from GenBank.
- Searches based on taxonomy, gene name, locus tag, protein function/families, pathways, EC numbers, GO terms, and more.
- Data and analysis results are free and typically downloadable.
- Organism summaries; pre-sorted PubMed articles; interactive phylogenetic trees; virulence and disease information, and experimental data including, transcriptomics, proteomics, protein structure, and protein-protein interactions.
- Numerous interactive visualizations for genome browsing, multi-genome comparisons, pathway comparisons, protein family comparisons, phylogenetic trees with coupled MSAs, 3D protein structures, and feature group comparisons.
Event includes free pizza dinner. For registration information, agenda, and location details please see PATRIC VT Workshop Promo.
If you have a suggestion for a future workshop topic and/or location, please send it to us via the “contact us” link at the bottom of this page.
Since the publication of the genome in 2002, work has continued at the Sanger Institute to produce a single base-perfect P. falciparum
reference genome. This new sequence version is an upgrade from version 2.1.4 to version 3 and includes some substantial changes. Barring some highly
AT-rich tracts that remain to be fully verified, we expect this to be the last major sequence update.
Differences include the correction of major mis-assemblies on chromosome 7 and 8 – one end of chromosome 7 now replaces one end of
chromosome 8 (and vice versa), the replacement of all ambiguous bases (N’s) in the genome with the correct sequence and the correction of
hundreds of sequencing errors. The original P. falciparum 3D7 genome project was started by three different genome centers – Sanger,
TIGR and Stanford – all of whom used different naming schemes (e.g MAL7P1.10, PF11_0344, PFE0010c). For the submission in 2002 all three naming
schemes were used. In October 2007, the systematic re-annotation of P. falciparum 3D7 started in earnest with a community annotation workshop
and work has continued at Sanger ever since. During that time, new genes have been created and other genes have been split. When a new ID needed
to be assigned but a logical numerical identifier was not available, a suffix of ‘a’ and ‘b’ was added to distinguish from the
neighbouring gene. In addition, now that part of chromosome 7 is located on chr8, and vice versa, it would be more confusing to have MAL7 identifiers
on chr8 and MAL8 identifiers on chr7.
We have therefore decided to adopt a consistent new scheme to identify genes. The old identifiers will be
maintained as fully searchable synonyms but will be made significantly less prominent in PlasmoDB and GeneDB. An important additional change is that
both protein-coding and non-protein coding genes will now use the same consistent nomenclature. The new naming scheme complies with the standards
set by the International Nucleotide Sequence Database Collaboration (Genbank, EMBL & DDBJ). All new identifiers now start with a two-part prefix:
PF3D7_ to identify the genome and two numerical digits to identify the chromosome, e.g. the first gene on chr1 is now called PF3D7_0100100. The new
naming scheme increments intervals of 100 to allow significant numbers of genes to be added (although we believe that this is unlikely). This new
naming scheme will also be adopted to other genomes that are being worked on at the Sanger Institute, e.g. P. falciparum IT genes will start with
PFIT_. The new sequence version can be viewed and searched on GeneDB and will be loaded into PlasmoDB in early 2012. EMBL and GFF format files,
together with a conversion table of old:new identifiers, can be downloaded from the following FTP site:
ftp://ftp.sanger.ac.uk/pub/pathogens/Plasmodium/falciparum/3D7/3D7.latest_version/version3/October_2011/
Dr. Rebecca Wattam and Maulik Shukla highlight data and analysis tools for bacterial pathogens via Brucella use cases.
show more...
What PATRIC Offers:
- Consistent annotations across all sequenced bacterial species from GenBank.
- Searches based on taxonomy, gene name, locus tag, protein function/families, pathways, EC numbers, GO terms, and more.
- Data and analysis results are free and typically downloadable.
- Organism summaries; pre-sorted PubMed articles; interactive phylogenetic trees; virulence and disease information, and experimental data including, transcriptomics, proteomics, protein structure, and protein-protein interactions.
- Numerous interactive visualizations for genome browsing, multi-genome comparisons, pathway comparisons, protein family comparisons, phylogenetic trees with coupled MSAs, 3D protein structures, and feature group comparisons.
For registration information, agenda, and location please see Brucellosis Workshop Flyer.
If you have a suggestion for a future workshop topic and/or location, please send it to us via the “Contact Us” link at the bottom of this page.
VectorBase is pleased to announce the VB-2011-08 release.
Highlights of this release include:
- First announcement of the new population biology resource prototype web interface
- Updated Ensembl genome browser (to schema 63)
More details are available in the Release notes.
PATRIC has updated its annotation and analysis to include nine strains from the recent outbreak, including a complete assembly of E. coli TY-2482.
show more...
Nine Escherichia coli strains were included in the latest analyses: TY-2482 (complete assembly), LB226692, GOS1, GOS2, H112180280, H112180282, H112180283, H112180540, and H112180541. The proteins from these strains were used to search a specific E. coli database (compiled with 184 E. coli genomes at PATRIC) using BLAST. We calculated Smith-Waterman alignment scores for each of the top ten best-scoring homologs of these proteins and normalized the score to the self-alignment score, creating a conformity score. Conformity scores of 1 indicate that all proteins in the alignment are identical. Proteins with conformity scores of 0.8 or less are considered to be “divergent” from their top ten homologs in E. coli. A graphical representation showing conformity scores for proteins of the nine E. coli strains is provided in the figure below.

A list of all proteins and their conformity scores is provided in an Excel file (ecoli9genocav). Some of the proteins found in the outbreak strains have no homologs in E. coli. In the conformity score column, these “unique” proteins are identified by a “-“. The data for the unique proteins for all nine strains are also integrated into the PATRIC website, with the additional functionality the site provides (where applicable); you can access those data at the following links:
The divergent proteins (see above) were compared to virulence and antibiotic-resistance proteins collected from different sources with some interesting discoveries. For example, beta-lactamase, a protein responsible for resistance to beta-lactam antibiotics like penicillins, cephamycins, and carbapenems was found to be identical in the nine outbreak strains (CTX-M type), but very different from their closest homologs prevalent in other E. coli strains (TEM-type). An alignment of these proteins is shown here. Other divergent proteins include ABC transporters, phage-related and outer-membrane proteins.
The University of Notre Dame ( www.nd.edu) is seeking a biologist with training and research experience in molecular biology and genomics, to join a large team working on highly visible web resources for the genomes and genome-related data of arthropod vectors of human pathogens (VectorBase). The successful applicant is expected to work closely with VectorBase developers and the external community of scientists who use the resource. The incumbent must be able to effectively communicate with researchers using VectorBase in order to define their needs, and convey that information to VectorBase software engineers and database specialists with the goal of optimizing the user experience. Major responsibilities of this position include providing for an intuitive and informative interface to access and interrogate genomic-scale datasets, and identifying/soliciting additional genomic-scale datasets from the external community. Position requires active participation in scientific meetings, both within the US and abroad, including demonstrations, workshops, and other instructional activities. Leadership in the preparation of tutorials, technical documentation, and reports is also expected. Opportunity for participation in bioinformatics / computational biology research for suitably motivated individual.
Additional information can be found here.
VectorBase is pleased to announce the VB-2011-06 release.
Highlights of this release include:
- Pre-site for Glossina moritans. Features mapped to the genome and preliminary gene sets available.
- New microarray data from Christian et al. and Cassone et al.
More details are available in the Release notes.
The latest VectorBase release (VB-2011-06) contains preliminary data for the Glossina morsitans genome:
- assembly, ESTs and traces are available for blasting,
- contigs, supercontigs, agp files, ESTs and traces are available for download,
- EST-based genes, ab initio SNAP genes, MAKER gene models, and repeats are available via the genome browser.
Glossina needs you!
Manual annotation provide an invaluable addition to the automatic annotation - there is already a small community in place that will be helping with manual annotation but if you are interested, please contact Serap Aksoy (Serap.Aksoy -at- yale.edu) or Geoff Attardo (geoffrey.attardo -at- yale.edu).
Conformity scores and associated data are available.
show more...
Note: Data was updated June 17th.
The proteins from the Escherichia coli strains TY-2482 and LB226692 were used to search a specific E. coli database (compiled with 184 E. coli genomes at PATRIC) using BLAST. We calculated Smith-Waterman alignment scores for each of the top ten best-scoring homologs of these proteins and normalized the score to the self-alignment score, creating a conformity score. Conformity scores of 1 mean that all proteins in the alignment are identical. Proteins with conformity scores of 0.8 or less were considered to be “divergent” from their top ten homologs in E. coli. A graphical representation, where conformity scores for over 5000 proteins of the E. coli strains TY-2482 and LB226692, is provided in the figure below. The X-axis is in log scale.

Click image to enlarge
A list of all proteins and their conformity scores is provided in an Excel file (TY2482-LB226692_Vir_ABR_Conformity_All). Some of the proteins found in TY-2482 and LB226692 have no homologs in E. coli. In the conformity score column, these “unique” proteins are identified by a “-“. The data for the unique proteins for both species are also integrated into the PATRIC website here, with the additional functionality the site provides.
The divergent proteins (see above) were compared to virulence and antibiotic-resistance proteins collected from different sources with some interesting discoveries. For example, beta-lactamase, a protein responsible for resistance to beta-lactam antibiotics like penicillins, cephamycins, and carbapenems was found to be identical in TY-2482 and LB226692 (CTX-M type), but very different from their closest homologs prevalent in other E. coli strains (TEM-type). An alignment of these proteins is provided here. Other divergent proteins include ABC transporters, phage-related and outermembrane proteins.
PATRIC has collected metadata associated with genomes from sources, such as NCBI’s BioProject, GenBank, and the Human Microbiome Project.
show more...
We anticipate making a minimum of two awards of up to $600K each.
show more...
SOLICITATION CLOSED for Letters of Intent. Full proposals accepted through invitation only.
QUESTIONS AND ANSWERS Updated October 28th. See bottom of this page.
The NIAID-funded Pathosystems Resource Integration Center (PATRIC) invites proposals for Driving Biological Projects (DBPs) focused on one or more of the pathogens supported by this Bioinformatics Resource Center (BRC). We anticipate making a minimum of two awards of up to $600K each by April 1, 2012. A two page Letter of Intent (LOI) is due August 1, 2011. Full proposals will be solicited by invitation only, with a deadline of November 1, 2011.
DBP proposals must exploit high-throughput experimental technologies to functionally characterize bacterial species in order to help elucidate pathogenesis, antimicrobial resistance and/or other biological processes of interest in the study of bacterial infectious diseases. Such approaches may incorporate expression profiling microarrays, proteomics studies, metabolomics, interactome analysis, RNAi experiments, RNA-Seq, Chip-Seq, bioassays, and/or other technologies.
All experimental results, data, metadata, and reagents generated by the DBPs must be released into the public domain, according to data release guidelines provided by PATRIC. NIAID mandates that PATRIC staff assist with bioinformatics analysis and loading of data produced by the DBP, which may involve data management, computational and/or statistical analysis of experimental data, generating predictive models, developing algorithms for data analysis and visualization, etc.
Read further information about the BRC program on the NIAID website.
DBP Information
- Type of award: cost reimbursement subcontract.
- Number of awards: at least 2 (maximum total budget for all awards = $1.2M).
- Duration of each award: up to 24 months.
- Maximum budget per award: up to $600,000 in total costs.
- Letters of Intent must be submitted by 5:00 PM Eastern Time, August 1, 2011, via email to patric-dbp@vbi.vt.edu. LOIs are limited to 2 pages maximum, 11 pt font or larger.
- Full proposals will be solicited by invitation only, with a due date of 5:00 PM, Eastern Time, November 1, 2011, via email to patric-dbp@vbi.vt.edu. Full proposals are limited to 8 pages of text and figures (11 pt font or larger). Biosketches and references are not included in the page count. Appendices are discouraged.
- Projected start date: April 1, 2012.
Review
LOIs and Full Proposals will be reviewed by the BRC Scientific Working Group (SWG), an external board of advisors, based on the following criteria:
- Technical feasibility
- Scientific impact on the communities served by PATRIC
- Impact on the development of new features in PATRIC
- Cost-effectiveness
Recommendations from the SWG will be considered by PATRIC and NIAID program staff, which will be responsible for final selection of DBP awards.
Eligibility
All researchers with expertise and capabilities in the areas described above are invited to submit a Letter of Intent. DBP applicants / institutions need not be located in the US, but any local requirements must be compatible with NIH regulations and the tight time-lines noted above. Members of the SWG are not precluded from submitting LOIs, but if selected to submit full applications will not participate in the final review. Per the PATRIC prime contract, the number of DBPs is limited to only one from the prime contractor’s and subcontractors’ institutions, including the DBPs’ institutions, during the entire contract period of performance. Thus, proposals from institutions that have DBPs already awarded from PATRIC are ineligible. Information about current PATRIC DBPs can be found here:
Letters of Intent
Submission of a Letter of Intent is required, via email to patric-dbp@vbi.vt.edu. The LOI must be no longer than 2 pages (11 pt font or larger), and must include the following components:
- PI name, affiliation and contact information
- Project title
- Pathogen(s) involved
- Project description including specific description of work proposed
- Explanation of how the project will utilize and complement BRC resources in expediting research on these pathogens by the larger scientific community
Projects considered competitive by the SWG and PATRIC staff in consultation with NIAID program officers will be invited to submit full proposals. Note that the SWG may suggest (but may not require) communication between applicants proposing similar projects in order to facilitate coordination where appropriate.
Full Proposals – Only selected LOIs are eligible for Full Proposal submission
DBP proposals should be structured as a white paper, and must clearly address how the proposed project will exploit high-throughput experimental and bioinformatics techniques to functionally characterize the genome / proteome / metabolome of microbial species supported by PATRIC, helping to elucidate pathogenesis, antimicrobial resistance and/or other important infectious disease processes. DBP proposals are limited to 8 pages of text and figures (11 pt font or larger), and must include the following information:
- Executive summary of the project’s goals, including the pathogen(s) involved, information to be produced, and the expected impact on scientific communities served by PATRIC.
- Description of the data to be produced, including methods used to generate this information and possible obstacles. The proposal should also review any relevant datasets that are currently available, and discuss their relationship to the proposed project. Projects involving animals and/or human subjects are subject to standard NIH regulations and must document relevant approvals.
- Details of high throughput experimental technologies necessary to carry out the project, noting currently available reagents, facilities, equipment and other resources.
- Discussion of computational and/or statistical analyses, noting those to be provided by the applicant, and those required from PATRIC (see below).
- Assessment of impact, including how these data are likely to influence development of new PATRIC features, and how they will be used by relevant research communities.
- Project timeline, including specific milestones and deliverables.
- Data release / data sharing plan, consistent with NIAID and BRC data release guidelines (see below).
Additional material to include in the proposal (not included in the 8 page limit):
- References
- List of proposed scientific and technical personnel, detailing qualifications, experience, and role(s) in the project. The DBP lead must devote at least 10% effort to this project. Note: full CVs are not required.
- Proposed budget (maximum of $600K total costs over two years), broken down by category (labor, materials, other line items, as appropriate for the project). Note that cost effectiveness will be considered during review; budgets may be supplemented from other sources, and it is permissible to propose alternatives (e.g. line items detailing costs for pathogen A, pathogen B, dataset X, dataset Y, analysis N, etc). Use the NIH Budget and Budget Justification forms which can be downloaded from here: (Budget Justification Budget Template)
Please do not use appendices.
BRC Resource Utilization Plan: The proposal must include a specific plan that describes the intended PATRIC integration with the project and bioinformatics support expected from PATRIC. This section of the proposal must be submitted for review and evaluation at least one week before the proposal due date, submitted in MS Word format via email to patric-dbp@vbi.vt.edu. The response from PATRIC will either be “Yes, we can support what you are proposing”, or “No, we cannot support what you are proposing” with an indication of which part we will not be able to support and why. This information can be used by the proposer to modify their final proposal. At the time of final proposal submission PATRIC will provide an official letter that includes a description of the planned support from PATRIC that will be included with the proposal for SWG review.
Full Proposals are to be sent via email to: patric-dbp@vbi.vt.edu (contact rkenyon@vbi.vt.edu) if acknowledgement is not received within 72 hours of submission). Proposals may be subject to further negotiation prior to acceptance, according to standard NIH subcontracting procedures. We anticipate notification of awards (or opening of further negotiations) on or about January 9, 2012, and that subcontract awards will be in place by March 1, 2012. The projected start date is April 1, 2012.
Data Release Policy
- All data and information generated under the DBP award must be made accessible to the PATRIC team immediately upon generation.
- Public release via the PATRIC Website is expected within one month from publication or within one year of generation, whichever comes first.
- Complete datasets, meta-data and reagents generated under the DBP award must also be made publicly available within the same time-frame, without restriction (e.g. MTA requirements). For example, sequences should be deposited in Genbank / EMBL / DDBJ, or the European Short Read Archive, as appropriate. MIAME-compliant microarray data and MINSEQE-compliant RNA-Seq and ChIP-Seq data should be deposited at GEO / ArrayExpress / CIBEX, etc. Reagents of general utility should be made available via the BEI, ATCC, or other appropriate distribution centers.
- Any exceptions to the above policies must be approved in writing by the PATRIC management and NIAID program officers.
Further information on data handling procedures and data management policies is posted on the PATRIC Website.
The NIAID will also review and provide the final approval of the DBPs proposed for award by the SWG. Final Proposals selected for submission to NIAID are subject to NIAID approval, NIAID supplemental award to the PATRIC prime contract, and successful subcontract negotiation between Virginia Tech and the potential awardee.
Questions and Answers
We will post questions received from submitters and our responses here on the PATRIC web site during the proposal preparation phase. If necessary, the question and answer will be re-phrased to make them generically relevant to all/other proposals.
Reporting Requirements
Semi-annual and periodic ad hoc reports will be required to assist with the review of progress toward stated goals.
We regret that it will not be possible to fund all of the valuable projects that one might envision in response to this solicitation, but look forward to continuing to work with the PATRIC user community to integrate additional datasets generated through other mechanisms. Click on the ʻContact Usʼ link at patric@vbi.vt.edu if you wish to discuss integration or bioinformatics support for datasets that are currently available or in preparation.
QUESTIONS AND ANSWERS Updated October 28th.
When will notifications go out about decisions on LOIs and invitations for full proposals?
We are targeting mid-September for notifications of decisions on LOIs.
What organisms are acceptable for study in the DBP?
The Driving Biological Project should focus on BACTERIAL pathogens that are the target of the PATRIC project, which include NIAID category A-C Priority Pathogens and Emerging and Re-Emerging Infectious Diseases.
Can an investigator on a current DBP apply to the new solicitation?
Our contract limits the number of DBPs from any single institution to only one. However, it is permissible for an institution that currently has a DBP or investigator on one of the current DBPs to be part of a proposal for which a different institution will have the prime contract for the DBP.
Will DBPs support projects focused on the host side of host-pathogen interaction? That is, on projects aimed at identifying host regulatory and signaling pathways that 1) increase/decrease in response to infection and 2) contribute to host resistance or pathogen success.
DBP proposals must generate data to functionally characterize bacterial species in order to help elucidate pathogenesis, antimicrobial resistance and/or other biological processes of interest in the study of bacterial infectious diseases. If the host data would be generated that results from interaction with the pathogen, we have means for using and supporting that data in PATRIC.
Can an interdisciplinary team of experimentalists and computational biologists apply to the DBP? The solicitation emphasizes the analysis that the BRC team can provide to experimentalists?
It would be permissible for a team of experimentalists and computational biologists to apply. It is important however that the DBP proposal describe how the BRC will be utilized during the course of the DBP.
Would your group be able to take raw transcriptomics (microarray and RNA-seq) and metabolic data and generate classifications based on levels of expression or metabolite concentrations?
Yes.
Would your group be able to help in modeling of metabolic pathways (flux) based on the data?
Yes.
After submitting the LOI when will we be notified whether we receive an invitation to write a full proposal?
Invitations for full proposals should go out in mid-September.
How many groups will be invited to submit full proposals?
There is no set number of full proposals we will solicit. It will be driven by the quality & relevance of LOIs we receive.
Which institutions currently have a DBP?
Cornell University and University of California Irvine. Details can be found here.
Is it more important to present the underlying biological question in the proposal or to elaborate on the technologies to be used, data to be generated, and use of PATRIC resources?
The proposal should be balanced in addressing the review criteria:
- Technical feasibility
- Scientific impact on the communities served by PATRIC
- Impact on the development of new features in PATRIC
- Cost-effectiveness
Thus, the biological question is relevant to the second criterion, and the use of PATRIC resources to the third.
Our DBP may focus on an organism studied under a current DBP. Will this be a disadvantage? Is there a preference for a wider coverage of organisms and data types? Or are the underlying questions and approaches of greater significance?
The study of an organism currently covered by a DBP is not precluded nor necessarily disadvantaged, but justification that the questions and underlying approaches are more broadly applicable to other organisms would be helpful.
Does the BRC Resource Utilization Plan section of the proposal count against the 8-page limit, or is it considered additional material? Is there a suggested length for this section?
The Utilization Plan does not count against the 8-page limit per se, but we ask that you include a 1-2 paragraph summary of the approved plan in your proposal. The plan itself (due for review and evaluation at least one week before the proposal due date) can be whatever length you think is appropriate to describe how the interactions with the BRC will take place. We expect that 1-2 pages would probably be sufficient.
Does the $600K total budget include indirect costs to the home institution?
Yes the $600K includes direct plus indirect costs. The total amount of the budget cannot exceed $600K.
Our organization will be a subcontractor to the lead organization for the DBP. What budget forms are required from our institution?
Use the templates found here:
http://www.patricbrc.org/patric/html/NIH_Budget_Template.Contract.xls
http://www.patricbrc.org/patric/html/NIH_Budget_Justification_Template.doc
NIH 2043 Form
What are features of the PATRIC pipeline for RNA-Seq analyses?
At this point PATRIC doesn’t have any incorporated RNA-Seq tools. However, by the proposed DBP start date 1-April-2012, we plan to have deployed a workflow that will allow a user to bring their RNA-Seq data, produce BAM and RPKM files, and filter and do analysis of results by visualizing that data in a private PATRIC context. The planned PATRIC functionality includes the ability to view wiggle plots on the genome browser (along with other tracks), view and filter transcriptomic data, and view enrichment information. Since part of the goal of the DBPs is to drive functionality at PATRIC, if you think features beyond this are desirable, they should be included as part of the proposal. In the meantime, we have the ability and experience to process and help interpret RNA-Seq data.
Does PATRIC have tools for annotating ncRNAs?
PATRIC has in the past provided ncRNA annotations, but doesn’t currently provide such annotation in the PATRIC annotation system. What you see at the PATRIC website are ncRNAs identified either in the RefSeq annotation or from earlier annotation systems. This is something that we’d like to address but we don’t have a currently scheduled time frame for this feature. If it is a requirement for your DBP, it would become a priority in PATRIC.
Does the NIH salary cap of $199,700 apply to the DBPs?
Yes, since PATRIC is NIH funded and the DBPs are part of that funding, the NIH salary cap applies.
Can the PATRIC team provide support for RNA-Seq data analysis for eukaryotic hosts.
Yes, the PATRIC team can provide support for RNA-Seq analysis of host as well as pathogen data.
Is there someone to talk with to get some more guidance about how to best write the full proposal?
Along with our NIAID Program Officer, we have decided that having individual conversations with DBP proposers could lead to the appearance of perhaps giving some proposals a competitive edge. Therefore we have decided that we will use email correspondences to answer specific questions and post generic forms of those questions and answers to the PATRIC website.
As for additional guidance, this is an NIH funded project and the reviewers are familiar with NIH formats and procedures. We suggest that you approach this as you would other NIH proposals, with particular attention to the required information.
Are letters of support permitted?
Yes, letters of support are permitted and they do not count against the 8-page limit.
Is an NIH biosketch needed in addition to the list of scientific and technical personnel?
No, you do not need to include an NIH biosketch in addition to the list of scientific and technical personnel.
What level of detail is required in the Materials and Supplies page of the budget?
The DBP awards will be via an NIH contract, so you should follow NIH Guidelines for contracts (not grants) in filling out the Materials and Supplies page in the budget.
My proposal is intended to generate data types or utilize analysis tools that I can’t find currently on the PATRIC website. Is this a problem?
No, this is not a problem. One of the review criteria for proposals is “Impact on the development of new features in PATRIC” (see the other review criteria in the Review description above on this web page.) The goal of this criterion is to drive the development of novel infrastructure that is generally usable by the community, and the idea is to develop it as part of collaboration with the DBP awardees that are generating the data and know how they want to utilize the data.
Where can we find the FAR clauses and other flow-down requirements that will apply to our subcontract, if selected for award?
The original RFA for the BRC program (of which PATRIC is one of the awardees) is below. You should be able to find applicable terms and conditions there. Note that we will be handling all of the information system requirements, so your institute should not have to be concerned with those. If your institute is awarded one of the Driving Biological Projects, there will be an opportunity to negotiate terms and conditions as needed.
https://www.fbo.gov/spg/HHS/NIH/NIAID/NIH-NIAID-DMID-AI2008038/listing.html
Do I need to include anything other than the plan itself with respect to the BRC Utilization Plan in my application?
In the interest of time, we will forgo issuing a formal letter of approval from PATRIC. The email approval will suffice, for which we have a copy, so you do not need to include anything else if you have received approval of the plan from us via email.
Will this project be funded by a NIH contract or grant?
PATRIC is a NIAID/NIH contract, and the DBP will be funded via subcontract from the PATRIC prime contract.
VectorBase is pleased to announce the VB-2011-04 release.
Highlights of this release include:
- New microarray expression data for Culex quinquefasciatus and Aedes aegypti
- Updated mass spec. peptides for Anopheles gambiae available via DAS and simple browsing interface
- Anopheles funestus RNA-seq contigs available for BLAST and browsing and search
- GTF file format for use in tophat/cufflinks analysis of RNA-seq data available for download from all species.
More detailed release notes are available on the Wiki.
The NIAID-funded VectorBase invites research investigations called Driving Biological Projects (DBPs) from organizations housing experimental laboratories having expertise and knowledge in working with arthropod vectors of human pathogens. We anticipate making two awards of up to $600K each by April 1, 2012. A two page Letter of Intent (LOI) is due August 1, 2011. Full proposals will be solicited by invitation only, with a deadline of November 1, 2011.
See the full solicitation here. Send questions to vector@nd.edu
Free PATRIC bioinformatics resource center workshop featuring data and analysis tools for bacterial pathogens.
show more...
What PATRIC Offers:
- Consistent annotations across all sequenced bacterial species from GenBank.
- Searches based on taxonomy, gene name, locus tag, protein function/families, pathways, EC numbers, GO terms, and more.
- Data and analysis results are free and typically downloadable.
- Organism summaries; pre-sorted PubMed articles; interactive phylogenetic trees; virulence and disease information, and experimental data including, transcriptomics, proteomics, protein structure, and protein-protein interactions.
- Numerous interactive visualizations for genome browsing, multi-genome comparisons, pathway comparisons, protein family comparisons, phylogenetic trees with coupled MSAs, 3D protein structures, and feature group comparisons.
Who Should Attend:
- Bacterial pathogen researchers/students and bioinformaticians.
For registration information, agenda, and location please see UC Davis Workshop Flyer.
Following extensive discussion with
the malaria research community, the list of Plasmodium genome
sequencing projects supported by the US National Institutes of Allergy
and Infectious Diseases (NIAID) has been updated, replacing entries
that have been difficult to source with additional isolates and species
of interest.
The initial white paper is available here;
click here
for the revised list of approved projects. The Plasmodium
sequencing contract has been awarded to the Broad
Institute, and data is made publicly available to the broader
research community via deposition in GenBank and through relevant Bioinformatics
Resource Centers, including PlasmoDB. Further information on
the 'white paper' process by which key pathogen species are approved
for sequencing is available here.
Dear Cryptosporidium Researcher,
We would like to inform you of a proposal being prepared on behalf of the research community to generate genomic and RNA-seq data for a number of Cryptosporidium species, including laboratory propagated and geographic/clinical isolates. �The proposal will be submitted for funding to the National Institutes of Allergy and Infectious Diseases (NIAID) for consideration by the Eukaryotic Pathogen and Disease Vector Sequencing Project working group and coordinating committee.
http://www.niaid.nih.gov/labsandresources/resources/gsc/pathogen/Pages/default.aspx
The species we are suggesting include C. parvum, C. hominis, C. meleagridis, C. bovis, C. bailey, C. cuniculus and C. ubiquitum. This sequencing effort will facilitate comparative analyses to identify genes which control host specificity and other clinically and biologically important traits, as well as to extend our understanding of Cryptosporidium populations. All data will be available to the public through the National Center for Biotechnology Information (NCBI) and CryptoDB.org websites.
We would like to solicit your input and request your support. �At this time an e-mail expressing your interest in this project or proposing changes would be much appreciated. Please indicate whether you may have access to Cryptosporidium positive clinical or animal samples which could be made available for this project.
Please send emails of inquiry and/or support to Giovanni Widmer (giovanni.widmer@tufts.edu) or Jessie Kissinger (jkissing@uga.edu)
VectorBase is pleased to announce the VB-2011-02 release.
Highlights of this release include:
Preliminary Rhodnius prolixus genome browsers with transcript and protein similarities, repeats and preliminary gene predictions.
Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
Update to the Xrefs for Aedes, Anopheles.
More detailed release notes are available.
A preliminary data set for Rhodnius prolixus is included in the VB-2011-02 release:
http://rprolixus.vectorbase.org/
The annotation of Rhodnius prolixus is in process. At the present time, the
following data are available:
- repeats (Dust, TRF, known repeated elements publicly available);
- alignments to 9,665 Rhodnius prolixus Expressed Sequence Tags (ESTs) from dbEST mapped to the genome;
- alignments to 792,023 sequences from 2 Rhodnius prolixus RNA-seq datasets mapped to the genome;
- EST and RNA-seq derived gene set
- alignments to 143,631 proteins from Uniprot, grouped into 7 taxa, mapped to the genome
- 18,426 (Genscan http://genes.mit.edu/GENSCAN.html) ab initio gene models
Be aware that the EST and RNA-seq derived genes are not part of the official gene set, their identifiers are not stable and there is no plan to maintain them.
They have not been indexed as part of the site-wide search.
Note that VectorBase provides a (Blast server http://www.vectorbase.org/Tools/BLAST/) to compare your sequence against the Rhodnius prolixus supercontigs.
Raw and processed versions of the ESTs and RNA-seq datasets are available for
(download http://www.vectorbase.org/GetData/Downloads/), as are the supercontigs.
A full genebuild will follow in a future VectorBase release.
Free PATRIC bioinformatics resource center workshop featuring data and analysis tools for bacterial pathogens.
show more...
What PATRIC Offers:
- Consistent annotations across all sequenced bacterial species from GenBank.
- Searches based on taxonomy, gene name, locus tag, protein function/families, pathways, EC numbers, GO terms, and more.
- Data and analysis results are free and typically downloadable.
- Organism summaries; pre-sorted PubMed articles; interactive phylogenetic trees; virulence and disease information, and experimental data including, transcriptomics, proteomics, protein structure, and protein-protein interactions.
- Numerous interactive visualizations for genome browsing, multi-genome comparisons, pathway comparisons, protein family comparisons, phylogenetic trees with coupled MSAs, 3D protein structures, and feature group comparisons.
Who Should Attend:
- Bacterial pathogen researchers/students and bioinformaticians.
For registration information, agenda, and location please see OCICB Workshop Flyer.
The Pathogen Functional Genomics Resource Center (PFGRC) Pathogen DNA Microarray oligo and printing designs have been submitted and are publicly available from GEO Datasets
for investigators who would like to use these
designs to order microarrays from a microarray supplier. The GEO Accession Numbers are available from the Pathogen Portal
here.
PFGRC Microarray Laboratory Protocols are also now available from the Pathogen Portal
here.
The PFGRC has also made program source files, executables, and documentation for bioinformatics data management and analysis tools available for download through the
SourceForge website and the PathogenPortal
here.
VectorBase is proud to announce its December release, codenamed VB-2010-12. You can find all the details in the release notes. Comments, questions and bug reports are welcome via the usual channels ( help desk email, forum, Facebook and Twitter).
We've incorporated another 500 community annotations to the Anopheles gambiae gene set! More precisely, the update has lead to:
- 65 new genes,
- 156 new isoform predictions,
- 443 modified transcript predictions.
This new gene set AgamP3.6 is available with the new VectorBase release VB-2010-12. More details are available on our Help Wiki, including the list of modifications between both gene sets.
Two-year awards focusing on infectious disease research related to human bacterial pathogens were granted.
show more...
The Pathosystems Resource Integration Center (PATRIC), a bacterial Bioinformatics Resource Center (BRC) funded by the National Institute of Allergy and Infectious Diseases (NIAID) of the National Institutes of Health (NIH), is pleased to announce awards for its Driving Biological Projects (DBPs). DBPs are two-year awards that focus on infectious diseases research related to human bacterial pathogens. Research includes the use of high-throughput experimental technologies (HTP) to functionally characterize the genome, proteome or metabolome of bacterial organisms and/or host-pathogen interactions. This research may help elucidate the role of genes, proteins and metabolites with respect to pathogenesis, antimicrobial resistance or other biological processes of interest. The data and information generated by the DBPs will be made accessible to the broad scientific community and will be used to drive PATRIC’s infrastructure development to enable other researchers to perform similar analyses at the PATRIC website using their own data.
The DBP proposals were independently reviewed and evaluated by the PATRIC Scientific Working Group (SWG), and the top two proposals were selected for award. The awarded DBPs are:
- Comparative transcriptome and proteome analysis of Clostridium difficile strains: Dr. Yung-Fu Chang, PI, Dept. of Population Medicine and Diagnostic Sciences, College of Veterinary Medicine, Cornell University – The Cornell team will compare five C. difficile strains, generate transcriptome and proteome sequence data for these strains, verify and complement the genome sequence annotations of available sequenced strains and expand the PATRIC data model by joint development, testing and deployment of tools for RNA-Seq analysis that can be used for other bacterial RNA-Seq projects.
- Fitness Annotation of Bacterial Genomes: Dr. Michael McClelland, PI, Dept. of Pathology and Laboratory Medicine, University of California, Irvine – The goal of this proposal is to provide PATRIC with the information needed to display which genes of non-typhoidal Salmonella serovar Typhimurium (STm) contribute to survival in a variety of environments, including in infections of various hosts. This will be accomplished through a combination of high-throughput screening and sequencing methods and unique resources developed to annotate the STm genome with fitness information. STm transcriptomes will be generated from bacteria growing in defined environments, including rich and minimal media, at stationary phase, and in conditions that induce virulence pathways. These transcriptomes will provide basal reference profiles to standardize and improve analysis of the impending onslaught of high-throughput transcriptomics data. Decoration of the genome with basal transcriptional data will complement the fitness profile and other existing annotations in this archetypical strain.
PATRIC will use the information learned and workflows identified in these DBPs to develop Web-accessible infrastructure to accept and analyze RNA-Seq data and other associated data types. The knowledge gained through the experiments will be used to improve annotations for the specific organisms under study and in a broader context where applicable.
Dr. Chang said, “The PATRIC DBP is an excellent opportunity for our laboratory to expand the ongoing genomics research on C. difficile. In this DBP, we will be integrating large scale RNA-Seq, proteome and phenotype array data into the genome annotations of C. difficile. The experimental data generated will further be used for extending the annotations to other draft C. difficile genomes. We will develop experimentally curated pathway maps for several C. difficile genomes. We will also develop a pathway cross-mapping system between popular pathway analysis tools such as KEGG, SEED and Pathway Tools. Dr. Joy Scaria from the Chang laboratory will also work with the PATRIC team at VBI to seamlessly integrate the experimental data into PATRIC system. With the active participation of PATRIC team in the data analysis and software deployment, we anticipate to develop a scalable genomic analysis model extendable to other bacterial pathogens.”
Dr. McClelland said, “It is a privilege to be part of this vitally important NIAID mechanism in which wet labs are funded to link up with bioinformatics resource groups and make their data and analysis tools quickly and easily available to the community. There is a desperate need for many more of these kinds of grants to be funded to keep up with the increasing power of high-throughput data generation and presentation strategies, and to include a wide a range of pathogens and experts.”
Dr. Bruno Sobral, PI of the PATRIC project, offered the following comments on the DBPs: “PATRIC fully supports the concept of driving biological projects with the infectious disease community that performs genomic-scale wet chemistry experiments. The excellent quality of all the proposals received and especially the two selected shows that there is much to be done in support of the thousands of laboratories that work on bacterial infectious diseases. We are grateful to expand the PATRIC team to include such excellent scientists working on two organisms of great importance for hospital-acquired infections and food-borne outbreaks.”
Another solicitation for PATRIC DBPs is expected to occur in 2012. DBP announcements will be posted on the PATRIC Website. For individuals who wish to be included on the PATRIC email list for updates to the site, workshop notifications and DBP announcements, please contact us at patric@vbi.vt.edu
About PATRIC
PATRIC, the PathoSystems Resource Integration Center is a comprehensive Web-based resource for bacterial pathogens, biodefense research, and the study of emerging infectious diseases. PATRIC acquires, stores and integrates diverse information from pathogenic bacteria, provides visualization and data analysis tools to scientists, and enables the analysis of genomic, proteomic and various other data arising from, and of relevance to, infectious disease research. PATRIC’s scope includes all bacterial species and near relatives in the selected category A-C pathogens list as defined by NIAID.
The PATRIC project includes three primary collaborators: the University of Chicago, the University of Manchester, and New City Media. The University of Chicago is providing genome annotations and a PATRIC end-user genome annotation service using their Rapid Annotation using Subsystem Technology (RAST) system. The National Centre for Text Mining (NaCTeM) at the University of Manchester is providing literature-based text mining capability and service. New City Media is providing assistance in website interface development. The PATRIC project also develops and hosts the Pathogen Portal a gateway and common data repository for the entire NIAID BRC Program.
This project has been funded in whole or in part with Federal funds from the National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Services, under Contract No. HHSN272200900040C.
About the Cyberinfrastructure Division
The Cyberinfrastructure (CI) Division at the Virginia Bioinformatics Institute (VBI) of Virginia Tech (VT) is a diverse and highly integrated team of biologists, bioinformaticians, software engineers, and other experts that develops data, methods, infrastructure, and resources that enable scientific discoveries in infectious disease research and increasingly other research fields. The team applies the principles of cyberinfrastructure to integrate data, computational infrastructure, and people. The CI Division has developed many public resources for curated, diverse molecular and literature data from various systems, and implemented the processes, systems, and databases required to support them. Members of the team also conduct research their own research in a highly coordinated way within the Division and with scientific communities across the globe by applying its methods, data, and infrastructure to make new discoveries.
The 10th issue of the VectorBase newsletter is out!
Lots of new data and tools to present this time. The most important might be the new Anopheles gambiae M and S genome browsers. But we also present the selected Driving Biological Projects, the Culex genome papers, link to additional data, and new tools, mainly to reach our community (contact form, RSS feeds, Facebook). Have a break and read our gazette!
It is accessible on-line and in PDF from the Newsletter Help Page. You can also subscribe to the Newsletter mailing list to receive it directly in your mail box.
Genome sequencing of the M & S molecular forms of An. gambiae were published today showing that speciation is more advanced than previously thought. This greater divergence between the genomes has highlights the need to identify those genes critical for initiating this process. A companion paper describes the development of a SNP genotyping platform for investigating gene flow between the incipient species.
Genome browsers for the M & S molecular forms are available through VectorBase:
We are pleased to report that the following Driving Biological Projects (DBPs) have been awarded in connection with the EuPathDB Bioinformatics Resource Center (BRC):
- The Toxoplasma oocyst transcriptome and proteome
John Boothroyd, Matt Bogyo (Stanford) & Pat Conrad (UC Davis)
This project is expected to yield expression data from an important, but inaccessible, life cycle stage (1 yr).
- A comprehensive catalog of the T. gondii & N. caninum parasite & infected host cell transcriptome & proteome
Brian Gregory (Univ. Pennsylvania) & Jonathan Wastling (Univ. Liverpool)
This project will produce an immense volume of sequence data, driving database improvements for handling 'next-gen' sequencing, non-coding RNAs, and interrogating host & pathogen datasets in parallel (2 yr).
- Dissection of Plasmodium falciparum genetic crosses by mass spectrometry-based metabolite QTL
Manuel Llinás (Princeton) & Michael Ferdig (Notre Dame)
This project is expected to improve handling of genetic mapping data, incorporation of metabolomics results (integrated with pathway maps), and linking of specific genetic loci with metabolite abundance (2 yr).
A biennial competition for DBP awards is mandated by the BRC contract from NIH/NIAID, and supported by a total of $1.2M in total costs per award cycle. The goal of this program is to produce high value genomic-scale datasets for the EuPathDB user community, and drive the further development of bioinformatics resources for data analysis and mining.
Letters of intent (LOI) were solicited in February (see Feb 4 News item), and reviewed by the EuPathDB Scientific Working Group (SWG; see News, March 31). 12 full applications were solicited from among the ~50 LOIs received, and awards were based on further scientific review by the SWG, and in consultation with NIAID program staff. Cost negotiations and 'in kind' contributions from other sources made it possible to support 3 applications, from 6 investigators (in some cases, more than one LOI was merged into a single award).
We received far more high quality proposals than could be supported, testifying to the need for such mechanisms for data production bioinformatics analysis. The EuPathDB team looks forward to working with members of our user community who are seeking funding for such projects from external sources. Inquiries, including requests for support letters, should be directed to the EuPathDB staff (click on the 'Contact Us' link).
A new solicitation for DBPs is anticipated in 2011, and will be announced on the EuPathDB web site.
VectorBase is pleased to announce our first two Driving Biological Projects (DBP) funded by NIH/NIAID on the recommendation by the VectorBase Scientific Working Group ( SWG). All data generated by these funded DBPs will be made publicly available on VectorBase:
- Comparative Neurotranscriptome of Aedes aegypti
(PDF)
- Physical Reference Maps for Vector Genome Assembly (PDF)
Please contact us at info@vectorbase.org if you have any questions. We expect the next call for proposals to be announced with the next 18 months.
VectorBase is pleased to announce the VB-2010-10 release. Highlights of this release include:
- Preliminary Anopheles gambiae M and S genome browsers with EST mappings and EST-based gene predictions
- Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
- Update to the Xrefs for Aedes, Anopheles, Culex and Ixodes.
More detailed release notes are available on the Wiki.
Come bring your data and participate in free hands-on tutorials featuring the annotation and modeling services offered by the RAST/SEED servers.
show more...
The team that developed and supports the RAST server will be offering 2-day tutorials on the annotation and modeling services offered by the RAST/SEED servers at Argonne National Laboratory (ANL) on the following dates:
- Oct 4-6
- Oct 25-27
- Nov 1-3
- Nov 15-17
We find that many of our RAST users utilize the basic service but are not familiar with the more advanced features of RAST or the services that can be accessed via the SEED Servers. These hands-on tutorials will cover the details of how to create annotations and metabolic models for newly-sequenced bacterial or archaeal genomes.
We will:
- Present an overview of the RAST services,
- Help participants install a small client-side collection of software that supports access to our servers over the web.
- Use the servers to create an annotation and metabolic reconstruction for a bacterial genome.
- Create an initial model of the metabolism of the sequenced organism.
- Use the model to explore the metabolism represented by the genome.
- And iteratively refine the model.
Participants who wish to bring a genome of their choice and work with our team on annotating and modeling it are invited to stay an optional third day following the main tutorial.
The tutorial is free; however all travel expenses will be the responsibility of the participants. Reasonably priced lodging is available on-site at ANL. The rooms are within walking distance of the tutorial location and include internet access.
If you’re interested in attending one of these tutorials please sign up by sending a note to rast-tutorial@lists.mcs.anl.gov, specifying your dates of interest (Oct 4-6, Oct 25-27, Nov 1-3, or Nov 15-17).
The genome paper presenting the annotation and the analysis of the Culex quinquefasciatus genome, the third mosquito to be sequenced, has been published today in the journal Science. Check it out!
A companion paper detailing the analysis of the Culex pathogenomics and infection response to diverse pathogens is published in the same issue.
In our latest efforts to reach out to the bioinformatics community surrounding VectorBase, we will be providing an RSS feed providing updates on the latest employment opportunities. Please add this feed to your reader to catch the latest jobs news!
VectorBase is pleased to announce the VB-2010-08 release.
Highlights of this release include:
- New genome browser look and feel.
- Three Aedes aegypti expression studies added.
- Update to the mitochondrial gene data for Aedes, Anopheles, Culex and Ixodes.
- Update to the Xrefs for Aedes, Anopheles, Culex and Ixodes.
- Update to the EST data for Aedes and Anopheles.
More detailed release notes are available on the Wiki.
We are aware of a number of outstanding issues with this release, documented in the release notes, and are actively addressing those. Please report bugs or other issues to the VectorBase helpdesk
Welcome to the 9th issue of the VectorBase newsletter!
We announce the new Ensembl Genome Browser and an updated Search facility. The newsletter also features new data: Pediculus humanus genome browser, IDOMAL data as well as RNA-seq expression studies; and much more information ... so come and have a look!
It is accessible on-line and in PDF from the Newsletter Help Page. You can also subscribe to the Newsletter mailing list to receive it directly in your mail box.
VectorBase is pleased to announce the VB-2010-06 release.
Highlights of this release include:
- New genome browser look and feel
- Human body louse Pediculus humanus genome added
- IDOMAL malaria ontology released
More detailed release notes are available on the Wiki.
We are aware of a number of outstanding issues with this release, documented in the release notes, and are actively addressing those. Please report bugs or other issues to the VectorBase helpdesk
VectorBase will soon be updating its Genome Browser to stay in line with the Ensembl code, and benefit from their constant improvements. You can familiarise yourself with the new genome browser via the Ensembl Genome links below. These changes are scheduled for release in early June. Please note that, from June, the VectorBase-hosted genome browser will have enhanced functionality (e.g. expression data, useful DAS tracks configured by default).
The hope that the new presentation is more intuitive and accessible. We will help you go through the change with a new tutorial and will be answering questions on our HELP mailing list. The web addresses wonât change so any links you bookmarked will still be valid.
You can have a peak at the new presentation for using these links:
In its continuing effort to address Grand Challenges in Global Health, the Foundation for the National Institutes of Health is providing $6 Million in grants that seek approaches to discover new chemical entities with novel mechanisms of action for insecticides to control malaria.
Application instructions are here. More information in the forum.
Letters of Inquiry are due by June 25, 2010
PATRIC team members attend this bacterial genome annotation workshop whose purpose is to explore the utility of a competition process to advancing the science of gene and genome annotation.
show more...
Critical Assessment of Functional Annotation Experiment (CAFAE) Workshop, Arlington, VA, May 18-19, 2010.
PATRIC team members attend this bacterial genome annotation workshop whose purpose is to explore the utility of a competition process to advancing the science of gene and genome annotation.
Drs. Williams and Sobral's conatruct a new gamma-proteobacteria "tree of life" using PATRIC's data resources.
show more...
A new “tree of life” has been constructed by researchers at the Virginia Bioinformatics Institute (VBI) at Virginia Tech for the gamma-proteobacteria, a large group of medically and scientifically important bacteria that includes Escherichia coli, Salmonella typhimurium, and other disease-causing organisms.
Read full story at Science Daily
PATRIC team members attend this workshop to help develop PATRIC requirements and understand implications of emerging standards on PATRIC infrastructure.
show more...
NCBI Annotation Workshop, Rockville, MD, April 25-27, 2010.
PATRIC team members attend this workshop to help develop PATRIC requirements and understand implications of emerging standards on PATRIC infrastructure.
EuPathDB is pleased to announce the members of the scientific working group
(SWG). The SWG provides EuPathDB with scientific oversight and
guidance. Members of the SWG also review the driving biological projects initiative.
Michael Gottlieb, Chair
Foundation for National Institutes of Health (FNIH)
Broad expertise in parasitology, large integrative projects (chair)
|
Matt Berriman
Wellcome Trust Sanger Institute
Pathogen genomics & functional annotation
|
John Boothroyd
Stanford University
Toxoplasma biology & host-pathogen interactions
|
Greg Buck
Virginia Commonwealth University
Pathogenomics of apicomplexan and kinetoplastid parasites
|
Patrick Duffy
National Institutes of Health (NIAID)
Laboratory & clinical aspects of malaria biology
|
Pascale Gaudet
Northwestern University
Biocuration, ontologies, DIctyostelium
|
Adrian Hehl
University of Zurich
Cellular and molecular biology of Giardia
|
Patrick Keeling
University of British Columbia
Comparative genomics
|
Bill Petri
University of Virginia
Laboratory & clinical aspects of Entamoeba and diarrheal diseases
|
Lincoln Stein
Ontario Cancer Center
Bioinformatics tools & analysis
|
Lu Wang
National Institutes of Health (NHGRI)
Large-scale sequencing
|
Jonathan Wastling
University of Liverpool
Proteomics of apicomplexan parasites
|
Louis Weiss
Albert Einstein College of Medicine
Microsporidia, Toxoplasma, and clinical microbiology
|
Dyann Wirth
Harvard School of Public Health
Plasmodium population biology, drug discovery
|
VectorBase is pleased to announce the members of the Scientific Working Group (SWG). The SWG provides advice about the management, performance, and the needs of the scientific community using VectorBase. Members of the SWG also review the VectorBase Driving Biological Projects (DBP) proposals.
Group details can be viewed here.
VectorBase's BioMart services are now in sync with the gene sets on the rest of the web site. Please see the forum item for further details. Please contact the VectorBase help desk with any queries.
We are pleased to
announce new features in TrichDB 1.1
New Features:
- The gray tool bar at the top of all TrichDB web pages has
been enhanced to include a link to the "My Basket" feature including an
indication of how may items are in the basket.
- The genome browser has been updated to version 1.7, which
offers enhanced features and stability.
- Weighted searches are now available. This feature
allows you to assign an arbitrary weight to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
***Sign up to be a EuPathDB feature evaluator:***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, using the feature(s) and rendering your opinion.
We are pleased to
announce new features in ToxoDB 6.0
Community Annotation:
As many of you know, a community driven annotation effort
has been underway. The response has been great (350 genes
have been annotated by the community so far) and we are happy to now
include these annotations in ToxoDB. The ultimate goal is to
translate this effort into changes in product names and gene models.
As a first step, since this process is still ongoing,
community annotations can be viewed by searching using the
text search tool in ToxoDB and limiting your search to
the Community annotation. To view all genes that have a
community annotation associated with them simply enter the wild card
"*" in the text search field. The annotation itself is
prominently displayed on the gene pages right under the product name.
As noted, this is an ongoing process. We have several more
annotations to incorporate and many more promised.
New Features:
- The gray tool bar at the top of all ToxoDB web pages has
been enhanced to include a link to the "My Basket" feature including an
indication of how may items are in the basket.
- The genome browser has been updated to version 1.7, which
offers enhanced features and stability.
- Weighted searches are now available. This feature
allows you to assign an arbitrary weight to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
***Sign up to be a EuPathDB feature evaluator:***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, testing and rendering your opinion.
We are pleased to
announce new features in PlasmoDB 6.3
New Features:
- The gray tool bar at the top of all PlasmoDB web pages has
been enhanced to include a link to the "My Basket" feature including an
indication of how may items are in the basket.
- The genome browser has been updated to version 1.7, which
offers enhanced features and stability.
- Weighted searches are now available. This feature
allows you to assign an arbitrary weight to the searches (steps) in
your search strategy making it possible for you to view a ranked list
of your results.
***Sign up to be a EuPathDB feature evaluator:***
We welcome you to *sign up* to be an early evaluator
of new features on the EuPathDB sites. This will be an opportunity for you to help us refine and improve features early in the development process.
Signing up is easy. Evaluating will require minimal effort on your part,
typically in the form of answering specific questions, using the feature(s) and rendering your opinion.
Dr. Mao attends a meeting in UK to work with the OBO ontology community in the context of PATRIC data and resources.
show more...
PlasmoDB 4.4 has been discontinued. If there are tools or other functionality that you used on PlasmoDB 4.4 that you need, please contact us to let us know and we will help you regain access to the missing functionality.
Culex pipiens quinquefasciatus as been promoted from sub-species to species, and is now called Culex quinquefasciatus .
In order to reflect this change in the taxonomy, VectorBase has updated all the Culex pipiens (quinquefasciatus) instances to its new taxonomic name, Culex quinquefasciatus.
The name only has been changed, the underlying data remain the same.
The NIAID-funded Eukaryotic Pathogen Genome Database
( EuPathDB.org) invites proposals for Driving Biological
Projects (DBPs) focused on one or more of the eukaryotic
pathogens supported by this Bioinformatics Resource Center
(BRC). We anticipate making two awards of up to $600K each
by July 1, 2010 (a second round of solicitations is expected
in 2012). A two page Letter of Intent (LOI) is due Feb 28,
following which full applications will be solicited by
invitation only, with a deadline of March 31.
Click here to
download the RFP.
The NIAID-funded Bioinformatics Resource Center VectorBase invites proposals for Driving Biological Projects (*s) that focus on infectious diseases research related to human microbial pathogens and invertebrate vectors of human diseases. DBPs must include research that uses high-throughput experimental technologies to functionally characterize the genome, proteome or metabolome to help elucidate how genes, proteins and metabolites may be involved in pathogenesis, antimicrobial resistance or other biological processes of interest in the study of infectious diseases. HTP approaches to functional characterization of genes and proteins may include, for example, microarrays, mass spectrometry, two-hybrid screenings, RNAi experiments, Chip-Seq and RNA-Seq studies and bioassays. Download the full RFP Frequently Asked Questions
We are pleased to announce that the E-RNAi RNAi reagent design tool has been updated with the latest Anopheles gambiae and Aedes aegypti genomes.
Please note, this service is provided by a third party with minor assistance from VectorBase.
We are pleased to announce the implementation a new tool in
CryptoDBDB. The "My
Basket" tool allows you to select various genes from
your searches and add them to your basket (much like a shopping cart).
To add genes to your basket simply click on the basket icon
in a
result list or on a gene page. A green basket icon indicates
that a
genes is already in your basket. You can easily convert your
basket to
a strategy to save it and perform additional searches or combine it
with one of your already existing strategies.
Please note that using the basket feature requires you to
login first.
We are pleased to announce the implementation a new tool in
TrichDB. The "My
Basket" tool allows you to select various genes from
your searches and add them to your basket (much like a shopping cart).
To add genes to your basket simply click on the basket icon
in a
result list or on a gene page. A green basket icon indicates
that a
genes is already in your basket. You can easily convert your
basket to
a strategy to save it and perform additional searches or combine it
with one of your already existing strategies.
Please note that using the basket feature requires you to
login first.
The VectorBase developers finally heard about Twitter and set up a @vectorbase account. We'll use it to let you know about any planned or unplanned system maintenance and any new features and datasets at VectorBase. We look forward to your @vectorbase tweets too!
We are pleased to announce a workshop taught as part of a collaboration between the Pathogen Genomics Group
(Wellcome Trust Sanger Institute, UK), the Eukaryotic Pathogen Bioinformatics Resource Center
(University of Georgia and University of Pennsylvania, US), and the Seattle Biomedical Research Institute (US).
This residential workshop aims to provide experimental biologists with hands-on experience in
genomic-scale data analysis, including genome browsers and comparison tools, methods for
data integration, and resources for sophisticated data mining.
Many examples and exercises will be drawn from the kinetoplastida, including Leishmania spp.,
Trypanosoma brucei, and Trypanosoma cruzi, although applicants working on other protozoan
parasites will also benefit.
!!!!!Application Deadline
is December 18th 2009!!!!!!
For registration and additional information (including scholarships) please click here (Note: you will be diverted to and external site).
As usual, if you have any questions please send us an Email.
EuPathDB published in
Nucleic Acid Research
We are pleased to announce the publication of a paper providing an
update on the EuPathDB genomic resourse. The paper entitled
"EuPathDB: a portal to eukaryotic pathogen databases" is
available here
as an advance access publication.
This
article may be used to cite the TriTrypDB resource in your
publications.
To download this reference to your citation manager please
click here.
Detoxification genes were assayed for expression differences between two strains of Aedes aegypti: Vauclin, from Martinique, which is resistant to temephos and deltamethrin insecticides, and Bora-bora, which is susceptible to insecticides, in both larvae and adults.
The open access article can be found here, and many of the genes mentioned in the paper now have official VectorBase gene symbols, so you can locate their expression pages easily with the following style of URL: http://funcgen.vectorbase.org/ExpressionData/gene/CYP6M6.
This work was a collaborative effort between the Laboratoire d'Ecologie Alpine (LECA) of Grenoble, the Institut de Recherche pour le Développement (IRD) Montpellier, and the LSTM, Liverpool.
The AnoXcel sheet has been updated with the VectorBase-style Gene Identifiers - it is available from the download page.
Please note that we only provide the raw file, there is no possibility to search for information within this file with our search tool.
We are aware the there are some issues with our BLAST tool at the moment.
Since the installation of a newer, faster, application server, the TomCat sotfware (for running Java applications) is not behaving as expected and will sometimes stops completely, preventing any BLAST jobs to be submitted. We are working on this and are trying to figure out what is happening.
Restarting the server is enough to solve the problem (temporarily) so please let us know if you can't run any BLAST jobs by contacting info@vectorbase.org
Dear all,
The PlasmoDB team would like to alert you to two important issues
concerning the presentation of P.
falciparum microarray data in PlasmoDB.
1. Beginning with PlasmoDB 6.0 all profiles using the
Winzeler Affymetrix scrMalaria array were updated to reflect the latest
3D7 genome annotation. Previously, probes were mapped to the P.
falciparum 3D7 genome annotation released in July 2001. The
array was originally designed with a tiling of probes, so the new
annotation has resulted in many genes gaining new probes which did not
previously map to the gene, and dropping old probes which no longer map
(Note: The new mapping allowed up to two mismatches per probe, and only
probes which mapped uniquely, up to two mismatches, were
retained). Previous gene expression profiles were generated
using the MOID algorithm with the 2001 annotation, while the current
profiles were generated using RMA with the new annotation.
Many of the expression profiles exhibit minor differences and a
few changed significantly. Such differences are
expected and should result in a more accurate representation of the
data.
2. An internal error with one the files concerning the P.
falciparum 3D7 gametocyte expression data was discovered.
This error appeared with the release of PlasmoDB 6.0 in July of 2009
and was fixed on October 20th, 2009. Also, please note that
probe remapping and MOID vs. RMA analysis differences described above
also apply to this data set.
We regret any disruption this may cause you. Please do not
hesitate to contact us with any questions or concerns.
Sincerely,
The PlasmoDB Team
We are pleased to announce the publication of a paper
introducing TriTrypDB. The paper entitled
"TriTrypDB: a functional genomic resource for the Trypanosomatidae" is
available here
as an advance access publication.
This
article may be used to cite the TriTrypDB resource in your publications.
To download this reference to your citation manager please
click here.
Community Annotation in
Progress
The Toxoplasma research
community has been involved in improving the genome annotation.
With guidance from the current ToxoDB community advisers
(John Boothroyd, Vern Carruthers, Marc-Jan Gubbles and Michael White),
members of the research community (see below) have been asked to take
the lead in providing high quality annotation on subsets of cellular
and biochemical pathways. The goal is to incorporate the results of
this effort in ToxoDB by the end of the year.
If you have not been contacted yet and feel that you can contribute to
one of the pathways mentioned below, please contact us.
- Micronemes
(Vern Carruthers)
- Rhoptries
- Necks and Bulbs (Peter Bradley)
- Dense
Granules (Marie-France Cesbron-Delauw)
- Chromatin-associated
(Kami Kim)
- Proteases
(Dominique Soldati-Favre)
- Apicoplast
(Stuart Ralph)
- Mitochondrion
(David Roos)
- IMC
(Con Beckers)
- Surface
Proteins (Michael Grigg)
- Bradyzoite
(Louis Weiss)
- Golgi/ER
(Markus Meissner)
- Acidocalcisome
(Silvia Moreno)
- Fatty
Acid Metabolism (Frank Seeber)
- Nucleotide
Metabolism (David Bzik)
- Sugar
Metabolism (Stan Tomavo)
- Glycolipid
Metabolism (Boris Striepen)
- Transcription
(Michael White)
- Translation
(Bill Sullivan)
- Cell
Cycle (Marc-Jan Gubbels)
- Parasitophorous
Vacuole and Membrane (Tony Sinai)
The National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), has awarded a 5-year, $27,670,448 contract to the CyberInfrastructure Group (CIG) of the Virginia Bioinformatics Institute (VBI)
show more...
BLACKSBURG, Va., October 12, 2009 – The National Institute of Allergy and Infectious Diseases (NIAID), part of the National Institutes of Health (NIH), has awarded a 5-year, $27,670,448 contract to the CyberInfrastructure Group (CIG) of the Virginia Bioinformatics Institute (VBI) at Virginia Tech to support the biomedical research community’s work on infectious diseases. The funding will be used to integrate vital information on pathogens, provide key resources and tools to scientists, and help researchers to analyze genomic, proteomic and other data arising from infectious disease research. more…
The 7th issue of the VectorBase newsletter is now available. It is accessible on-line and in PDF from the Newsletter Help Page. You can also subscribe to the Newsletter mailing list to receive it directly in your mail box.
We announce the release of the Anopheles gambiae and Aedes aegypti gene set updates, as well as two additional expression data set. We also have redesigned the Blast Interface - come and check it out!
We would like to announce to the current and future VectorBase user
community that NIAID/NIH has formally renewed the VectorBase contract
for another 5 year cycle starting on 1 October 2009. We would
especially like to thank the many members of the genomics and vector
biology communities who provided letters and email messages endorsing
the VectorBase renewal proposal.
Posted by Bob on behalf of the entire VectorBase Team.
We are pleased to
announce the implementation of new features in ToxoDB:
- Now
after you perform a BlastN against genomic sequence you can link from
the blast output to the correct coordinates in the genome
browser.
This allows you to explore annotation and other functional data in
relation to your BLAST hits.
- Upadated product names for the Surface Antigen Glycoprotein
(SAG) Related Sequences (SRSs) (provided by Michael Grigg).
- The type ahead feature has been implemented for GO term, EC
number and Interpro domain searches.
- You can now edit or delete your user comments.
- You can now modify your user profile, change your password
or your email address.
Please also note that the parent portal site EuPathDB has just been
updated, offering you the ability to explore results across organisms.
Sincerely,
The EuPathDB team.
VectorBase is happy to present a new BLAST interface. We have changed the way you select datasets for searching to simplify selection. Please, have a look and send us any comments or suggestions using the Help system.
Data updates for ESTs and trace files for all species have been included.
A new Anopheles gambiae gene set, AgamP3.5, is available at
VectorBase
The update includes the manual appraisal of existing gene models, including the majority of models from chromosome arms 3L and X. More information about the new gene build can be found here.
The new gene set also includes an update of the expression and comparative data.
A new Aedes aegypti gene set, AaegL1.2, is now accessible at
VectorBase
The update is includes gene models which were deemed supplemental to the initial set and models submitted by the community (to learn how to submit your gene models to VectorBase, click
here).
The new gene set also includes an update of the expression and comparative data.
Anopheles gambiae expression data from two recent publications has been added to VectorBase. The work by Marco Neira Oviedo and colleagues has compared larval salivary gland gene expression to the whole organism. Yury Goltsev and colleagues have measured gene expression during embryonic development and also profiled the cells which form the embryonic serosa (egg shell). See also the forum topic for links and discussion.
We received many user comments during the KCMB meeting and following. Many of these have already been examined by TriTrypDB curators (at GeneDB, SBRI and UGA) and incorporated into the official annotation. 253 genes have been modified since the meeting resulting in a total of 710 genes that have been updated since this current build of TriTrypDB. Click here to see the complete list of genes that have been re-annotated since December 1, 2008.
Annotation of the Kinetoplastida is ramping up again as part of the collaborative efforts of EuPathDB and GeneDB. Annotations are being carried out at the Welcome Trust Sanger Center, the Seattle Biomedical Research Institute and the University of Georgia. Click here to see the list of genes that have been re-annotated since December 1, 2008 when this current build of TriTrypDB was started.
Just added to VectorBase's download section: tab delimited text files containing gene-averaged expression data for all Anopheles gambiae genes. This data is identical to that shown on the gene expression report pages.
Click on the Get Data menu item to visit the download page. The more human-friendly expression reports can be found via the Expression Data menu link.
Coming soon: more powerful expression data retrieval with BioMart... stay tuned!
We have recently completed Stage 1 of this population genomics database
Stage 2 of this system is being designed to store data for multiple taxa, and we are actively looking for users willing to submit vector population genetics data to VectorBase. To help facilitate this process, we are also developing a Workbench feature that will allow users to manage their own data. A small group of users have already been enlisted, but additional beta users are welcome. If you are interested in submitting data and/or in more information on participating as a beta user for the Workbench please contact us at: popgen at vectorbase dot org
Three new experiments are now available in the VectorBase Gene Expression resource:
These are all for Anopheles gambiae but there are experiments for Aedes aegypti in the pipeline.
The 3rd kinetoplastid molecular cell biology meeting will be held from April 26-29th, 2009 in Woods Hole, Massachusetts, USA. For more information click here.
In October 2008 VectorBase initiated a plan for the development of a web-based data repository for information about the population genomics of invertebrate vectors of human disease. This endeavor is being pursued via a subcontract with Dr. Gregory C. Lanzaro, University of California-Davis and Dr. Charles E. Taylor, University of California-Los Angeles. Development of this new functionality of VectorBase will begin with a population genomics database for Anopheles gambiae, now available at VectorBase.
We have recently completed Stage 1 of this A. gambiae database. This first stage includes data for 5,870 individual A. gambiae s.l. collected from 54 sites in Mali and for 3,245 individuals from 51 sites in Cameroon. The data includes species identification ( A. gambiae and A. arabiensis), molecular form (M vs. S), chromosomal form (Forest, Bamako, Mopti and Savanna) and karyotype. A Google Maps API and GoogleEarth provide visualization of these data on maps. Full details on this new component of VectorBase are available here.
We are also developing a Workbench feature that will allow users to manage and manipulate their own data for ongoing studies. We have enlisted a small group of beta users for this feature and are currently inviting additional users to participate. If you are interested in participating as a beta user for the Workbench please contact us at: popgen at vectorbase dot org.
Trace files for G. morsitans and R. prolixus are available for BLAST and Download.
The IscaW1.1 gene set for Ixodes scapularis has now been released by VectorBase and GenBank It can be downloaded and browsed from the Ixodes genomic data page. At GenBank, the genes are annotated as features on scaffold entries
DS611849-DS981340.
We are pleased to report
an award from the Bill & Melinda Gates Foundation in support of a
TrypDB component of the EuPathDB family of databases,
encompassing Leishmania species, Trypanosoma brucei, and
Trypanosoma cruzi. A coordinated award from the Wellcome Trust
will permit enhanced annotation of kinetoplastid parasites.
These projects represent a collaborative effort between the
GeneDB team (at the Wellcome Trust Sanger Institute), colleagues
at the Seattle Biomedical Research Institute, and the EuPathDB
team -- an NIAID-supported Bioinformatics Resource Center. Look
for the first release of TrypDB in Jan 2009!
The fifth issue of the VectorBase newsletter is now available (a PDF version is also available). New data include Insecticide Resistance data and traces for Glossina morsitans and Rhodnius prolixus (also available for blasting). Check out our new forum and try our top tip!
Sign up to receive the Newsletter when it comes out, or just browse it from the website. For more details, see the Newsletter Help Page.
We have now integrated a tabular text summary of microarray-based gene expression data on every gene report page (also known as GeneView). Scroll down to the "Gene DAS Report" panel, check the box next to "Gene Expression Summary" and click on "Update". After the page has refreshed you will see a list of experiments where significant differential expression has been observed. Click on the experiment names to see more details of the expression profiles and statistics.
Note that in Aedes aegypti there are many genes without microarray reporters, and therefore data, at the moment, so your mileage may vary. (More data is coming soon, however.) We have no microarray data at all for Culex pipiens yet.
Free microarrays: Please note that the Pathogen Functional Genomics Resource Center ( PFGRC) at the J. Craig Venter Institute ( JCVI) provides several resources for investigators including microarrays for Giardia lamblia (WB strain, clone C6).
Click here to view a list of available microarrays at PFGRC.
We are pleased to report
an award from the Bill & Melinda Gates Foundation in support of a
TriTrypDB component of the EuPathDB family of databases,
encompassing Leishmania species, Trypanosoma brucei, and
Trypanosoma cruzi. A coordinated award from the Wellcome Trust
will permit enhanced annotation of kinetoplastid parasites.
These projects represent a collaborative effort between the
GeneDB team (at the Wellcome Trust Sanger Institute), colleagues
at the Seattle Biomedical Research Institute, and the EuPathDB
team -- an NIAID-supported Bioinformatics Resource Center. Look
for the first release of TriTrypDB in Jan 2009!
Cassone et al. have performed expression profiling of several field strains of M and S form Anopheles gambiae mosquitoes. Considered across all three developmental periods (presented as separate experiments in VectorBase), differentially expressed genes represented around 1â2% of all expressed genes. One prominent example is this cuticular protein with much higher expression in the M-form compared to S.
We've added a forum to the site to foster communication between site users and VectorBase associates. Feel free to comment and enquire on all things VectorBase!
A link to the forum is located in the gray section of the navigation menu at the top of every page or follow this link.
The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), of the Department of Health and Human Services (DHHS) supports research related to the basic understanding of microbiology and immunology leading to the development of vaccines, therapeutics, and medical diagnostics for the prevention, treatment, and diagnosis of infectious and immune-mediated diseases.
show more...
The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health (NIH), of the Department of Health and Human Services (DHHS) supports research related to the basic understanding of microbiology and immunology leading to the development of vaccines, therapeutics, and medical diagnostics for the prevention, treatment, and diagnosis of infectious and immune-mediated diseases. The NIAID, Division of Microbiology and Infectious Diseases, has a requirement to maintain Bioinformatics Resource Centers, which have become the public repository of genomics and related data generated by NIAID-supported genomics programs and grants and contracts as well as publicly available data generated by the scientific community. more…
The Anopheles Genomes White Paper has been endorsed by the NHGRI and NIAID Human Eukaryotic Pathogens and Invertebrate Disease Vectors Sequencing Target Selection Working Group, and will be assessed later this month by the Coordinating Committee for Sequencing Target Selection. It is available on VectorBase.
The fourth issue of the VectorBase newsletter is now available. Check the new Culex data that are available, play with the improved version of the Gene Expression Resource and try our 'VectorBase Top Tip'! If you wish to receive the newsletter you can sign up or just browse it on the website
A listing of microarrays that are available to Plasmodium researchers has been posted at
http://plasmodb.org/common/microarrays/.
This list is meant to provide in a single location a summary of the publicly
available microarrays targeting Plasmodium genomes (including P. falciparum,
P. berghei, P. yoelii, and P. vivax). Also provided are
details, including, for some of the arrays, an updated mapping to genes in PlasmoDB.
An update to the Culex pipiens gene set, CpipJ1.2, is now available at VectorBase.
Changes from the CpipJ1.1 set involved rectifying some issues with start and stop codons for 87 predictions and the removal of approx. 1,500 predictions which are not thought to represent real gene predictions. These are mostly small 1-2 exon models with no orthologs to any other species. This process involved extensive manual appraisal of prediction in tandem with bioinformatic approaches looking at potential bacterial and transposon-related sequences.
VectorBase encourages community scientists to contribute to the Ixodes annotation effort through submission of their own gene models. Community-generated gene models will be displayed as a separate manual annotation track in the Ixodes Genome Browser. Community annotations will also be considered for incorporation into the final Ixodes gene set, which is targeted for release at Vectorbase and GenBank in a few months.
Community scientists wishing to submit manual annotations may do so through the VectorBase Community Annotation Pipeline. An account is required to submit gene models. To sign-up for an account, click the âUserâ tab in the top-right corner of the VectorBase homepage. To submit a gene model, follow these instructions. For further information regarding submission of community annotations, please contact the Ixodes VectorBase Community Representative, Jason Meyer, at meyerjm@purdue.edu
The proposal for sequencing of additional Plasmodium isolates and species
has been approved.
The whitepaper is available here.
A Genome Browser is now available to view the preliminary annotation of the Ixodes scapularis genome.
The NCBI has decided to follow the near-unanimous consensus in the taxonomic literature and promoted the subspecies Culex pipiens quinquefasciatus to species level, as Culex quinquefasciatus. The NCBI taxid will remain the same (7176).
VectorBase will reflect this taxonomy change some time in the future, but for the moment the web pages and file names at VectorBase will continue to use "Culex pipiens". Gene, transcript and protein identifiers (CPIJxxxxxx), and the assembly identifier (CpipJ), will not be modified.
The third issue of the VectorBase newsletter is now available. Check the new data that are available on the VectorBase site and try our 'VectorBase Top Tip'!
If you wish to receive the newsletter you can sign up or just browse it on the website.
Preliminary annotation of the Ixodes scapularis genome has been released as part of a joint venture between VectorBase, The Broad Institute and JCVI. Data is available for download, or for use with our BLAST server.
Please see the Ixodes scapularis genome page for more details.
The Ixodes scapularis Wikel strain genome sequence assembly is available from VectorBase. The assembly consists of 570,640 contigs arranged into 369,495 scaffolds yielding a combined assembly length of 1.4 Gb. Both contigs and scaffolds can be downloaded and the scaffolds are available for Blast searching.
You can receive updates and get involved through the Ixodes mailing list.
The Culex pipiens gene set is now available in GenBank under the project ID AAWU00000000. The genes and the proteins are available as 'features' via the scaffolds entries - the proteins are also available through the protein section of NCBI.
Want to see an entry? Try the accession DS231813 on the NCBI website.
VectorBase now hosts an electronic mailing list for the Ixodes scapularis genome sequencing project. You can find out more about the mailing lists hosted at VectorBase and subscribe to those you are interested in.
VectorBase BioMart has been relaunched for Aedes aegypti and Anopheles gambiae. The data is now up to date and provided in the standard "Ensembl BioMart" format (which you could previously find at biomart.org and ensembl.org). Rest assured that the data has been generated by the VectorBase team - we just use Ensembl technology.
You may now search and retrieve many types of data in VectorBase BioMart, including GO terms, InterPro domains and microarray reporters. Bulk sequence retrieval is also available.
BioMart is available through the Get Data menu.
Transcript datasets for all organisms have been updated as of December 2007 including 194 K Ixodes scapularis and 76 K Culex pipiens sequences. The datasets are available for download and BLAST searching.
By using the Aedes detox chip microarray, researchers from the French
Laboratory of Alpine Ecology of Grenoble (R. Poupardin, S. Reynaud and
J.P. David), the Liverpool School of Tropical Medicine (C. Strode and
H. Ranson) and the Greek Laboratory of Pesticide Science (J. Vontas) compared
the expression of candidate detoxification genes in Aedes aegypti
larvae exposed to two insecticides and three pollutants commonly found in
polluted environments. This study allowed the isolation of genes that are
induced by distinct subsets of insecticides and pollutants (e.g. AAEL001807/CYP9M9
is induced by atrazin, permethrin and fluoranthen while
AAEL009127/CYP6M11
responds to copper sulfate). Results
of this research work will contribute to decode the molecular mechanisms of
xenobiotic response in mosquitoes and the possible impact of pollutants on
insecticide tolerance. This study will be published shortly in the journal
Insect Biochemistry and Molecular Biology as (Poupardin et al., 2008). The
expression data is available in VectorBase from today.
The second issue of the VectorBase newsletter is available. Come and check
the new data and tools that are available through the VectorBase site!
If you wish to receive the newsletter you can sign up or just browse it on the website.
BLAST in ApiDB allows you to blast against: not only any Crypto, Plasmo and Toxo data set available in their websites, but also against the Theileria Genomes, all available apicomplexan ESTs (from NCBI dbEST) and the available EST assemblies that include Cryptosporidium parvum and Toxoplasma gondii.
HMMER Build and Search are now available on VectorBase. This version of HMMER Search takes advantage of MPI code, allowing it to run its search on multiple machines at the same time. With our equipment, speed results are on the order of twenty times faster than the traditional version. All peptides and supercontigs databases are accessible for searching.
Only one VectorBase HMMER Search job will run at a time, additional jobs will be placed in a queue. Depending on it's popularity and the number of databases per search job, the actual amount of time it takes to complete a search may vary. In my tests, running a search against all databases finished after an hour and forty-five minutes.
The sequencing of the Aedes aegypti genome has equipped researchers at the Liverpool School of Tropical Medicine (Dr. Hilary Ranson and Dr. Clare Strode) to design and produce a âdetox chipâ. This chip houses a complete set of cytochrome P450s, glutathione transferases (GSTs) and carboxyl esterases for the yellow fever mosquito.
As insecticides play a critical role in controlling vector-borne diseases such as malaria and dengue, our interest lies in identifying the genes likely to confer metabolic resistance to insecticides. Using the detox chip we have identified a cohort of genes which were over expressed in two insecticide resistant populations originating from Thailand and Mexico. Of particular interest is the over expression of a large number of P450s which belong to the CYP9 (e.g. probe aaeg-detox-2D7) class and members of the epilson GST class (e.g. probe aaeg-detox-1J3) in resistant Aedes. The detox chip will be used to monitor detoxification gene expression in Aedes populations worldwide with the aim of understanding the dynamics of insecticide resistance.
Multiple sequence alignments generated by the ClustalW tool are now piped into the Jalview applet and may be viewed on the results page along with the raw results.
Please check the ApiDB tutorials offered now in three formats: QuickTime (.mov), Microsift Windows (.avi) and Flash Player (.flv). In ApiDB.org the link to these Website Tutorials appears in the home page, at the bottom of the sidebar, under "Information and Help".
Please check the PlasmoDB tutorials offered now in three formats: QuickTime (.mov), Microsoft Windows (.wmv) and Flash Player (.flv). There is a link to these tutorials in PlasmoDB sidebar, in all pages.
The inaugural VectorBase newsletter is available. The newsletter will be published every quarter bringing you an update on new data and tools available through the VectorBase site.
If you wish to receive the newsletter you can sign up or just browse it on the website.
Wherever in ToxoDB you can type an Annotated Gene's ID you can now also type the ID of an electronically predicted gene (eg, TgGlmHMM_1045). You will be taken to the associated annotated gene (eg, 27.m00868).
~19 million genomic trace reads and ~370,000 BAC end sequences from the I. scapularis genome project are now available at VectorBase for use with BLAST and for download.
VectorBase has moved to the new AgamP3.4 gene build. It incorporates improved models (including manually annotated genes) and changes gene identifiers from the EnsEMBL naming convention (ENSANG*) to the VectorBase naming convention (AGAP*).
For more information about the new gene build, see this document.
For more information on the new naming scheme, and translating between old and new gene identifiers, see this document.
We are serving the tiling array expression data from Halasz et al. (2006) via DAS in the genome browser. This provides a genome wide view of expression levels in adult male and female mosquitoes. To enable the DAS track, go to your favourite genomic region ( example), and click on the "DAS Sources" menu in the "Detailed view" (third image from top), then check the box next to "Tiling Expression". For more information, see this help document.
Please check the new Gene Metrics table with all the organisms with gene information available in ApiDB.
A link to it has been added in the home pages of all ApiDB sites.
In ApiDB.org the Gene Metrics link appears in the home page, at the bottom of the sidebar, under "Information and Help".
The Aedes aegypti Genome paper has been published in the June 22, 2007 issue of Science magazine. For more information, see Science Magazine's Website.
A preliminary assembly of C. muris was obtained at approximately
4.5 X coverage, and
reveals a ~9Mb genome with average cytosine-guanine content (%GC) of ~28.5
%. This assembly is based on ~50,000 reads obtained mostly from a
small-insert library and a fosmid library. This assembly is estimated to
contain ~ 92% of the ~9Mb genome, and consists of 927 contigs arranged into
595 scaffolds.
Final assembly, to be released in late summer, will include an additional 2X
coverage obtained from a medium insert library and possibly >5X additional
coverage obtained with 454 pyrosequencing technology. The final assembly is
expected to contain > 99% of the genome.
Sequencing from the cDNA library provided by Giovanni Widmer is underway.
Average insert length is 1,600 bp. If the library is shown to be random, an
expected >30,000 EST reads will be obtained.
More information is
available on the project's website
(http://msc.tigr.org/c_muris/index.shtml).
Trace reads for the A. gambiae Pimperena (S molecular form) and Mali-NIH (M molecular form) WGS projects are now available for download and for use with BLAST.
The P. humanus PhumU1.1 gene build is now available at VectorBase. For more details, see the P. humanus Genome Homepage for more details.
The C. pipiens quinquefasciatus genome, CpipJ1 assembly is now available at VectorBase. See the C. pipiens Genome Homepage for more details.
The preliminary C. pipiens quinquefasciatus gene build, CpipJ1.0_5 assembly is now available at VectorBase. See the C. pipiens Genome Homepage for more details.
ReAnoXcel is a database of the conceptually translated proteome of A. gambiae developed by Dr. Jose Ribeiro (NIAID) using gene models determined by Ken Vernick's (University of Minnesota) annotation pipeline.
The BioMart query builder and data retrival system has been updated to version 0.5.
The VectorBase Genome Browser has been updated to be concurrent with Ensembl version 43 Genome Browser.
BLAST sequence fetching from the Interactive BLAST Results pages has been rewritten to provide faster result retrieval.
A 0.5 Gene Build for P. humanus is now available for download and for use with BLAST. The VectorBase and TIGR 0.5 gene builds will be merged to form the P. humanus 1.0 Gene Build.
February, 2007 The PFGRC announces that it will conduct a four day course titled, "Introduction to Microarray Technology", April 16-19, 2007.
The NIAID sponsored Pathogen Functional Genomics Resource Center at TIGR announces that it will conduct a four day course titled, "Introduction to Microarray Technology." The workshop will be held on the TIGR campus on April 16-19, 2007. The course will provide an overview of the two-dye spotted microarray platform. The course is designed for hands on participation and attendees will actively learn to make the most of their microarrays through a combination of wet lab and bioinformatics sessions.
The wet lab portion will be taught by PFGRC staff who are directly involved in microarray production. Participants will be guided through probe preparation and hybridization protocols. The training will also consist of detailed discussions on dye coupling, slide pre-hybridization, hybridization, and scanning techniques. Other topics for discussion will include the importance of assessing source material and probe quality, comparisons of various slide surfaces, and printing techniques.
The bioinformatics portion of the course covers an introduction to the stages of microarray data analysis. Major topics include data management, image analysis, normalization, and data mining. Fundamental data analysis concepts will be taught using a mix of lectures and interactive hands-on sessions. Participants will analyze the microarrays they prepared, using the open-source TM4 software suite while working closely with the software developers. CDs containing microarray datasets and the TM4 suite will be provided.
The course will be provided free of charge. Attendees will be responsible for their travel and lodging. This introductory course is one of several offered by the PFGRC on variety of functional genomics topics. Attendance will be limited to 14 participants.
Interested participants will find the preliminary course agenda here, and registration information here.
This updated home page for ApiDB.org provides the same functionality (queries and tools) but it adds a new Genomes table with key information. Also a new link to Data Documentation has been added on the sidebar under "Help and Information". Please send any comments and questions to ApiDB.
TIGR has released the preliminary P. humanus genome assembly and accompanying gene build. Data is available for download and for use with BLAST. For more details, see the P. humanus home page.
The A. gambiae Gene Tool has been updated to be current with Version 42 of the VectorBase / Ensembl Genome Browser. Access it from the A. gambiae home page.
EST datasets for all organisms have been updated to include all ESTs available in GenBank as of January, 2007.
An Invitation and Background Information regarding the Rhodnius Genome Sequencing Project
An international effort is underway that promises to expand and enrich research
on all aspects of reduviid biology, from their role as vectors of Chagas disease
to fundamental basic and evolutionary biology. We are sending out this email to
welcome all scientists who have an interest in and/or are working on Rhodnius
prolixus or other reduviids to participate in and contribute to this effort. As
elaborated on below the total genome sequencing of Rhodnius prolixus is
underway and the initial stages of planning for long term residence, curation
and annotation are under consideration. VectorBase at the University of Notre
Dame, where the mosquito genomes are being housed has kindly set up our list
server to assist us in facilitating broad communication and interaction within
the community. We are grateful to its director Frank Collins for his support and
willingness to assist us.
Our invitation is for all researchers in the reduviid or related fields to be
involved. This is in keeping with our intent to be inclusive, so we welcome any
individuals who may have unintentionally been left off the current list to be
added. You are receiving this email because your name was included in the
initial list which was compiled from various inputs. Please inform us of any
others you feel should be added. If you wish not to be included please let us
know.
With this preamble, the following provides some general background and
information on the Rhodnius Genome Project - how it arose, some of the
activities up to now and some plans and hopes for the future.
In recent years it has been widely recognized that there was a need to have
increased genome sequencing information on reduviids (for example the Triatomine
Genome Initiative in 2003 of scientists from the MERCOSUR countries held in
Montevideo). A concerted effort was undertaken with the submission of a "White
Paper " proposal " The case for sequencing the genome of the blood-feeding
Hemipteran insect Rhodnius prolixus " to the NIH Human Genome Research Institute
in the fall of 2004. This proposal resulted from input from a number of
researchers world-wide and was coordinated by Erwin Huebner (Canada). A copy of
the White Paper can be found on the NIH Human Genome web site.
This application was successful (selected spring of 2005) and the Genome
Sequencing Center at Washington University Medical School , St. Louis was
selected to carry out the large scale genome sequencing.
An organizational meeting was held Nov. 3 -5, 2005 in Rio de Janiero with
participants from Canada, USA, Brazil, Argentina and Uruguay. At this meeting a
Steering Committee to oversee and coordinate the Genome Project and work with
John Spieth ( Washington U Genome Sequencing Center) to accomplish the goals of
the White Paper. The Steering Committee consists of Erwin Huebner (Coordinator),
Antonio Bernardo de Carvalho , Ellen Dotson, Carl Lowenberger, Fernando
Monteiro, Pedro Oliveira, Francisco Panzera and Rolando Rivera Pomar. Other
sub-committees were also established which will become more active once the
genome project progresses further.
An important issue that arose in the initial consideration of which colony
should be the source of DNA for the sequencing was that a survey of some of the
longstanding lab colonies showed that they were in fact a hybrid of prolixus
and a closely related species Rhodnius robustus. It became important to
genotype as many as Rhodnius colonies as possible in order to select a strain
that was not of hybrid origin or too heterogeneous. Ellen Dotson and Fernando
Monteiro played a key role in this and what has emerged is that the colony at
the CDC in Atlanta is indeed prolixus and the best option for the sequencing.
The Steering committee in discussion with John Spieth has selected that colony
as the source for the full scale sequencing.
The plan is also to sequence as many ESTs as possible. Rolando Rivera Pomar has
been conducting a survey to catalogue what EST libraries and the vectors used
exist within the reduviid research community. If you have not already responded
to him or been contact yet please email him and John Spieth any information you
may have. Francisco Panzera has provide valuable information on genome size also
important at this stage.
The Steering Committee has also discussed the possibilities for the permanent
residence, curation and updating of the data base once the genome is sequenced
and assembled. Frank Collins at VectorBase has indicated they are willing to
assist the Rhodnius community, provide advice and help and will provide a home
for the Rhodnius Genome data base. It has been agreed by the Steering Committee
in discussion with Frank that the genome will reside at VectorBase. In addition
to and in concert with the assembly and automated annotation that will be done
at the Washington U Genome Center VectorBase will run automated annotations as
was done with the mosquito genomes. VectorBase will provide long term curation
and maintenance once the Genome Center has released it and submitted sequence
data to Gen Bank. The research community will have access to the data base at
VectorBase and will play an important roll in the essential mechanical
annotation to fully identify genes correctly. It is important that as many
researchers as possible be prepared to contribute to the annotation process. We
invite everyone who has an interest and wishes to please plan to participate in
the annotation of genes in your area of interest, gene families etc. Once the
genome assembly has been completed, auto annotation done and the subsequent
community wide annotation is needed it will be important to identify one or two
persons to serve as annotation coordinators to work with the community and
VectorBase.
The establishment of the List Server at VectorBase provides us an excellent
mechanism to foster communication within our community, disseminate information
and stimulate new interactions and research in reduviid genomics and proteomics.
There are excellent opportunities ahead and we hope you will join us in these
exciting times as we bring this project to fruition. Your input, ideas,
suggestions and participation are welcomed.
We are planning a meeting in Salvador, Brazil, May 19 - 20, 2007 immediately
before the SBBq/IUMB Meeting (May 21-25) to discuss Triatomine Biology and the
Genome Project Progress and Plans. Please contact Pedro Oliveira, who is
coordinating this conference for detail.
Submitted by Erwin Huebner (Coordinator of the Rhodnius Genome Project) on behalf of
the Steering Committee.
Dear Colleagues,
We are pleased to announce a batch of major data and functional updates at the
VectorBase website, including gene expression data and manual annotations for
the entire A. gambiae 2L chromosome arm. Please read on for more details.
Organism & Data Updates
|
•
|
New Culex pipiens Homepage
|
|
|
|
Culex pipiens quinquefasciatus has a new homepage at http://cpipiens.vectorbase.org. Genome data and assembly updates will be posted on this page as assembly nears completion.
|
|
|
|
•
|
Organism Datasets Updated
|
|
|
|
The first full gene set for Aedes aegypti (AaegL1.1) was released in June 2006. The latest gene set for Anopheles gambiae (AgamP3.3, February 2006) is also available. EST and Trace datasets for all organisms have been updated as of October 30th and are available for download and for use with BLAST. All current VectorBase gene sets are also indexed at NCBI.
|
|
|
|
•
|
A. gambiae Gene Tool Available
|
|
|
|
The AnoBase Gene Tool is now available for use at VectorBase and is accessible from the Anopheles gambiae homepage. The Gene Tool provides a full summary of gene meta-data on a single page.
|
|
|
|
•
|
A. gambiae BAC Hybridization Images Available
|
|
|
|
Images of BAC hybridizations in Anopheles gambiae are now accessible from the Anopheles gambiae homepage. Data is searchable by BAC name and Locus.
|
|
|
|
•
|
A. gambiae manual 2L Gene Models Available
|
|
|
|
Manual annotation of the 2L chromosome arm in A. gambiae is now
complete! Pending incorporation into the next A. gambiae gene build,
all new manually curated gene models have been made temporarily
available in the genome browser via DAS. To activate, select
'Manual Annotation' under DAS sources when in the Contig View of the
Genome Browser.
|
|
|
|
•
|
Gene Expression Data Submission Available
|
|
|
Several microarray experiments have been added to the VectorBase
gene expression repository. To submit your data to ArrayExpress via
VectorBase, please visit http://www.vectorbase.org/ExpressionData/. Two experiments
are currently published and publicly available through VectorBase:
* Anopheles gambiae: Vlachou et al. midgut invasion
* Aedes aegypti: Sanders et al. SINV infection
|
|
|
Functionality Updates
VectorBase Genome Browser and Genome Data updated based on Ensembl V.41.Updates include:
|
•
|
Inter-species comparison data Available
|
|
|
|
»
|
Ortholog mapping between Aedes aegypti, Anopheles gambiae, and Drosophila melanogaster genes, plus gene sets from two external organisms: Homo sapiens and C.elegans.
|
|
»
|
DNA/DNA comparison (tBLAT) between Aedes aegypti, Anopheles gambiae, and Drosophila melanogaster genomic sequence.
|
|
|
|
|
•
|
Gene Ontology Associations
|
|
|
|
»
|
GO Term mappings from UniProt to organism genes.
|
|
»
|
GO Term mappings from Drosophila orthologs.
|
|
|
|
|
•
|
Gene expression (microarray) data integration
|
|
|
|
|
|
|
|
•
|
BioMart Initial Testing Release Available
|
|
|
|
An initial version of the VectorBase BioMart database has been released to allow data mining of genomic data through the VectorBase website. BioMart is a query-building tool designed to make it possible to filter and retrieve data such as genes, exons, and proteins in a variety of formats.
|
|
|
|
•
|
BLAST Functionality Improved
|
|
|
|
Specialized displays of results are now available for specific target databases.
Results of BLASTs against genomic sequence databases can now be viewed directly
in the genome browser. Additionally, sequences hit identified by a BLAST query
can be downloaded or used as input for the VectorBase ClustalW service. BLAST is available
in the Tools section.
|
|
|
|
•
|
ClustalW Available
|
|
|
|
The multiple sequence alignment tool ClustalW is now available for use at VectorBase.
Users may upload their own sequences or pass in input directly from VectorBase BLAST results.
ClustalW is available in the Tools section.
|
|
|
As always, feedback is much appreciated! Please don't hesitate to
contact us with questions or comments at help@vectorbase.org - we'd love to hear from you.
Thanks,
The VectorBase Team
|
The Cryptosporidium muris genome sequencing project has been funded by the
National Institute of Allergy and Infections Diseases (NIAID), through the
Microbial Sequencing Center program at the Institute for Genomic Research
(TIGR). This project aims at obtaining a high-quality assembly of C. muris
genome based on 8X sequence coverage, as well as the comparative automated
and manual curation of gene structure predictions of all three
Cryptosporidium genomes sequenced to date. More information is
available on the project's website
( http://msc.tigr.org/c_muris/index.shtml).
All sequence reads and traces will be deposited with GenBank and the genome
assemblies and annotations will be deposited with GenBank and CryptoDB.
Public EST data for all organisms has been updated and is available for use with BLAST and for download.
An initial release of the BioMart data mining tool is now available at VectorBase. BioMart allows you to build complex queries based on gene, transcript and protein characteristics. Access BioMart under Get Data->Data Mining.
Images of BAC In situ hybridizations are now online and searchable under Images->BAC Hybridization Images. You can find images by searching by BACs name or chromosomal region using the VectorBase search engine.
The WU-BLAST programs supporting the CryptoDB BLAST tool were upgraded on 6 July 2006.
The current version is:
2.0MP-WashU [04-May-2006] [linux26-i786-ILP32F64 2006-05-09T12:19:58]
(The former version was
2.0MP-WashU [16-Mar-2005] [linux26-i686-ILP32F64 2005-03-16T08:14:50] )
An updated version of BLAST is now online at VectorBase! You can now sort your results by hit-specific properties, download result hits, and view alignments directly in the A. gambiae & A. aegypti genome browsers. Feedback / Suggestions are of course welcome.
The A. aegypti 1.0 Genebuild, created from a merge of the TIGR and VectorBase 0.5 annotation sets, is now available at VectorBase for download and for use with our BLAST server. These data are also available from NCBI and TIGR.
9+ million trace reads produced as part of the I. scapularis genome sequencing project are now available for download and for use with VectorBase's BLAST service.
Data pertaining to I. scapularis has been moved from the "Additional Organisms" section to it's own homepage, now available at http://iscapularis.vectorbase.org. Genomic trace reads will also be available shortly.
The offices and datacenter of CryptoDB.org are moving to new facilities and
will need to temporarily suspend services. We apologize for the inconvenience.
The P. berghei and P. chabaudi genomes are now incorporated in PlasmoDB 5.0 Beta. The P. berghei genome is accompanied by transcript and protein expression data. We have also continued to make progress migrating functionality from PlasmoDB 4.4 to PlasmoDB 5.0 Beta
The PlasmoMAP project provides computationally generated functional interactions between P. falciparum proteins. This data is now available as a PlasmoDB query.
The Aedes aegypti home page and genome page have been updated to provide a more consistent look across the website. New genomic statistics are available as well.
The Anopheles gambiae genome assembly and gene build has been updated and is now available. The genome assembly, dubed assembly AgamP3, has undergone several changes to remove duplicate / haplotype regions from the genome. Additionally, 140+ additional scaffolds have been maped to chromosomal locations. Details about the new assembly can be found in A.gambiae:Docs->Current Assembly. A new gene build to is also now available based on the new AgamP3 assembly that is viewable in the A. gambiae genome browser.
The Anopheles gambiae home page and genome page have been updated to provide a more consistent look across the website. New genomic statistics are available as well.
We are pleased to announce to the Plasmodium community Version 5.0 Beta of the Plasmodium Genome Database. PlasmoDB 5.0 Beta had an early release at the Vivax Malaria Research conference at TIGR on December 9, 2005, and is now available for community use. Please see PlasmoDB 5.0 Beta released
Basic BLAST services for use with VectorBase data have been put online. Try it out by clicking on Tools->BLAST or by clicking here. Coments about the interface or results formats are welcome.
GBrowse is an interactive generic genome browser developed as part of the GMOD project ( www.gmod.org). GBrowse is used in PlasmoDB to display gene models, EST alignments, and synteny relatonships between Plasmodium species. With GBrowse, we now provide:
- Custom restriction site identification
- Open reading frame identification
- Custom downloads in various formats:
- FASTA
- GFF (2)
- GFF3
- BSML (XML)
- EMBL
- GenBank
- GAME
- GCG
PlamoDB 5.0 Beta has been upgraded to use the latest Genomics Unified Schema (GUS) database system and the new Web Development Kit (WDK). GUS is an open source project ( www.gusdb.org) providing a functional genomics database schema covering sequence annotation, expression data, regulatory interactions, and proteomics using controlled vocabularies and ontologies. GUS also provides software to load data (pugins) and set up a web site (WDK). GUS has been adopted by several groups covering organism ranging from bacteria, protozoa, plants, and mammals.
We are pleased to announce Version 5.0 Beta of the Plasmodium Genome Database. We have incorporated into PlasmoDB the new release of the Plasmodium vivax genome and annotation generated by Jane Carlton's group and the sequencing center at The Institute for Genome Research. We have also used this release to change the way we create the web site and underlying database that should make it easier for us to generate new PlasmoDB releases and be consistent with other ApiDB projects such as CryptoDB and ToxoDB. As a result, we have focused on providing access to P. vivax through the PlasmoDB 5.0 Beta site but have not completely reproduced all the functions in PlasmoDB 4.4. Therefore we have kept the PlasmoDB 4.4 site fully active for continued use. Furthermore we have provided a mechanism to take the results of a PlasmoDB 4.4 query and use it in PlasmoDB 5.0 Beta.
PlasmoDB 5.0 Beta features:
- A new interactive genome browser, based on the GBrowse module of the GMOD project, providing rapid visualization of the parasite genome and gene models, custom restriction site identification, open reading frame identification, and downloads in various formats.
- The complete genome sequence for P. vivax Salvador, along with a complete set of annotated gene models, and syntenic relationships between P. falciparum, P. vivax, and P. yoelii (from Jane Carlton, TIGR).
- Updated P. falciparum annotation, including curated literature citations; updated GO term curation from Andy Berry, Matt Berriman, and the Sanger curation effort; updated EC/pathway mappings from Hagai Ginsburg.
- TIGR Gene Indices for P. vivax, P. falciparum, and P.yoelii.
- Corrected expression profiles from glass-slide arrays for P. falciparum 3D7, HB3 & Dd2 (from Manuel Llinas, Zybnek Bozdech & Joe DeRisi).
- Updated structural data, including information from the structural genomics consortia.
- Protein-protein interaction data for P. falciparum, based on a genome-wide yeast 2-hybrid analysis (LaCount et al. Nature 438:103; 2005).
- New ortholog analyses, using the OrthoMCL algorithm to group genes from several Plasmodium species with other complete eukaryotic and prokaryotic genomes (Chen et al. Nucl Acids Res, in press).
Finally, we are pleased to report a new CD-ROM version of the Plasmodium Genome Database. PlasmoCD provides users lacking reliable high-speed internet connections with full access to the complete genomes of P. falciparum, P. vivax, and P. yoelii, along with gene models, annotation, BLAST hits, ortholog results, expression data, and search functions for querying and integrating these results across species. PlasmoCD 4.1 can be downladed from the PlasmoDB web site, or requested by sending e-mail to <help@PlasmoDB.org>.
Yeast two-hybrid data for P. falciparum described in LaCount et al. Nature 438:103; 2005 has been provided by Doug LaCount (Purdue University) for inclusion into PlasmoDB. Interaction data can be seen on individual gene pages and used to search for interactors of a specificed P. falciparum Gene ID.
OrthoMCL is an algorithm developed to identify orthologs and recent paralogs based on whole genome comparison of protein sequences (Li et al Genome Res. 13:2178; 2003). Previously, OrthoMCL groups were based on comparison of only 16 species in PlasmoDB 4.4. In the current release, OrthoMCL groups were generated with 61 species. These species include the 55 used to generate orthologs groups at the OrthoMCL DB site plus P. vivax, Leishmania major, T. brucei, T. cruzi, Apis mellifera (honeybee) and S. purpatus (sea urchin). The OrthoMCL DB site is described in Feng et al. Nucleic Acids Res. 2006 Database Issue.
We've updated the Aedes aegypti homepage to give it more structure, hopefully a little cleaner, and make it consistent with the Anopheles gambiae homepage.
A document describing the post sequencing roles of the Microbial Sequencing Centers and VectorBase has been posted in the Aedes aegypti documents section.
The Microbial Sequencing Centers of the Broad Institute and The Institute for Genomic Research are pleased to announce the public release of a new assembly for the genome sequence of Aedes aegypti. As the result of increased sequence coverage the overall quality of the sequence, both in terms of coverage and continuity, has been vastly improved. The total size of the contigs in this assembly is 1.3 Gb, and the average coverage within the assembly is now 7.6 fold.
For more information, look at the Aedes aegypti genome page.
C. hominis gene Chro.rrn021 is annotated as having two exons but in CryptoDB release 3.0 this gene is not returned in the query 'Genes With Exon Count'. This error will be fixed in the next release.
The expression profiles obtained from Joe DeRisi's group (UCSF) are processed by the PlasmoDB team for generating the images on the expression pages and for use in the expression profile queries. The orginal HB3 data were processed without background subtraction while the more recent 3D7 and DD2 data were processed with background subtraction. These have now all been processed consistently in the following manner.
Raw data were loess normalized using the default loess implementation in the R package for microarray analysis (marray). The non-background corrected foreground mean was used as input to the loess algorithm. Replicate arrays for the same time point were then averaged to give one value for each timepoint/gene. Data was then smoothed using a least squares fit to each data point and its two neighbors on each side.
The profiles seen in PlasmoDB 4.4 have also been corrected. The correlation table that gives correlations between the different studies (the three DeRisi and the two Winzeler) has been recalculated based on the new profiles. This new data is not yet displayed on 4.4 but will be on 5.0 Beta.
CryptoDB is one of NIAID's Bioinformatics Resource Centers ( BRC) for Biodefense and Emerging or Re-Emerging Infectious Diseases. As part of our efforts for interoperability among the other BRCs and the research community in general we have released the published Cryptosporidium genome features in standardized GFF3 format. The files may be downloaded via FTP from ftp://ftp.tigr.org/pub/brc/ApiDB/Cryptosporidium/.
The home page for the Anopheles gambiae has been redone to provide quicker links to organism-specific data and news. Updates to contact information will also be made this week.
Javascript drop down menus in Ensembl have been fixed so they now work properly with most incarnations of Internet Explorer, Mozilla variants, and Safari. Older browser (Netscape 4, IE for Mac) support may be iffy...
On Wed., June 8, 2005, the National Human Genome Research Institute (NHGRI), one of the National Institutes of Health (NIH), announced today that the Large-Scale Sequencing Research Network will target 13 more organisms as part of its ongoing effort to produce genomic data that will expand biological knowledge and improve human health. Among those to be sequenced are the \"M\" and \"S\" strains of the Anopheles gambiae.
You can read the full press release at: http://www.genome.gov/15014493
A quick \"Getting Started\" page has been put up in the documents sections. Move the mouse over the red circles to look at some of the basic features of VectorBase. E-mail me (rbruggne@nd.edu) if there are any problems.
It's now possible to do some pretty basic searches of the Anopheles genome, including associated images. Much of the functionality is similar to Ensembl, although the proprietary Altavista syntax doesn't work.
Added a new web front end enabling administration of news items from the user homepage.
Support for detailed news entries has been added to the VectorBase Website.
New documentation detailing methods involved in creating the next assembly of the Anopheles gambiae genome have been posted. Look at the A. gambiae specific documentation for more details.
Images of in situ hybridizations of BAC clones previously stored at AnoBase have been imported into VectorBase and associated with genomic data.
Initial setup of the primary VectorBase infrastructure hardware has been completed.
The initial VectorBase website has been put online for testing purposes.
E-Mail services have been set up to provide mass e-mailing and automatic archiving of VectorBase related correspondance.
Representatives from each of the VectorBase subcontracting organizations met in Crete for role discussion.
| | |
|
|