Select Data Source(s):








<>

Pubs&Presentations

The annual workshops are intended to be a thorough and intensive introduction to eukaryotic pathogen database resources that are part of the EuPathDB Bioinformatics Resource Center (including OrthoMCL DB).
Four day bioinformatics workshop for thirty scientists, focusing on effective use of EuPathDB and its component sites AmoebaDB,CryptoDB, GiardiaDB, MicrosporidiaDB, PiroplasmaDB, PlasmoDB, ToxoDB, TrichDB, TriTrypDB and OrthoMCL
Jun 17-20, 2012

Includes doi, PMID, and PMCID links.
show more...
Click to view Abstract and link to full text.
show more...
Influenza Sequence Feature Variant Type (Flu-SFVT) analysis: evidence for a role of NS1 in influenza host range restriction Noronha JM, Liu M, Squires RB, Pickett BE, Hale BG, Air GM, Galloway SE, Takimoto T, Schmolke M, Hunt V, Klem E, Garc�a-Sastre A, McGee M, Scheuermann RH. J Virol. 2012 March doi: 10.1128/JVI.06901-11 PMID: 22398283

Click to view Abstract and link to full text.
show more...
BioHealthBase: informatics support in the elucidation of influenza virus host-pathogen interactions and virulence Squires B, Macken C, Garcia-Sastre A, Godbole S, Noronha J, Hunt V, Chang R, Larsen CN, Klem E, Biersack K, Scheuermann RH. Nucleic Acids Res. (Database issue) 2008:D497-503

In?uenza Research Database: an integrated bioinformatics resource for in?uenza research and surveillance Squires RB, Noronha J, Hunt V, Garc�a-Sastre A, Macken C, Baumgarth N, Suarez D, Pickett BE, Zhang Y, Larsen CN, Ramsey A, Zhou L, Zaremba S, Kumar S, Deitrich J, Klem E, Scheuermann RH. Influenza Other Respi Viruses. 2012 Jan 20. doi: 10.1111/j.1750-2659.2011.00331.x. [Epub ahead of print] PMID: 22260278

Working with Parasite Database Resources

21-26 October 2012 Application deadline: 29 June 2012

This residential workshop held at the Wellcome Trust Genome Campus, Hinxton, Cambridge aims to provide experimental biologists with hands-on experience in genomic-scale data analysis, including genome browsers and comparison tools, methods for data integration, and resources for sophisticated data mining. Examples and exercises will be drawn primarily from Plasmodium and kinetoplastida parasites but are likely also to include other organisms contained in EuPathDB and / or GeneDB, thus applicants working on any protozoan parasite from these resources will benefit.

Send questions to: advancedcourses@hinxton.wellcome.ac.uk
EuPathDB will be involved in planning and running the workshop
October 21-26, 2012

The American Society for Tropical Medicine and Hygiene will hold its 60th annual meeting from December 4-8 at the the Philadelphia Marriott Downtown, Philadelphia, PA, USA.

This year EuPathDB will join other bioinformatics centers at booth #417. Come visit us during exhibit and poster session hours:

Sunday, December 4
7:30 p.m. to 9:30 p.m. Opening Reception

Monday, December 5
9:30 a.m. to 10:30 a.m. Exhibits
Noon to 1:45 p.m. Poster Session A Presentations and Exhibits
3:15 p.m. to 4:15 p.m. Exhibits

Tuesday, December 6
9:30 a.m. to 10:30 a.m. Exhibits
Noon to 1:45 p.m. Poster Session C Presentations and Exhibits
3:15 p.m. to 4:15 p.m. Exhibits

Wednesday, December 7
9:30 a.m. to 10:30 a.m. Exhibits
Noon to 2:30 p.m.Exhibits

EuPathDB team members will be present throughout the meeting and available at booth #417 during exhibit hours.
Dec. 4-8, 2011

ViPR: an open bioinformatics database and analysis resource for virology research Pickett BE, Sadat EL, Zhang Y, Noronha JM, Squires RB, Hunt V, Liu M, Kumar S, Zaremba S, Gu Z, Zhou L, Larson C, Dietrich J, Klem EB, Scheuermann RH. Nucleic Acids Res. 2011 Oct 17. [Epub ahead of print]. PMID: 22006842

Full day tutorial entitled "Quantitative and Qualitative Methods for Human-Subject Visualization Experiments" was well attended and received.
show more...
Presentation entitled “PATRIC: A resource for infectious disease research. Real-life examples used to drive software development”.
show more...
ViPR Hands-on Workshop University of Texas Medical Branch at Galveston ? Aug. 17, 2011

Virus Bioinformatics Resource Centers ? ViPR/IRD (Presentation, Anticipating the Species Jump: Bioinformatics and Information Sharing, McLean VA) ? Mar. 30, 2011

The Informatics Crystal Ball: Mining the Past to Predict the Species Jump Event (Panel presentation, 5th Annual CEIRS Network Meeting, Atlanta GA) ? Apr. 19, 2011

Sequence Feature Variant Type: An Approach to Study Virus-Disease Associations in the Publicly Available Influenza Research Database (Poster presentation, ASV 2011, the 30th Annual Meeting for the American Society for Virology, Minneapolis MN) ? Jul. 16-20, 2011

Genomic sequence analysis tools and a genotype-phenotype association platform in the Virus Pathogen Resource (Poster presentation, 18th International Symposium on Hepatitis C Virus and Related Viruses, Seattle WA) ? Sept. 10, 2011

Influenza Research Database (IRD): A Web-based Resource for Influenza Virus Data and Analysis (Poster presentation, ASV 2011, the 30th Annual Meeting for the American Society for Virology, Minneapolis MN) ? Jul. 16-20, 2011

Data Mining in the Influenza Research Database (IRD) and the Virus Pathogen Resource (ViPR) (Presentation, JCVI-GSCID/NIAID Workshop: Empowering Genomics in Southern Africa ? Applications to Infectious Disease, University of Limpopo, Turfloop Campus, South Africa) ? May 30 ? Jun. 1, 2011

Tool for Identifying Sequence Variations that Correlate with Virus Phenotypic Characteristics (Poster presentation, 9th ASM Biodefense and Emerging Diseases Research Meeting, Washington DC) ? Feb. 6-9, 2011

Genomic sequence analysis tools and a genotype-phenotype association platform in the virus pathogen resource (Poster presentation, ASV 2011, the 30th Annual Meeting for the American Society for Virology, Minneapolis MN) July 16-20, 2011

IRD and ViPR Hands-on Workshop (Workshop presentation/ booklet, JCVI-GSCID/NIAID Workshop: Empowering Genomics in Southern Africa ? Applications to Infectious Disease, University of Limpopo, Turfloop Campus, South Africa) ? May 30 ? Jun. 1, 2011

IRD and ViPR Hands-on Workshop (Workshop presentation/ booklet, JCVI-GSCID/NIAID Workshop: Empowering Genomics in Southern Africa ? Applications to Infectious Disease, University of Limpopo, Turfloop Campus, South Africa) ? May 30 ? Jun. 1, 2011

Conserved Epitope Regions (Presentation, ASV 2011, the 30th Annual Meeting for the American Society for Virology, Minneapolis MN) ? Jul. 16-20, 2011

Conserved epitope regions (CER): elucidation of evolutionarily stable immunologically reactive regions of human H1N1 influenza (Poster presentation, Laboratory of Molecular Pathology Retreat 2011, Dallas TX) ? Mar. 10, 2011

Data Mining in the Influenza Research Database (IRD) and the Virus Pathogen Resource (ViPR) (Presentation, JCVI-GSCID/NIAID Workshop: Empowering Genomics in Southern Africa ? Applications to Infectious Disease, University of Limpopo, Turfloop Campus, South Africa) ? May 30 ? Jun. 1, 2011

Comparative Genomics in the Influenza Research Database (Presentation, Symposium on Systems Biology of Influenza, New Haven CT) ? Jun. 17, 2011


Virus Bioinformatics Resource Centers ? ViPR/IRD (Presentation, Anticipating the Species Jump: Bioinformatics and Information Sharing, McLean VA) ? Mar. 30, 2011

Overview and features of the freely available Virus Pathogen Resource (ViPR) Bioinformatics Resource Center (Workshop demonstration, 18th International Symposium on Hepatitis C Virus and Related Viruses, Seattle WA) ? Sept. 9, 2011

ViPR Hands-on Workshop Purdue University ? Sept. 7, 2011

Presentation entitled “Informatics, Infectious Diseases and Human-Microbe-Environment Interactions”.
show more...
Click to view Abstract and link to full text.
show more...

EuPathDB team members will be present throughout the meeting and available at a booth during all poster sessions.
Sept 11-15, 2011

Infectious disease research is generating an increasing amount of disparate data on pathogenic systems. There is a growing need for resources that effectively integrate, analyze, deliver and visualize these data, both to improve our understanding of infectious diseases and to facilitate the development of strategies for disease control and prevention.

Click to view Abstract and link to full text.

show more...
Presentation entitled “Informatics-driven biological research: Infectious diseases as an example”.
show more...
This residential workshop to be held at the Wellcome Trust Genome Campus, Hinxton, Cambridge, UK from 3-7 October 2011. The aim of this workshop is to provide experimental biologists with hands-on experience in genomic-scale data analysis, including genome browsers and comparison tools, methods for data integration, and resources for sophisticated data mining. Examples and exercises will be drawn primarily from Plasmodium and kinetoplastida parasites but are likely also to include other organisms contained in EuPathDB.org and / or GeneDB.org, thus applicants working on any protozoan parasite from these resources will benefit.
Taught by a collaboration between the Parasite Genomics Group (Wellcome Trust Sanger Institute, UK) and the Eukaryotic Pathogen Bioinformatics Resource Center (University of Georgia & University of Pennsylvania, US), the programme will include lectures on genomics and bioinformatics techniques, interspersed with hands-on exercises.

Application deadline: 1 July 2011


EuPathDB Workshop
October 3-7, 2011

Chapter Title: Comparative genomics and phylogenomics of the Brucella.
show more...
Presenters included Dr. Bruno Sobral, Dr. Mane Shrinivasrao, and Eric Nordberg.
show more...
Click to view Abstract and link to full text.
show more...
Presentation entitled "Informatics Enabled Infectious Disease Research and Development".
show more...
Presentation entitled “Leveraging Bacterial Bioinformatics Resource Center Data for HMP Research”.
show more...
The 11th International Congress on Toxoplasmosis will be held in Ottawa, Canada. The meeting site is a spectacular hotel originating in the golden days of the railroad. The Fairmont Chateau Laurier overlooks several national landmarks, including the Rideau Canal, the Ottawa River, the Canadian Parliament and its Library, Major's Hill Park, Gatineau Park, the ByWard Market, and the National Gallery. The meeting site is surrounded on 3 sides by green spaces and easily accessible bike paths, and bike rentals.

The meeting will start with an evening session on Saturday June 25th, and will end in the evening of Tuesday June 28th, with departure on Wednesday June 29th after breakfast.

The registration website is up and running, and we would like to invite all of you to visit http://www.toxomeeting.org, register your participation, and submit your abstract.

Abstracts are due March 31, 2011

Deadline for registration and payment is April 15, 2011
ToxoDB help desk
June 25-29, 2011

Presentation entitled “Informatics-Driven Infectious Disease Research”.
show more...
The biennial KMCB Meeting provides a forum for discussion of the Molecular Cell Biology of Trypanosomes, Leishmania, and related model organisms, without restrictions on attendance. On-line registration is open and an outline program is available at http://www.mbl.edu/kmcb/2011/index.php. The abstract deadline is midnight (in New York) on WEDNESDAY February 16th. The meeting will commence with registration on the afternoon of Friday April 8, with dinner followed by a scientific session and social mixer, and will conclude by noon on Tuesday April 12. Please plan to stay until the end. Because the 2011 meeting runs for 4 days, there will be a free afternoon on Sunday April 10.
TriTrypDB help desk
April 8-12, 2011

Registration is open for the 7th Annual BioMalPar Conference:
Biology and Pathology of the Malaria Parasite, 16 - 18 May 2011
Please visit the conference website: www.embl.de/training/events/2011/BMP11-01.
The conference will take place in Heidelberg, Germany at the EMBL Advanced Training Centre.
Registration and astract deadline: 23:59 CET on 11 February 2011. Please note that registration without an abstract submission is possible.
The purpose of the BioMalPar annual conference is to bring together malaria researchers from Europe and overseas (including Africa, America, Asia and Australia) in order to present and share recent groundbreaking findings on fundamental malaria research. New insights will also be featured through the use of poster sessions. This meeting will also provide an enriched environment for researchers at all stages of their career to interact with international leaders in the field. The meeting will offer an excellent opportunity for sharing ideas and for potential development of new worldwide collaborations.
PlasmoDBDB help desk and Workshop
May 16-18, 2011

Presentation entitled “PATRIC, Pathogen Portal, and Infectious Disease Ontology”.
show more...
Presentation entitled “Comparative Genomics of Brucella spp.”.
show more...
Presentation entitled “PATRIC – PathoSystems Resource Integration Center”.
show more...
Presentation entitled “The Impacts of New Sequencing Technologies on Infectious Disease Research”.
show more...
Genome sequencing of the M & S molecular forms of An. gambiae were published today showing that speciation is more advanced than previously thought. This greater divergence betwe en the genomes has highlights the need to identify those genes critical for initiating this process. A companion paper describes the development of a SNP genotyping platform for investigating gene flow between the incipient species. Genome browsers for the M & S molecular forms are available through Ve ctorBase:

BioHealthBase: Complete Genomic Characterization of a Pathogenic A.II Strain of Francisella tularensis Subspecies tularensis Beckstrom-Sternberg SM, Auerbach RK, Godbole S, Pearson JV, Beckstrom-Sternberg JS, et al.


BioHealthBase Support for CEIRS Research and Surveillance Squires B., Macken C., Baumgarth N., Garcia-Sastre A., Klem E., Scheuermann R.

NIAID Bioinformatics Resource Centers: New Assets for Pathogen Informatics Greene J., Collins F., Lefkowitz E., Roos D., Scheuermann R., Sobral, B., Stevens R., White, O., Di Francesco, V.

BioHealthBase: A web based resource for studies on the Microsporidia. Gire C., Scheuermann R., Weiss L. Pre-publication abstract, presented at the June 2006 meeting of the International Society of Protistologists(International Workshop on Opportunistic Protists).

The mosquito Culex quinquefasciatus poses a substantial threat to human and veterinary health as a primary vector of West Nile virus (WNV), the filarial worm Wuchereria bancrofti, and an avian malaria parasite. Comparative phylogenomics revealed an expanded canonical C. quinquefasciatus immune gene repertoire compared with those of Aedes aegypti and Anopheles gambiae. Transcriptomic analysis of C. quinquefasciatus genes responsive to WNV, W. bancrofti, and non-native bacteria facilitated an unprecedented meta-analysis of 25 vector-pathogen interactions involving arboviruses, filarial worms, bacteria, and malaria parasites, revealing common and distinct responses to these pathogen types in three mosquito genera. Our findings provide support for the hypothesis that mosquito-borne pathogens have evolved to evade innate immune responses in three vector mosquito species of major medical importance.

Culex quinquefasciatus (the southern house mosquito) is an important mosquito vector of viruses such as West Nile virus and St. Louis encephalitis virus, as well as of nematodes that cause lymphatic filariasis. C. quinquefasciatus is one species within the Culex pipiens species complex and can be found throughout tropical and temperate climates of the world. The ability of C. quinquefasciatus to take blood meals from birds, livestock, and humans contributes to its ability to vector pathogens between species. Here, we describe the genomic sequence of C. quinquefasciatus: Its repertoire of 18,883 protein-coding genes is 22% larger than that of Aedes aegypti and 52% larger than that of Anopheles gambiae with multiple gene-family expansions, including olfactory and gustatory receptors, salivary gland genes, and genes associated with xenobiotic detoxification.

Dr. Sobral’s presentation entitled “Informatics-Driven Infectious Disease Research: PATRIC, An all-bacterial Bioinformatics Resource Center”.
show more...
Dr Cammer’s workshop showcases PATRIC tools, and demonstrates several types of analyses, which illustrate how PATRIC can become an integral part of current research
show more...
Bacillus anthracis, Francisella tularensis, and Yersinia pestis are bacterial pathogens that can cause anthrax, lethal acute pneumonic disease, and bubonic plague, respectively, and are listed as NIAID Category A priority pathogens for possible use as biological weapons. However, the interactions between human proteins and proteins in these bacteria remain poorly characterized leading to an incomplete understanding of their pathogenesis and mechanisms of immune evasion. In this study, we used a high-throughput yeast two-hybrid assay to identify physical interactions between human proteins and proteins from each of these three pathogens.
show more...
Presentation entitled “Prokaryotic Annotation Status”.
show more...
The American Society for Tropical Medicine and Hygiene will hold its 59th annual meeting from November 3rd-7th at the Marriott Atlanta Marquis Hotel in Atlanta, Georgia, USA
EuPathDB team members will be present throughout the meeting and available at a booth during all poster sessions.
Nov. 3-7, 2010

Presentation entitled “Dealing with integration and interoperation: Bioinformatics resource center for bacterial infectious disease research”.
show more...
The XIIth International Congress of Parasitology (ICOPA) will be held in Melbourne, Australia, from 15-20th August 2010 at the new Exhibition and Convention Centre.
EuPathDB will present a lunch workshop on August 15th and will operate a booth (#30) in the exhibit hall every day of the conference.
Aug 15-20, 2010

This workshop will take place from October 18-19, 2010.
Workshop and presentation
October 18-19, 2010

The 10th annual international coccidiosis conference(ICC-10) will be held in Guangzhou, China from October 8th to the 13th.
EuPathDB team members will be present throughout the meeting and will run a help desk during poster sessions and conduct a full day workshop on October 13th.
October 8-13, 2010


EuPathDB team members will be present throughout the meeting and available at a booth during all poster sessions.
Sept 12-16, 2010

Event extraction approaches based on expressive structured representations of extracted information have been a significant focus of research in recent biomedical natural language processing studies. However, event extraction efforts have so far been limited to publication abstracts, with most studies further considering only the specific transcription factor-related subdomain of molecular biology of the GENIA corpus. To establish the broader relevance of the event extraction approach and proposed methods, it is necessary to expand on these constraints. In this study, we propose an adaptation of the event extraction approach to a subdomain related to infectious diseases and present analysis and initial experiments on the feasibility of event extraction from domain full text publications.
show more...
As an obligatory parasite of humans, the body louse (Pediculus humanus humanus) is an important vector for human diseases, including epidemic typhus, relapsing fever, and trench fever. Here, we present genome sequences of the body louse and its primary bacterial endosymbiont Candidatus Riesia pediculicola. The body louse has the smallest known insect genome, spanning 108 Mb. Despite its status as an obligate parasite, it retains a remarkably complete basal insect repertoire of 10,773 protein-coding genes and 57 microRNAs. Representing hemimetabolous insects, the genome of the body louse thus provides a reference for studies of holometabolous insects. Compared with other insect genomes, the body louse genome contains significantly fewer genes associated with environmental sensing and response, including odorant and gustatory receptors and detoxifying enzymes. The unique architecture of the 18 minicircular mitochondrial chromosomes of the body louse may be linked to the loss of the gene encoding the mitochondrial single-stranded DNA binding protein. The genome of the obligatory louse endosymbiont Candidatus Riesia pediculicola encodes less than 600 genes on a short, linear chromosome and a circular plasmid. The plasmid harbors a unique arrangement of genes required for the synthesis of pantothenate, an essential vitamin deficient in the louse diet. The human body louse, its primary endosymbiont, and the bacterial pathogens that it vectors all possess genomes reduced in size compared with their free-living close relatives. Thus, the body louse genome project offers unique information and tools to use in advancing understanding of coevolution among vectors, symbionts, and pathogens.

Background

Transposable elements (TEs) are mobile sequences found in nearly all eukaryotic genomes. They have the ability to move and replicate within a genome, often influencing genome evolution and gene expression. The identification of TEs is an important part of every genome project. The number of sequenced genomes is rapidly rising, and the need to identify TEs within them is also growing. The ability to do this automatically and effectively in a manner similar to the methods used for genes is of increasing importance. There exist many difficulties in identifying TEs, including their tendency to degrade over time and that many do not adhere to a conserved structure. In this work, we describe a homology-based approach for the automatic identification of high-quality consensus TEs, aimed for use in the analysis of newly sequenced genomes.

Results

We describe a homology-based approach for the automatic identification of TEs in genomes. Our modular approach is dependent on a thorough and high-quality library of representative TEs. The implementation of the approach, named TESeeker, is BLAST-based, but also makes use of the CAP3 assembly program and the ClustalW2 multiple sequence alignment tool, as well as numerous BioPerl scripts. We apply our approach to newly sequenced genomes and successfully identify consensus TEs that are up to 99% identical to manually annotated TEs.

Conclusions

While TEs are known to be a major force in the evolution of genomes, the automatic identification of TEs in genomes is far from mature. In particular, there is a lack of automated homology-based approaches that produce high-quality TEs. Our approach is able to generate high-quality consensus TE sequences automatically, requiring the user to only provide a few basic parameters. This approach is intentionally modular, allowing researchers to use components separately or iteratively. Our approach is most effective for TEs with intact reading frames. The implementation, TESeeker, is available for download as a virtual appliance, while the library of representative TEs is available as a separate download.

Dr. Sobral Visits Moscow, Russia to Promote PATRIC's Bioinformatics Tools and Techniques
show more...
With an obligate intracellular lifestyle, Alphaproteobacteria of the order Rickettsiales have inextricably coevolved with their various eukaryotic hosts, resulting in small, reductive genomes and strict dependency on host resources. Unsurprisingly, large portions of Rickettsiales genomes encode proteins involved in transport and secretion. One particular transporter that has garnered recent attention from researchers is the type IV secretion system (T4SS).
show more...
Systems-biology and infectious-disease (host-pathogen-environment) research and development is becoming increasingly dependent on integrating data from diverse and dynamic sources. Maintaining integrated resources over long periods of time presents distinct challenges. This review describes experiences and lessons learned from integrating data in two five-year projects focused on pathosystems biology
show more...
Murine typhus is a flea-borne febrile illness that is caused by the obligate intracellular bacterium, Rickettsia typhi. The cat flea, Ctenocephalides felis, acquires R. typhi by imbibing a bloodmeal from a rickettsemic vertebrate host. To explore which transcripts are expressed in the midgut in response to challenge with R. typhi, cDNA libraries of R. typhi-infected and uninfected midguts of C. felis were constructed.
show more...
Dr. Bruno Sobral travels to Europe to feature PATRIC and Pathogen Portal resources and to discuss opportunities for surveillance and epidemiological information and data exchange with PATRIC.
show more...
We are developing a set of ontologies dealing with vector-borne diseases as well as the arthropod vectors that transmit them. After building ontologies for mosquito and tick anatomy we continued this project with an ontology of insecticide resistance followed by a series of ontologies that describe malaria as well as physiological processes of mosquitoes that are relevant to, and involved in, disease transmission. These will later be expanded to encompass other vector-borne diseases as well as non-mosquito vectors. The aim of the whole undertaking, which is worked out in the frame of the international IDO (Infectious Disease Ontology) project, is to provide the community with a set of ontological tools that can be used both in the development of specific databases and, most importantly, in the construction of decision support systems (DSS) to control these diseases.

TriTrypDB: a functional genomic resource for the Trypanosomatidae

Nucleic Acids Research 2010 38(Database issue):D457-D462; doi:10.1093/nar/gkp851

Aslett M, Aurrecoechea C, Berriman M, Brestelli J, Brunk BP, Carrington M, Depledge DP, Fischer S, Gajria B, Gao X, Gardner MJ, Gingle A, Grant G, Harb OS, Heiges M, Hertz-Fowler C, Houston R, Innamorato F, Iodice J, Kissinger JC, Kraemer E, Li W, Logan FJ, Miller JA, Mitra S, Myler PJ, Nayak V, Pennington C, Phan I, Pinney DF, Ramasamy G, Rogers MB, Roos DS, Ross C, Sivam D, Smith DF, Srinivasamoorthy G, Stoeckert CJ Jr., Subramanian S, Thibodeau R, Tivey A, Treatman C, Velarde G, Wang H.

TriTrypDB (http://tritrypdb.org) is an integrated database providing access to genome-scale datasets for kinetoplastid parasites, and supporting a variety of complex queries driven by research and development needs. TriTrypDB is a collaborative project, utilizing the GUS/WDK computational infrastructure developed by the Eukaryotic Pathogen Bioinformatics Resource Center (EuPathDB.org) to integrate genome annotation and analyses from GeneDB and elsewhere with a wide variety of functional genomics datasets made available by members of the global research community, often pre-publication. Currently, TriTrypDB integrates datasets from Leishmania braziliensis, L. infantum, L. major, L. tarentolae, Trypanosoma brucei and T. cruzi. Users may examine individual genes or chromosomal spans in their genomic context, including syntenic alignments with other kinetoplastid organisms. Data within TriTrypDB can be interrogated utilizing a sophisticated search strategy system that enables a user to construct complex queries combining multiple data types. All search strategies are stored, allowing future access and integrated searches. 'User Comments' may be added to any gene page, enhancing available annotation; such comments become immediately searchable via the text search, and are forwarded to curators for incorporation into the reference annotation when appropriate.

PMID: 19843604

Anaplasma and related Ehrlichia spp. are important tick-borne, Gram-negative bacterial pathogens of livestock and humans that cause acute infection and disease and can persist. Immunization of cattle with an Anaplasma marginale fraction enriched in outer membranes (OM) can provide complete protection against disease and persistent infection. Serological responses of OM vaccinees to the OM proteome previously identified over 20 antigenic proteins, including three type IV secretion system (T4SS) proteins, VirB9-1, VirB9-2, and VirB10.
show more...
Dr. Stephen Cammer makes a presentation at the 3rd Center for Structural Genomics of Infectious Diseases (CSGID) Annual Meeting in Chicago, IL.
show more...
The phylogeny of the large bacterial class Gammaproteobacteria has been difficult to resolve. Here we apply a telescoping multiprotein approach to the problem for 104 diverse gammaproteobacterial genomes, based on a set of 356 protein families for the whole class and even larger sets for each of four cohesive subregions of the tree. Although the deepest divergences were resistant to full resolution, some surprising patterns were strongly supported.
show more...
EuPathDB is organizing a workshop in Montevideo, Uruguay entitled "Working with Pathogen Genomes" (March 16-19) that will be held right after the International Society for Computational Biology Latin-America Conference. The workshop will include hands-on training sessions on the use of EuPathDB databases (ie. PlasmoDB, TriTrypDB, OrthoMCL, etc.), metabolic pathway reconstruction (sponsored by NIAID), TDRtargets and SchistoDB.

The deadline for this workshop is January 20th. Application forms may be downloaded here and emailed to help@eupathdb.org
EupathDB instructors will teach the effective use of EuPathDB and its component sites CryptoDB, GiardiaDB, PlasmoDB, ToxoDB, TrichDB and TriTrypDB
March 16-19, 2010

Working with Parasite Database Resources Workshop -- note: this workshop was postponed due to volcano activity.
EupathDB instructors will teach the effective use of EuPathDB and its component sites CryptoDB, GiardiaDB, PlasmoDB, ToxoDB, TrichDB and TriTrypDB
October 22-26, 2010

Phylogenomics reveals extreme gene loss in typhus group (TG) rickettsiae relative to the levels for other rickettsial lineages. We report here a curious protease-encoding gene (ppcE) that is conserved only in TG rickettsiae. As a possible determinant of host pathogenicity, ppcE warrants consideration in the development of therapeutics against epidemic and murine typhus.
show more...
EuPathDB members will be at the 58th annual American Society for Tropical Medicine and Hygiene (ASTMH) meeting that will be held in Washington D.C., USA.

Come see us at our booth in the exhibit hall.

Exhibit Hall Hours:

Nov 18, 2009 7:30pm - 9:30pm
Nov 19, 2009 9:30am - 10:30am
Nov 19, 2009 12:00pm - 1:30pm
Nov 19, 2009 3:00pm - 4:00pm
Nov 20, 2009 9:30am - 10:30am
Nov 20, 2009 12:00pm - 1:30pm
Nov 20, 2009 3:00pm - 4:00pm
Nov 21, 2009 9:30am - 10:30am
Nov 21, 2009 12:00pm - 2:30pm

EuPathDB team members will be present throughout the meeting and available at a booth during all poster sessions.
Nov. 18-22, 2009

The annual workshops are intended to be a thorough and intensive introduction to eukaryotic pathogen database resources that are part of the EuPathDB Bioinformatics Resource Center ( AmoebaDB, CryptoDB, GiardiaDB, MicrosporidiaDB, PlasmoDB, ToxoDB, TrichDB, TriTrypDB and OrthoMCL DB).
Four day bioinformatics workshop for thirty scientists, focusing on effective use of EuPathDB and its component sites CryptoDB, GiardiaDB, PlasmoDB, ToxoDB, TrichDB and TriTrypDB
Jun 6-9, 2010

Dr. Rebecca Wattam gives a presentation entitled, "Pathosystems Resource Integration Center for Bacterial Diseases" and presents a poster with the same title.
show more...
The 20th annual molecular parasitology meeting will be held in Woods Hole, MA from September 13th to the 17th.
EuPathDB team members will be present throughout the meeting and available at a booth during all poster sessions.
Sept 13-17, 2009

We are developing a set of ontologies that deal with vector-borne diseases and the arthropod vectors that transmit them. For practical reasons (application priorities), we initiated this project with an ontology of insecticide resistance followed by a series of ontologies that describe malaria as well as physiological processes of mosquitoes that are relevant to, and involved in, disease transmission. These will be expanded to encompass other vector-borne diseases as well as non-mosquito vectors. The aim of the whole undertaking, which is worked out in the frame of the international IDO (Infectious Disease Ontology) project, is to provide the community with a set of ontological tools that can be used both in the development of specific databases and, most importantly, in the construction of decision support systems to control these diseases.

PlasmoDB: a functional genomic database for malaria parasites

Nucleic Acids Res. 2009. 37:D539-D543

Aurrecoechea, C., J. Brestelli, B. P. Brunk, J. Dommer, S. Fischer, B. Gajria, X. Gao, A. Gingle, G. Grant, O. S. Harb, M. Heiges, F. Innamorato, J. Iodice, J. C. Kissinger, E. Kraemer, W. Li, J. A. Miller, V. Nayak, C. Pennington, D. F. Pinney, D. S. Roos, C. Ross, C. J. Stoeckert, Jr., C. Treatman, and H. Wang

PlasmoDB (http://PlasmoDB.org) is a functional genomic database for Plasmodium spp. that provides a resource for data analysis and visualization in a gene-by-gene or genome-wide scale. PlasmoDB belongs to a family of genomic resources that are housed under the EuPathDB (http://EuPathDB.org) Bioinformatics Resource Center (BRC) umbrella. The latest release, PlasmoDB 5.5, contains numerous new data types from several broad categories--annotated genomes, evidence of transcription, proteomics evidence, protein function evidence, population biology and evolution. Data in PlasmoDB can be queried by selecting the data of interest from a query grid or drop down menus. Various results can then be combined with each other on the query history page. Search results can be downloaded with associated functional data and registered users can store their query history for future retrieval or analysis.

PMID: 18957442

GiardiaDB and TrichDB: integrated genomic resources for the eukaryotic protist pathogens Giardia lamblia and Trichomonas vaginalis

Nucleic Acids Res. 2009. 37:D526-D530

Aurrecoechea, C., J. Brestelli, B. P. Brunk, J. M. Carlton, J. Dommer, S. Fischer, B. Gajria, X. Gao, A. Gingle, G. Grant, O. S. Harb, M. Heiges, F. Innamorato, J. Iodice, J. C. Kissinger, E. Kraemer, W. Li, J. A. Miller, H. G. Morrison, V. Nayak, C. Pennington, D. F. Pinney, D. S. Roos, C. Ross, C. J. Stoeckert, Jr., S. Sullivan, C. Treatman, and H. Wang

GiardiaDB (http://GiardiaDB.org) and TrichDB (http://TrichDB.org) house the genome databases for Giardia lamblia and Trichomonas vaginalis, respectively, and represent the latest additions to the EuPathDB (http://EuPathDB.org) family of functional genomic databases. GiardiaDB and TrichDB employ the same framework as other EuPathDB sites (CryptoDB, PlasmoDB and ToxoDB), supporting fully integrated and searchable databases. Genomic-scale data available via these resources may be queried based on BLAST searches, annotation keywords and gene ID searches, GO terms, sequence motifs and other protein characteristics. Functional queries may also be formulated, based on transcript and protein expression data from a variety of platforms. Phylogenetic relationships may also be interrogated. The ability to combine the results from independent queries, and to store queries and query results for future use facilitates complex, genome-wide mining of functional genomic data.

PMID: 18824479


PlasmoDB Workshop
May 18-20, 2009


Presentation -- Introduction to the EuPathDB Bioinfomatics Resource
Feb 24-28, 2009

A forum for anyone working on or interested in the Molecular Cell Biology of Trypanosomes and Leishmania and related model organisms.
TriTrypDB help desk
April 26-29, 2009


PlasmoDB Presentation and EuPathDB Workshop
May 24-28, 2009

The annual workshops are intended to be a thorough and intensive introduction to eukaryotic pathogen database resources that are part of the ApiDB/EuPathDB Bioinformatics Resource Center (CryptoDB, GiardiaDB, PlasmoDB, ToxoDB, TrichDB, TriTrypDB and OrthoMCL DB).
Four day bioinformatics workshop for thirty scientists, focusing on effective use of EuPathDB and its component sites CryptoDB, GiardiaDB, PlasmoDB, ToxoDB, TrichDB and TriTrypDB
Jun 7-10, 2009


PlasmoDB Help Desk
May 28-29, 2009


Presentation and ToxoDB Help Desk
Jun 19-23, 2009


PlasmoDB workshop
Apr 5-7, 2006


Discussion of bioinformatics and annotation standards for microbial genomes
Aug 27-29, 2006


Booth with computers for hands-on experience with EuPathDB/CryptoDB/GiardiaDB/PlasmoDB/ToxoDB/TrichDB
Dec 7-11, 2008


Keynote lecture on parasite genome databases and comparative genomics; ApiDB/CryptoDB/PlasmoDB/ToxoDB help desk
Nov 6-9, 2006


Booth with computers for hands-on experience with ApiDB/CryptoDB/PlasmoDB/ToxoDB
Nov 4-8, 2007


Four day bioinformatics workshop for thirty apicomplexan researchers, focusing on effective use of ApiDB/CryptoDB/PlasmoDB/ToxoDB.
Jun 3-6, 2007

ToxoDB: an integrated Toxoplasma gondii database resource

Nucleic Acids Res. 2007. 36:D553-6

B. Gajria, A. Bahl, J. Brestelli, J. Dommer, S. Fischer, X. Gao, M. Heiges, J. Iodice, J. C. Kissinger, A. J. Mackey, et al.

ToxoDB (http://ToxoDB.org) is a genome and functional genomic database for the protozoan parasite Toxoplasma gondii. It incorporates the sequence and annotation of the T. gondii ME49 strain, as well as genome sequences for the GT1, VEG and RH (Chr Ia, Chr Ib) strains. Sequence information is integrated with various other genomic-scale data, including community annotation, ESTs, gene expression and proteomics data. ToxoDB has matured significantly since its initial release. Here we outline the numerous updates with respect to the data and increased functionality available on the website.

PMID: 18003657

ApiDB: integrated resources for the apicomplexan bioinformatics resource center

Nucleic Acids Research. 2007. 35:D427-30

Cristina Aurrecoechea, Mark Heiges, Haiming Wang, Zhiming Wang, Steve Fischer, Philippa Rhodes, John Miller, Eileen Kraemer, Christian J. Stoeckert, Jr., David S. Roos and Jessica C. Kissinger

ApiDB (http://ApiDB.org) represents a unified entry point for the NIH-funded Apicomplexan Bioinformatics Resource Center (BRC) that integrates numerous database resources and multiple data types. The phylum Apicomplexa comprises numerous veterinary and medically important parasitic protozoa including human pathogenic species of the genera Cryptosporidium, Plasmodium and Toxoplasma. ApiDB serves not only as a database in its own right, but as a single web-based point of entry that unifies access to three major existing individual organism databases (PlasmoDB.org, ToxoDB.org and CryptoDB.org), and integrates these databases with data available from additional sources. Through the ApiDB site, users may pose queries and search all available apicomplexan data and tools, or they may visit individual component organism databases.

PMID: 17098930

The PathoSystems Resource Integration Center (PATRIC) is one of eight Bioinformatics Resource Centers (BRCs) funded by the National Institute of Allergy and Infection Diseases (NIAID) to create a data and analysis resource for selected NIAID priority pathogens, specifically proteobacteria of the genera Brucella, Rickettsia and Coxiella, and corona-, calici- and lyssaviruses and viruses associated with hepatitis A and E.
show more...
PlasmoDB v5: New Looks, New Genomes

Trends in Parasitology. 2006. 22(12): 543-546

Christian J. Stoeckert Jr., Steve Fisher, Jessica Kissinger, Mark Heiges, Cristina Aurrecoechea, Bindu Gajria, David S. Roos

Version 5.1 of PlasmoDB, a resource for malaria parasite genomic and functional genomics datasets, was released in August 2006. This new release includes additional Plasmodium genomes and a newly designed website. The new site reflects the status of PlasmoDB as a member of a linked family of Apicomplexan databases.

PMID: 17029963

PlasmoDB: The Plasmodium Genomics and Functional Genomics Resource

In silico Genomics and Proteomics: Functional Annotation of Genomes and Proteins. Nicola Mulder and Rolf Apweiler (eds.). Nova Science Publishers. 2006

Patricia L. Whetzel, Shailesh V. Date, Kobby Essien, Martin J. Fraunholz, Bindu Gajria, Gregory R. Grant, John Iodice, Jessica C. Kissinger, Philip T. Labo, Arthur J. Milgram, David S. Roos, and Christian J. Stoeckert Jr.



ISBN: 1-59454-995-8

SynView: A GBrowse-compatible Approach to Visualizing Comparative Genome Data

Bioinformatics. 2006. 22(18):2308-2309

Haiming Wang, Yanqi Su, Aaron J. Mackey, Eileen T. Kraemer and Jessica C. Kissinger

SUMMARY: We present SynView, a simple and generic approach to dynamically visualize multi-species comparative genome data. It is a light-weight application based on the popular and configurable web-based GBrowse framework. It can be used with a variety of databases and provides the user with a high degree of interactivity. The tool is written in Perl and runs on top of the GBrowse framework. It is in use in the PlasmoDB (http://www.PlasmoDB.org) and the CryptoDB (http://www.CryptoDB.org) projects and can be easily integrated into other cross-species comparative genome projects. AVAILABILITY: The program and instructions are freely available at http://www.ApiDB.org/apps/SynView/ CONTACT: jkissing@uga.edu.

PMID: 16844709

CryptoDB: a Cryptosporidium bioinformatics resource update

Nucleic Acids Res. 2006 Jan 1;34:D419-22

Heiges M, Wang H, Robinson E, Aurrecoechea C, Gao X, Kaluskar N, Rhodes P, Wang S, He CZ, Su Y, Miller J, Kraemer E, Kissinger JC.

The database, CryptoDB (http://CryptoDB.org), is a community bioinformatics resource for the AIDS-related apicomplexan-parasite, Cryptosporidium. CryptoDB integrates whole genome sequence and annotation with expressed sequence tag and genome survey sequence data and provides supplemental bioinformatics analyses and data-mining tools. A simple, yet comprehensive web interface is available for mining and visualizing the data. CryptoDB is allied with the databases PlasmoDB and ToxoDB via ApiDB, an NIH/NIAID-fundedBioinformatics Resource Center. Recent updates to CryptoDB include the deposition of annotated genome sequences for Cryptosporidium parvum and Cryptosporidium hominis, migration to a relational database (GUS), a new query and visualization interface and the introduction of Web services.

PMID: 16381902

OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups

Nucleic Acids Res. 2006 Jan 1;34:D363-8

Feng Chen, Aaron J. Mackey, Christian J. Stoeckert Jr. and David S. Roos

The OrthoMCL database (http://orthomcl.cbil.upenn.edu) houses ortholog group predictions for 55 species, including 16 bacterial and 4 archaeal genomes representing phylogenetically diverse lineages, and most currently available complete eukaryotic genomes: 24 unikonts (12 animals, 9 fungi, microsporidium, Dictyostelium, Entamoeba), 4 plants/algae and 7 apicomplexan parasites. OrthoMCL software was used to cluster proteins based on sequence similarity, using an all-against-all BLAST search of each species' proteome, followed by normalization of inter-species differences, and Markov clustering. A total of 511,797 proteins (81.6% of the total dataset) were clustered into 70,388 ortholog groups. The ortholog database may be queried based on protein or group accession numbers, keyword descriptions or BLAST similarity. Ortholog groups exhibiting specific phyletic patterns may also be identified, using either a graphical interface or a text-based Phyletic Pattern Expression grammar. Information for ortholog groups includes the phyletic profile, the list of member proteins and a multiple sequence alignment, a statistical summary and graphical view of similarities, and a graphical representation of domain architecture. OrthoMCL software, the entire FASTA dataset employed and clustering results are available for download. OrthoMCL-DB provides a centralized warehouse for orthology prediction among multiple species, and will be updated and expanded as additional genome sequence data become available.

PMID: 16381887

Plasmodium research in the postgenomic era

Trends Parasitol. 2006 22(1):1-4

Duraisingh M, Ferdig MT, Stoeckert CJ, Volkman SK, McGovern VP

The complete genomic sequence of Plasmodium falciparum strain 3D7 was published in October 2002. At the Next Steps in Malaria Research meeting in April 2005, the next practical steps were considered and the priorities ranked for postgenomic research in Plasmodium. The high-throughput approaches that will help to answer the major biological questions regarding Plasmodium should, like the genome project itself, build community-shared resources, and efforts must be made to help researchers ready themselves to use the tools that will become available.

PMID: 16311071

Getting the most out of bioinformatics resources

In: Malaria Parasites, A.P. Waters & C.J. Janse, editors. Horizon, Norfolk UK. 2004

J.C. Kissinger and D.S. Roos



ISBN: 0-9542464-6-2

Composite genome map and recombination parameters derived from three archetypal lineages of Toxoplasma gondii

Nucl. Acids Res. 2005 33:2980-2992

Khan, A., S. Taylor, C. Su, A.J. Mackey, J. Boyle, R. Cole, D. Glover, K. Tang, I.T. Paulsen, M. Berriman, J.C. Boothroyd, E.R. Pfefferkorn, J.P. Dubey, J.W. Ajioka, D.S. Roos, J.C. Wootton and L.D. Sibley

Toxoplasma gondii is a highly successful protozoan parasite in the phylum Apicomplexa, which contains numerous animal and human pathogens. T.gondii is amenable to cellular, biochemical, molecular and genetic studies, making it a model for the biology of this important group of parasites. To facilitate forward genetic analysis, we have developed a high-resolution genetic linkage map for T.gondii. The genetic map was used to assemble the scaffolds from a 10X shotgun whole genome sequence, thus defining 14 chromosomes with markers spaced at approximately 300 kb intervals across the genome. Fourteen chromosomes were identified comprising a total genetic size of approximately 592 cM and an average map unit of approximately 104 kb/cM. Analysis of the genetic parameters in T.gondii revealed a high frequency of closely adjacent, apparent double crossover events that may represent gene conversions. In addition, we detected large regions of genetic homogeneity among the archetypal clonal lineages, reflecting the relatively few genetic outbreeding events that have occurred since their recent origin. Despite these unusual features, linkage analysis proved to be effective in mapping the loci determining several drug resistances. The resulting genome map provides a framework for analysis of complex traits such as virulence and transmission, and for comparative population genetic studies.

PMID: 15911631

Themes and variations in apicomplexan parasite biology

Science. 2005 309:72-73

Roos, D.S.



PMID: 15994520

Protozoan genomics for drug discovery

Nature Biotechnol. 2005 23:1089-1091

Chaudhary, K., and D.S. Roos



PMID: 16151400

The transcriptome of Toxoplasma gondii

BMC Bioinformatics 2005:3-26

Radke, J.R., M.S. Behnke, A.J. Mackey, J.B. Radke, D.S. Roos and M.W. White

BACKGROUND: Toxoplasma gondii gives rise to toxoplasmosis, among the most prevalent parasitic diseases of animals and man. Transformation of the tachzyoite stage into the latent bradyzoite-cyst form underlies chronic disease and leads to a lifetime risk of recrudescence in individuals whose immune system becomes compromised. Given the importance of tissue cyst formation, there has been intensive focus on the development of methods to study bradyzoite differentiation, although the molecular basis for the developmental switch is still largely unknown. RESULTS: We have used serial analysis of gene expression (SAGE) to define the Toxoplasma gondii transcriptome of the intermediate-host life cycle that leads to the formation of the bradyzoite/tissue cyst. A broad view of gene expression is provided by >4-fold coverage from nine distinct libraries (approximately 300,000 SAGE tags) representing key developmental transitions in primary parasite populations and in laboratory strains representing the three canonical genotypes. SAGE tags, and their corresponding mRNAs, were analyzed with respect to abundance, uniqueness, and antisense/sense polarity and chromosome distribution and developmental specificity. CONCLUSION: This study demonstrates that phenotypic transitions during parasite development were marked by unique stage-specific mRNAs that accounted for 18% of the total SAGE tags and varied from 1-5% of the tags in each developmental stage. We have also found that Toxoplasma mRNA pools have a unique parasite-specific composition with 1 in 5 transcripts encoding Apicomplexa-specific genes functioning in parasite invasion and transmission. Developmentally co-regulated genes were dispersed across all Toxoplasma chromosomes, as were tags representing each abundance class, and a variety of biochemical pathways indicating that trans-acting mechanisms likely control gene expression in this parasite. We observed distinct similarities in the specificity and expression levels of mRNAs in primary populations (Day-6 post-sporozoite infection) that occur prior to the onset of bradyzoite development that were uniquely shared with the virulent Type I-RH laboratory strain suggesting that development of RH may be arrested. By contrast, strains from Type II-Me49B7 and Type III-VEGmsj contain SAGE tags corresponding to bradyzoite genes, which suggests that priming of developmental expression likely plays a role in the greater capacity of these strains to complete bradyzoite development.

PMID: 16324218

Functional genomics databases on the web

Cell Microbiol. 2005 7(8), 1053-9

Stoeckert CJ Jr.

Experiments involving high-throughput methods for measuring transcripts, proteins and metabolites constitute the area of functional genomics. These experiments are highly context dependent and require much more detail about the experimental design, sample and protocols used than in genomics. Functional genomics databases are needed that follow established and emerging standards. Functional genomic databases are not yet very common; however, there are a few focused on microbial genomes and a couple integrative systems are available for setting up functional genomics databases.

PMID: 16008573