BHI 2023 Data Challenge Competition

Prize will be given to top teams.

Title: Integrating Artificial Intelligence (AI) methods and tools with Biomedical and Health Informatics (BHI) to combat Pandemics.

BHI 2023 Data Challenge Competition is continuing on as Phase 2 of COVID-19 Data Hackathon Competitions! Competitions are now live, don’t wait! The COVID-19 Pandemic impacted all our lives. Understanding and evaluating data related to the pandemic remains critical.

The final award ceremony will be held on the last day of the BHI 2023 conference (15 October) in a hybrid format.

What: We have five challenges to choose from – details for each project are provided below.

When: Preliminary Submissions are due Friday, 29 September 2023 (see timeline below for more details)


  1. Register your team for an account at by Tuesday, 5 September 2023!
  2. Email Bobak Mortazavi ( and the team sponsor of the dataset of your choice (see below) the conference confirmation number.
  3. As a team, you will download the data, code your solutions, and update it to Kaggle for scoring.
  4. Submit your files for evaluation and scoring. It is that simple!

If you have any difficulty feel free to email the organizing team: Bobak Mortazavi (, Ryan King (, Subhamoy Mandal ( and Tayo Obafemi-Ajayi (

Public Health Informatics Challenge

The goal of this data challenge is to predict the 7-day average of new COVID-19 cases and the positivity rate based on historical public health data. Accurate prediction of such epidemiological trends can provide useful insights for the public, helping them make informed decisions regarding protection/mitigation measures, travel planning, and more.

Natural Language Processing Challenge

Practical usage of NLP models with COVID-relevant papers might enable automated information extraction from literature to facilitate drug discovery efforts. One of the crucial elements that can inform these efforts is knowledge about viral proteins. The goal of this data challenge is to build an NLP model to identify answers to protein-related questions from scientific papers.

Sensor Informatics Challenge:

The pathogenesis of COVID-19 is increasingly suggesting impairments in the respiratory system. In this light, it is natural to ask – Can sound samples serve as acoustic biomarkers of COVID-19? If yes, an acoustics-based COVID-19 diagnosis can provide a fast, contactless, and inexpensive testing scheme with the potential to supplement the existing molecular testing methods, such as RT-PCR and RAT. The present Challenge is an exploration of ideas to find answers to this question.

Bioinformatics Drug Target Challenge:

The goal of this challenge is to build effective ML/AI-based surrogate models that can accurately predict the docking scores of candidate drug molecules on SARS-CoV-2 protein targets.

Bioinformatics scRNA-Seq Challenge:

Identification of molecular signatures of severity of COVID-19 infection has become of utmost importance for early treatment of this pandemic disease. For this the use of single-cell RNA sequencing (scRNA-Seq) makes possible to identify and quantify thousands of genes within thousands of cells. In this context, scRNA-Seq technology has called for novel artificial intelligence (AI) solutions for data analysis and medical applications. The present challenge consists of the application of an AI algorithm to predict the severity of COVID-19 infection using a scRNA-Seq dataset. This AI model can be of great significance and of practical value for further study of the signatures of the severity of COVID-19.


  • 21 August: BHI 2023 Data Challenge website launched and data release
  • 20 September: Registration deadline for interested teams to sign up on Kaggle
  • 29 September: Preliminary submission deadline
  • 3 October: Results of initial evaluation sent
  • 8 October: Final submission due
  • 10 October: Decision of top 5 Finalists teams released
  • 11 October: BHI Data challenge team member registration deadline (each team must register at conference day rate $50 to qualify for certificate and award.)
  • 13 October: Final report due
  • 15 October: Finalist presentation & awards ceremony at BHI conference (Hybrid session)