Hero image

About project

Central Forensic Laboratory of the Police (CFLP) is a leader of the consortium which consists of the top Polish universities (Medical University of Warsaw, Jagiellonian University in Kraków, Pomeranian Medical University in Szczecin) and Polish bioinformatical company (ARDIGEN). The Consortium received funding from the National Center for Research and Development for the implementation of the project with the acronym SMAFT (Soil Microbiome Analysis Forensic Tool) which aims to investigate the potential of using the soil microbiome in forensics (DOB-BIO10 / 03/01/2019).

Due to the ease of transferring on shoes, clothes, tools or vehicles, soil is a particularly valuable evidence material that allows to connect a suspect or object with a specific geographical location. Unfortunately, often the information obtained through routine soil analyzes, conducted mainly on the basis of its physical and chemical characteristics, does not allow for the verification of investigative hypotheses. For this reason, there is a need to develop soil identification methods based on DNA analysis of the microorganisms which inhabit it. It is estimated, that 1 gram of dry soil contains on average: 1010 viruses, 1010 bacteria and archaea (including 108 actinomycetes), 106 each of fungi and algae, 105 protozoa and 102 nematodes. Moreover, the soil may also contain plant fragments as well as invertebrates other than nematodes and extracellular nucleic acids. It seems, that such a huge 'library' of DNA, combined with modern sequencing technologies, may, in the future, prove to be just as useful for comparing soil traces or determining its place of origin as human DNA profiles for establishing the relationship between biological traces and a suspect. At present, soil microbiome DNA evidence is not used routinely in investigations or in courts. To change this, it is necessary to show that the statistical inference based on DNA analysis of the soil microbiome is sufficiently rigorous to be considered by the courts as evidence of adequate scientific basis. According to key opinion leaders in the field of forensic science, confirmation of the credibility and effectiveness of microbiome analyzes can be achieved by conducting research on sufficient sample size, creating databases of microbiome from various environments based on clearly defined and well-documented procedures, improving bioinformatics tools and learning about the dynamics of microbiome change in time and location.

The assumptions of the SMAFT project, which aims to develop an innovative tool for forensic analysis of the soil microbiome and take into account the guidelines presented above. The data obtained from sequencing nearly 1000 DNA samples of soil microbiomes will be the basis for the design of the SMAFT system, enabling the identification and determination of the place of origin of a soil sample. Information obtained with the use of the SMAFT system will be able to direct, and thus accelerate, investigations in criminal cases and those related to terrorist attacks. The system can also be used in biodiversity research.

Research plan of the SMAFT Project is organized in 8 work packages:

  1. Collection of 960 soil samples, 240 in each season: autumn, winter, spring and summer from 80 different locations in Poland (3 samples in each location).

    The selection of soil sampling sites was made on the basis of analysis of data from hydrological and meteorological measurements performed in all weather stations located in Poland, collected over last 20 years. This stage also includes the development of a detailed methodology for soil sampling (regarding securing, describing and transporting samples to the laboratory under appropriate conditions).

  2. Isolation of microbiome DNA from soil.

    Before starting the task, several tests were carried out to select the most effective method of DNA isolation from soil in terms of the quantity, purity and quality of the obtained isolate.

  3. Preparation of NGS libraries from the isolates.

    In order to obtain the best quality libraries with the desired fragment lengths, the library preparation procedure was optimized, both at the stage of DNA fragmentation and amplification. The library fragment lengths are verified by capillary electrophoresis, and the DNA concentration in the libraries is determined using the fluorometric method.

  4. WGS (whole genome sequencing) sequencing using Illumina® SBS technology with the latest generation NovaSeq 6000 Illumina® sequencer.

    The result of this task will be high-quality raw data from deep sequencing. To achieve this goal, we plan to obtain 80 to 100 million pairs of 150 bp reads per single soil sample.

  5. Data analysis.

    The data obtained in step four will be analyzed using various bioinformatics tools. The aim of the analysis is to select the optimal set of markers (identification panel), that will enable the assessment of the microbiome composition in a sample of unknown origin and then link the sample to a specific location. The selected genetic markers will be composed of highly informative genomic sequences which will allow qualitative and quantitative identification of the microbiome present in the analyzed sample.

  6. Development, optimization and validation of the targeted NGS method for soil microbiome analysis.

    Testing of samples will be based on selected genomic sequences defined in step five. The compatibility of the sequencing results for soil samples obtained in the fourth step (deep sequencing) with the sequencing results obtained for the test developed at this stage (targeted sequencing) performed with several sequencing methods of medium throughput will be verified. The optimal sequencing technology will be selected and validated, with particular emphasis on the specificity and limitations of the forensic analyzes.

  7. Creation of an IT system for the analysis and interpretation of the results obtained by the genetic test and selected NGS technology.

    Data obtained from DNA sequencing of soil microbiomes will be placed in the project database, which is an element of the system being created, constituting a 'map' of soil microbiome in Poland. Additionally, an efficient search engine within the database will be developed to allow effective database resources search and also interpretation of analysis results. The results of the DNA sequencing from a soil sample of unknown origin will be compared with the database using the developed software and then linked to the most probable location on the map of Poland. Ultimately, a complete predictive system will be created, which will allow identification of the bacterial communities/groups and interpretation of the data.

  8. Testing the effectiveness of the created system in the conditions similar to real ones and preparation of the standard operation procedures (SOPs) describing the “ways of working” for the developed system.

    The correctness of the parameters of the developed program/system will be assessed. Additionally, guidelines, procedures and instructions required to conduct predictive tests and then comparative analyzes of DNA samples isolated from soil will be developed.

The end product of the SMAFT project will be a complete predictive system, based on microbiome composition of soil samples, which will allow identification and tracking of the tested sample origin.

Technology

SMAFT Project uses the most advanced technologies and the most modern devices in the field of genetic diagnostics will be used for DNA sequencing of microbiome isolated from soil. It is planned to obtain 80 to 100 million pairs of 150 bp fragments from each of the nearly 1,000 soil samples.

Sequencing is carried out in SBS (sequencing by synthesis) technology on Illumina sequencers: MiSeqFGx (to check quality of the library) and on the NovaSeq 600 high-throughput sequencer, which allows obtaining up to 6 Tb of output data from one sequencing run.

Collection and documentation of soil samples is carried out using KoBoToolbox - a tool for collecting and analyzing data based on the GPS system.

Consortium

Central Forensic Laboratory of the Police

Central Forensic Laboratory of the Police (CFLP), as a research institute, conducts development work and scientific research in the field of forensics and forensic techniques. CFLP carries out Polish and international research projects, which result in the development of forensic tools supporting law enforcement agencies in the fight against crime (e.g. projects: AEVITAS, NEXT or VISAGE). CFLP is an active member of ENFSI and cooperates with other forensic institutes and universities. The SMAFT project is another genetic project that aims to develop a new tool to analyze the traces in the form of soil samples. The manager of the SMAFT project is the Head of the Scientific Development Department of CFLP Renata Zbieć-Piekarska, MD, PhD, author of many scientific publications in the field of forensics.

As a project leader, CFLP supports consortium members in individual tasks. Our goal is to ensure the implementation of the work packages of the project, in line with the project progress schedule and to organize good cooperation within the Consortium.

In addition to the supervisory functions over the project, the CFLP team is responsible for the preparation and verification of libraries for WGS sequencing and the implementation of the last stage of the project related to the validation of the final version of the SMAFT system.

Pomeranian Medical University

The Department of Forensic Genetics of the Pomeranian Medical University in Szczecin has many years of experience in the identification of human remains from various historical periods. Research conducted in the field of forensic anthropology, analysis of physical evidence and DNA identification (both nuclear and mitochondrial) are conducted by a team of specialists from the field of forensic genetics and forensic anthropology. Head of the Department: Professor Andrzej Ossowski is the originator and co-creator of the Polish Genetic Database for Victims of Totalitarianism. Employees are appointed by law enforcement and justice authorities to carry out domestic and international exhumation works and disaster victim identification (DVI). Due to extensive experience in the field work, the PUM Team will conduct the seasonal collection of soil samples from 80 sites in Poland. Coordinates from devices with the GPS function and the recording of the download locations will allow to link the samples to the specific location.

Warsaw Medical University

The Department of Medical Genetics of the Warsaw Medical University conducts scientific research, teaching and diagnostics of genetic diseases. The research concerns, among others analysis of genetic factors in diseases (allergic, autism, Graves' disease); factors responsible for the phenotype of identical twins and exome studies. It is possible thanks to a complete infrastructure enabling work based on platforms for advanced analysis of nucleic acid sequences and a qualified research. Head of the Department Professor Rafał Płoski is the author of over 100 works published in international literature in the field of genetic diagnostics of cancer, anthropology and forensic medicine. The Warsaw Medical University is responsible for the project stage related to the high-throughput WGS sequencing of the environmental microbiome's DNA derived from soil samples.

Jagiellonian University

Małopolska Centre of Biotechnology (MCB) is an independent research unit established by Jagiellonian University in Kraków. MCB was created as a project co-financed by Operational Innovative Economy 2007 – 2013 Program aimed at creating an interdisciplinary research center enabling the performance of comprehensive research at various functioning levels of the organisms. Two research groups of the JU – MCB are involved in the implementation of the SMAFT project: Human Genome Variation Research Group and Bioinformatics Research Group.

Human Genome Variation Research Group (MCB JU) specializes in the study of human genetic variability including the epigenome as well as human-related microorganisms. The Laboratory is created by people for whom work is a passion, with impressive scientific achievements - for example prediction of appearance or age for the forensics purposes. The Group Leader is professor Wojciech Branicki, an expert in forensic genetics, a member of Polish and international scientific societies. He is the author of many scientific publications and textbooks related to predictive DNA analysis and the development of investigative tools used in the world in forensic genetics. Professor Branicki’s team is responsible for the stage of isolating the microbiome from the soil as well as testing and selecting technologies for the targeted analysis of the composition of the soil microbiome.

Bioinformatics Research Group (MCB JU), leaded by Professor. Paweł Łabaj focuses on development of methods for biomedical sciences with special interest in transcriptomics and metagenomics for precision medicine. Paweł Łabaj is also leading in the same field of interest in SEQC consortium, which is coordinated by Food and Drug Administration of the United States of America. The consortium handles benchmarking of high throughput gene expression profiling technologies (including RNA-Seq). Apart from above, Professor Łabaj is also an active member of an international MetaSUB consortium, which deals with analysis of metagenomic data from environmental samples from all around the world. The scope of MetaSUB actions revolves around public healthcare & well- being and exploration of the forensic potential of metagenomic data. He is also co-founder of the MetaSUB Europe Society. The team under supervision of Professor Łabaj is responsible for analysis of data derived from sequencing of metagenomic soil samples and for mark out of metagenomic markers which will be included in dedicated genetic panel for analysis of soil microbiome, which is a part of the SMAFT system.

ARDIGEN

Bioinformatics Research Group (MCB JU), leaded by Professor. Paweł Łabaj focuses on development of methods for biomedical sciences with special interest in transcriptomics and metagenomics for precision medicine. Paweł Łabaj is also leading in the same field of interest in SEQC consortium, which is coordinated by Food and Drug Administration of the United States of America. The consortium handles benchmarking of high throughput gene expression profiling technologies (including RNA-Seq). Apart from above, Professor Łabaj is also an active member of an international MetaSUB consortium, which deals with analysis of metagenomic data from environmental samples from all around the world. The scope of MetaSUB actions revolves around public healthcare & well- being and exploration of the forensic potential of metagenomic data. He is also co-founder of the MetaSUB Europe Society. The team under supervision of Professor Łabaj is responsible for analysis of data derived from sequencing of metagenomic soil samples and for mark out of metagenomic markers which will be included in dedicated genetic panel for analysis of soil microbiome, which is a part of the SMAFT system.

Consortium icon

Research Group

Research icon

CLKP

Aleje Ujazdowskie 7
00-583 Warszawa
email: clkp@policja.gov.pl
https://clkp.policja.pl/
Scientific Development Department
Renata Zbieć-Piekarska- Head of Scientific Development Department, Project Manager
Biology Department
Agata Jagiełło
Anna Woźniak

PUM

Department of Forensic Medicine
Department of Forensic Genetics
70-111 Szczecin
al. Powstańców Wlkp. 72
email: zmedsad@pum.edu.pl
http://zms.pum.edu.pl/
Andrzej Ossowski - Head of Department of Forensic Genetics
Maria Szargut

WUM

Department of Medical Genetics
Center for Biostructure Research
Research First Faculty of Medicine - Medical University of Warsaw
Kampus Banacha
ul. A. Pawińskiego 3c
02-106 Warszawa
e-mail: rploski@wp.pl
https://genetyka.wum.edu.pl/
Rafał Płoski - Head of Department of Medical Genetics
Małgorzata Rydzanicz
Agnieszka Pollak
Piotr Stawiński

UJ- MCB

Human Genome Variation Research Group
ul. Gronostajowa 7A
30-387 Kraków
https://mcb.uj.edu.pl/
e-mail: mcb@uj.edu.pl

  • Human Genome Variation Research Groupo

    Wojciech Branicki – Group Leader
    Kamila Marszałek
    Kinga Herda

  • Bioinformatics Research Group

    Paweł Łabaj – Group Leader
    Michał Kowalski
    Alina Frolova

ARDIGEN

ul. Podole 76
30-394 Kraków
https://ardigen.com/
hello@ardigen.com
Team led by
Kai Milanowskiej-Zabel

Contact

Name

Surname

E-mail

Message