Inspur teams up with DNA lab to trace the origin of human civilization
“Who are we? Where did we come from? Where are we going?” Generations of humans have asked these questions since the dawn of time, and with today’s technology we are one step closer to the answer. Inspur has partnered with an ancient DNA laboratory to apply intelligent computing to archaeological research in ancient DNA. This fusion of modern technology and ancient history helps to unveil the myths of ancient creatures, uncover the evolution of ancient cultures, and trace the origin of human civilization from a cross-disciplinary, cross-cultural perspective.
Technology empowers archaeology to discover more possibilities of human origin
Since the third technological revolution, the application of technology in archaeological research has dramatically enhanced its precision and broadened its scope. For instance, in the field of biomolecular archaeology, advancements in gene sequencing and AI computing have enabled researchers to extract and analyze the DNA stored in ancient remains, giving them clearer insight into life many millennia ago.
Through interdisciplinary research that combines archaeology, anthropology and linguistics, the DNA lab has made significant contributions to the traceability of ancient human, plant and animal, DNA as well as ancient civilizations.
In 2019, the DNA lab successfully deciphered the whole genome of the world’s first ancient wheat, which provided direct chronological evidence for understanding cultural and agricultural exchanges between the East and West. In 2020, the DNA laboratory research results revealed population interactions in northern China since the Neolithic Agricultural Revolution, providing important evidence for exploring the origin, formation, and development of Chinese civilization.
Advances in science and technology have provided new resources and tools for the study of ancient civilization, helping archaeological research to leap from qualitative to quantitative, making research results more accurate as well as providing unlimited opportunities to approach the truth of history.
In 1984, scientists found traces of DNA from samples of the long-extinct South African zebra and sequenced them. The study of ancient DNA was born; the study of Neanderthals in 1997 confirmed the reliability of the existence of ancient DNA; In 2012, scientists proved the existence of Denisovans, who had lived in the Qinghai-Tibet Plateau for a long time and were named one of the “Top Ten Scientific Breakthroughs in 2012” by Science magazine.
Computing power to unlock the secrets of ancient DNA
Ancient DNA uses scientific methods to simulate the evolution of civilization, allowing us to survey objects from magnificent mountains and rivers down to the miniscule microbe. However, compared to modern DNA, ancient DNA research still faces many difficulties.
Ancient DNA is shorter than modern DNA due to its age, hydrolysis, oxidation, and microbial degradation. On the other hand, ancient DNA sequencing is also more complex because there is a larger amount of data must be read and compared, which first needs to be cleaned and processed before sequencing.
The methods used to increase the reliability of results — making the sequencing mosaic more diverse and repeat more times — lead to extremely large sequencing datasets. Ancient DNA research has achieved the development and application of high-throughput sequencing technology platforms and an enhanced ability to capture small fragments of ancient DNA molecules. This is an important breakthrough that addressed previous research problems due to lack of data but has created new challenges in DNA data processing.
A leap in gene sequencing speed — 39 times more efficient
In collaboration with Inspur Information, the DNA lab developed a gene sequencing acceleration program which sped up the identification and analysis of ancient DNA. The entire genome analysis can now be completed in only 9.64 hours, and the entire exome can be completed in just 48 minutes. Group analysis and genetic data processing speed increased 39 times compared with the CPU-based program. The solution provided by Inspur accelerates the preservation and analysis of ancient DNA data, maximizes the accuracy of sequencing, improves the use efficiency of ancient DNA, and speeds up the results of ancient DNA research.
Inspur’s computing solution leverages field-programmable gate array (FPGA) accelerators to solve the challenges posed by ancient DNA. The solutions with FPGA accelerators are programmable, enable parallel computing, and provide low latency, delivering powerful acceleration for AI and HPC scenarios such as gene sequencing, voice recognition, video processing, and risk management. After rigorous testing and comprehensive assessment, the DNA lab decided to work with Inspur to apply the FPGA technology into the gene sequencing of ancient DNA, one of the first of such applications.
The new gene sequencing solutions use the Inspur F10A FPGA accelerator as the core computing unit and are integrated with the Falcon GATK (Genome Analysis Toolkit) gene processing software. In a half-height, half-length form factor, the F10A is the highest density FPGA accelerator with the most powerful performance that supports OpenCL (Open Computing Language). Its small power consumption of 35W is suitable for various complex computing environments such as data centers, edges, and desktops. With a performance of 42GFlops per watt, Inspur F10A can easily support a professional software algorithm library.
The integrated GATK gene processing software of Falcon is fully consistent with standard bioinformatics analysis tools, including all workflow of GATK Best Practices, such as BWA, Picard, and GATK. The integrated software is identical to the original software with the same command. Users can easily use and adjust the data because of the intermediate data which is consistent with original samples.
Presently, based on the new accelerated computing system for gene sequencing, researchers in the DNA lab are analyzing the bones of the nomads in the northern Chinese grasslands, such as the Xiongnu, Donghu, Xianbei, Wuhuan, Khitan, and Mongolia as well as unearthed animals and plants for DNA extraction and research. They are also leveraging the research results to learn about the migratory patterns and biological, sociocultural and economic relationships between the ethnic groups along the ancient “Silk Road” in Xinjiang. Such research continuously provides new material for ancient history, culture, and ethnic research, and opens up new horizons.
With the power of computing, Inspur is helping archaeology achieve spectacular breakthroughs in speed, breadth, depth, and precision. Empowered by Inspur’s initiative of computing for good, the ancient DNA laboratory is one step closer to uncovering the origins of human civilization.
AIStation is an Inspur-developed AI development platform specifically designed to deal with these issues by offering an easy to set up and refined GPU resource scheduling system.
Inspur joined with Xishuangbanna National Nature Reserve to develop an extensive technology system for the conservation of some 300 Asian elephants in Yunnan, China.
Inspur teams up with DNA lab to trace the origin of human civilization
With the help of today’s intelligent computing, researchers are now more easily able to find out more about our world from critically examining the artifacts of the past.
By Arthur Kang, SPEC OSSC Member / OSG ML Chair, Performance Architect, Inspur Information
The advanced language capabilities of Yuan 1.0 necessitates a high number of parameters, which brings many challenges in model training and deployment. This article focuses on the computing challenges of Yuan 1.0 and the training methods used.
The amazing enhancements of Intel's new third-generation Xeon Scalable processors (Ice Lake),
lots of factors can influence the training performance. In this article, we use the ResNet50 model from MLPerf Training v1.0 benchmarks as an example to describe how to improve training speed with hardware and software optimization
Integrates hardware and software that helps data centers improve construction efficiency, simplifies operations management, and enhances operational efficiency.
New specification provides current system compatibility and a framework for mixed accelerator hardware applications
As basic IT infrastructure shifts from the on-prem private IT datacenter to the public/hybrid cloud with the growing demand for more computing performance, the world is facing a new challenge
As the world enters the 5G era, the common impression is that 5G will bring down consumer prices of information services.