Bioinformatics and Medical Applications. Группа авторов
2012 Nov 27.
14. Macedo, F. et al., Computer-aided detection (CADe) and diagnosis (CADx) system for lung cancer with likelihood of malignancy. Biomed. Eng. Online, 2, 15, 2016.
15. Mike, Hoover, W., Strome, T., Kanwal, S., Transforming healthcare through big data strategies for leveraging big data in the healthcare industry, Health IT Outcomes, USA, 2013, http://ihealthtran.com/iHT2 BigData 2013.pdf.
16. Rastogi, R., Chaturvedi, D.K., Satya, S., Arora, N., Trivedi, P., Gupta, M., Singhal, P., Gulati, M., MM Big Data Applications: Statistical Resultant Analysis of Psychosomatic Survey on Various Human Personality Indicators, ICICI 2018 Paper as Book Chapter, Chapter 25, © Springer Nature Singapore Pte Ltd, Singapore, 2020, Book Subtitle: Proceedings of Second International Conference on Computational Intelligence, 2018, https://doi.org/10.1007/978-981-13-8222-2_25.
17. Singh, Y. and Chauhan, A.S., Neural Networks in Data Mining. J. Theor. Appl. Inf. Technol., 5, 6, 37–42, 14, 2005.
18. Sun, and Reddy, C.K., Big data analytics for healthcare, in: Proc. 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1525–1525, 2013.
19. Wu, M. and Luo, J., Wearable technology applications in healthcare: A literature review. Online. J. Nurs. Inform. (OJNI), 23, 3, 1, Fall. 2019, Available at http://www.himss.org/ojn.
20. Xu, J., Yang, P., Xue, S. et al., Translating cancer genomics into precision medicine with artificial intelligence: applications, challenges and future perspectives. Hum. Genet., 138, 109–124, https://doi.org/10.1007/s00439-019-01970-5, 2019.
21. Zanella, A., Bui, N., Castellani, A., Vangelista, L., Zorzi, M., Internet of things for smart cities. IEEE Internet Things J., 1, 1, 22–32, 2014.
22. Zhao, W. and Wang, H., Strategic decision-making learning from label distributions: an approach for facial age estimation. Sensors, 16, 994–1013, 2016.
*Corresponding author: [email protected]
3
Computational Predictors of the Predominant Protein Function: SARS-CoV-2 Case
Carlos Polanco1,2*, Manlio F. Márquez3 and Gilberto Vargas-Alarcón4
1 Department of Electromechanical Instrumentation, Instituto Nacional de Cardiología “Ignacio Chávez”, México City, México
2 Department of Mathematics, Faculty of Sciences, Universidad Nacional Autónoma de México, México City, México
3 Clinical Research Center, Instituto Nacional de Cardiología “Ignacio Chávez”, México City, México
4 Research Center, Instituto Nacional de Cardiología “Ignacio Chávez”, México City, México
Abstract
In this chapter, we describe the main molecular features of SARS-CoV-2 that cause COVID-19 disease, as well as a high-efficiency computational prediction called Polarity Index Method®. We also introduce a molecular classification of the RNA virus and DNA virus families and two main classifications: supervised and non-supervised algorithms of the predictions of the predominant function of proteins. Finally, some results obtained by the proposed non-supervised method are given, as well as some particularities found about the linear representation of proteins.
Keywords: Adenoviridae, advantages, algorithms, Anelloviridae, Arenaviridae, Caliciviridae, computational predictions, Coronaviridae family, disadvantages, DNA virus, Herpesviridae, Herpesviridae, linear representation, non-supervised algorithms, Papillomaviridae, Parvoviridae, Picornaviridae, PIM® profile, Polarity Index Method®, Poxviridae, putative proteins, Reoviridae, Rhabdoviridae, RNA virus, SARS-CoV-2, supervised algorithms
3.1 Introduction
The discipline called Proteomics focuses, among other things, on the prediction of the predominant function of proteins, since this enables their construction by alteration or removal of their amino acids, either in its linear or in its three-dimensional representation.
This means some benefits for scholars of this discipline; since from these modifications, it is possible to obtain, mostly by accident, proteins with improved toxic action toward some pathogen.
Of course, it does not mean that there are no proteins or peptides in nature that have the toxic action we are looking for, it is just that in most cases, it is less expensive to try to manufacture one than to find it among the many organisms in nature.
The proteins expressing the SARS-CoV-2 that caused the COVID-19 pandemic have the particularity that their linear representation, i.e., their sequence, has a large number of amino acids, approximately 2,400 amino acids (aa). To give the reader an idea of the complexity and dimension of it, let us consider that some short peptides with antibacterial action have an average of 9aa.
Now, the capacity and robustness of the SARS-CoV-2 and all the regulatory processes can be dimensioned. Viruses, in general, have not simple mechanisms; they occupy some microns in length (a micron equals a thousand of a millimeters), but they are specialized organisms with multiple properly regulated functions that are characterized, among other things, for completing themselves when they are in contact with the RNA or DNA of the host.
This particularity suggests that they are not living organisms if we compare them with other living organisms whose species are known. Living organisms are born complete and do not get completed at birth; they have everything to reproduce themselves. This is not so with viruses; in any case, we can say that viruses are another type of life.
The SARS-CoV-2 is a virus of the group of coronaviruses, whose particularity and strength is the possibility of reproducing in any living organism. This feature is not shared by all viruses, e.g., the Ebola virus only passes to certain species, despite its lethality that is greater than 60%, it is less dangerous to our species because, when it kills its host, the virus dies with it and the damage is so fast that the number of deaths is small.
This does not occur with SARS-CoV-2 that can spread rapidly and the infected subject does not show any symptoms (asymptomatic) for 4 or up to 7 days, despite its lethality rate that is no higher than 2%.
From the above, it is clear that the definition of the lethality rate is not just a quotient, but multiple variables have to be considered. Almost as many as asking us about the lethality of a human being.
The review of the basic concepts related to viruses and particularly the SARS-CoV-2, make us think about the complexity of these organisms, not as organisms that by their small size are simple and potentially fragile, but as organisms with an auto-regulatory capacity as complex as the human species but in a few microns.
3.2 Human Coronavirus Types
Coronaviruses feature the crown-like spikes on their surface; they were first identified in the 1960s and there are seven types of coronavirus that infect people [1]:
1 1. 229E alpha coronavirus causes the common cold; it infects the upper respiratory tract, i.e., nose, throat, sinuses, and larynx. The subject presents symptoms within a couple of days of being in contact with the virus; these can be headache, runny nose, coughing, sore throat, and fever. The illness lasts