REVIEW

Current trends in anticancer drug prototype in vitro pharmacology: bibliometric analysis 2019–2021

About authors

Centre for Strategic Planning and Management of Biomedical Health Risks of the Federal Medical Biological Agency, Moscow, Russia

Correspondence should be addressed: Pavel V. Ershov
Pogodinskaya, 10, korp 1, Moscow, 119121; ur.xobni@97levap

About paper

Author contribution: Ershov PV — literature search, manuscript writing and formatting, conceptualization of the paper; Makarova AS — literature search, manuscript editing.

Received: 2021-07-21 Accepted: 2021-08-18 Published online: 2021-09-27
|

Progress in the therapy of socially significant diseases, including malignant neoplasms (MN), is organically linked to success in research and development (R&D) of the new drug prototypes, the pharmacologically active molecules, having the greatest antitumor effect and maximum conformity of physical and chemical parameters to canonical drug-like structures [1]. One of the evidence-based approaches to the drug discovery consists in design of molecules, which possess the desired properties and affect the cancer-related proteins [24]. Defining the mechanism of action of anticancer drug prototypes and the whole range of on-target and off-target proteins is essential for design of the highly selective drugs with minimal side effects. The development of the improved anticancer drugs follows from the properties of the tumor itself: intertumor and intratumor heterogeneity, and the diversity of mechanisms for resistance [5]. On the other hand, many pharmacotherapeutic approaches require improving the benefit–harm balance, since many existing anticancer drugs can cause severe side effects [6].

The number of annual publications dedicated to identification of the novel peptide and non-peptide molecules with anticancer activity, repositioning effects and prescriptions, is about 2700–3400 (over the five-year period, according to PubMed (accession on May 20, 2021). It should be noted that the review papers are focused mostly on systematizing the data on the anticancer drug prototypes, either belonging to a certain chemical taxon, or a spectrum of synthesized derivatives with the same chemical scaffold. Papers are also focused on discussing the range of biologically active substances with respect to the only molecular target. Nevertheless, reviews are a major source of scientific analysis in describing the current trends in drug discovery. Thus, the works [7, 8] cover the actual cancer-associated molecular targets, assessment of biological effects and concepts of the drug design, which in recent years has been intrinsically linked to computer-aided modeling (QSAR and others) [9]. At the same time, the search for current trends in drug prototypes, together with the interpretation of medical and biological effects of various anticancer compounds, results in the need for prompt data analysis from the multiple literature and biomedical sources. When performing without the use of customized algorithms and automated methods of data extraction (known as text mining), such an analysis can be very time-consuming. However, the application of the bibliometric analysis with adapted algorithms for creating an up-to-date "analytical portrait", which includes a more diversified repertoire of research data on drug prototypes and their molecular targets, has proved to be effective in defining the current trends in molecular oncology and pharmacology [1013]. The aim of the study was to assess the key trends and rank the scientific priorities in anticancer drug design using bibliometric analysis.

METHODS

PubMed (https://pubmed.ncbi.nlm.nih.gov/) was used to search for journal papers (issued from  May 20, 2020 to May 19, 2021) by keywords “antitumor and anticancer activity”. Inclusion criteria was “original research” reporting the new data on identification of anticancer drug prototypes and the best relevancy of search. Exclusion criteria was paper types “Books and Documents”, “Clinical Trial”, “Meta-Analysis”, “Randomized Controlled Trial”, “Review” and “Systematic Review”. A total of 1902 papers were found, from which 240 reviews and five clinical trials were subsequently excluded. 402 papers were selected from the remaining 1657 experimental papers, based on the abstracts analysis, and included in the target sample.

VOSviewer software [14] was used for keyword cooccurrence analysis in the papers from the target sample, as well as for keyword frequency analysis of MeSH terms and authors’ terms in the papers' abstracts.

Names of low molecular weight non-peptide compounds were extracted from papers' abstracts and keyword listings. Their MeSH classification groups (Classification/Ontologie/ MeSH-Tree) were defined using PubChem database (https:// pubchem.ncbi.nlm.nih.gov/).

Analysis of research trends and publication activity over the five-year period (2016–2020) was performed using the Scopus database (https://www.scopus.com/) (Elsevier; Netherlands). The key “compound name and tumor” (for example, “magnolol and tumor”)was used as a search query. The scientific interest to papers, issued in 2019, was assessed by the total number of citations for papers issued before May 19, 2021, being corrected for self-citation of all co-authors of the paper. Another parameter was the average number of citations per paper, issued in 2019.

Associations between protein targets, their cancer significance and drug prototypes were analyzed using the Pathway Studio and Reaxys (Elsevier; Netherlands). These resources enable extracting the links between different molecular entities from biomedical data containing in twenty million abstracts and several million full-text papers with the help of Medscan software. They adapted  for comprehensive search and prediction of drug targets as well as constructing of signaling pathways from the internal curated database. Associations found between molecular targets and key biological processes as “malignant transformation” and “cancer progression” was used to create the interaction map.

Web-based tool FACTA+ v.0.9 (accession on June 29, 2019) (http://www.nactem.ac.uk/facta/) was used to find biomedical associations in the full-text papers of the MEDLINE database.

Cortellis database (https://www.cortellis.com/intelligence) (Clarivate Analytics; UK) was used for patent search.

Principal component analysis (PCA) using k-means clustering and creating a heat map were performed using the ClustVis web-base tool [15]. The singular values decomposition method with imputation was used for PCA. It allows to generate and visualize the numeric data set geometric structure while imputing the missing values (imputation). Data preprocessing options included the following options: data transformation – “no transformation”; row scaling — “no scaling”. The heat map options were as follows: cluster distance for rows and columns — Pearson correlation or “correlation”, clustering method for rows and columns — “average”.

RESULTS

Analysis of abstracts from the target sample (402 papers) made it possible to define distribution of the occurrence frequency for keywords, which were related to identification of anticancer drug prototype and were mentioned in the text massive more than 10 times (fig. 1). Fig. 1 allows  one to see the focus in studying the pharmacological targeting of signaling pathways involving RAC-α serine/threonine-protein kinase (AKT1), signal transducer and activator of transcription (STAT)-3, hypoxia-inducible factor 1-α (HIF-1α), and p53dependent pathways. Cancer-associated protein targets can be also distinguished: protein kinases (PKC and AMPK), tubulin, matrix metalloproteinases (MMP), poly (ADP-ribose) polymerase 1 (PARP1), epidermal growth factor receptor (EGFR), vascular endothelial growth factor (VEGFR). Furthermore, targeting of caspases, aimed at activation of apoptotic programs in tumor cells, provides another important field of research.

Visual representation of keyword co-occurrence, mentioned in more than three papers, is provided in fig. 2. The figure presents five clusters with semantic links between the keywords. Thus, the pronounced co-occurrence was observed in the following keywords pairs: histone deacetylase–quinolines, topoisomerase inhibitors–indoles (or pyrazoles, or pyrimidines), tubulin modulators–chalcones (or naphthalenes). Thus, the analysis of papers abstracts and keyword co-occurrence (fig. 1 and fig. 2) made it possible to identify the studies focused on the design of chalcone and licochalcone (flavonoids with opened pyran ring), naphthalene, saponin, indole, quinolone, sesquiterpene, pyrazole, pyridine (pyrimidine), steroid, curcumin and britanin derivatives exhibiting anticancer activity. It is interesting to note that a sufficient number of papers were focused in the design optimization of tubulin polymerization inhibitors [16]. According to 10-year PubMed statistics, the number of new studies in this field remained stable, with a slight decline, observed in 2019–2020, which could demonstrate the decrease in scientific interest. Fig. 1 and 2 (fig. 1, fig. 2) also show that immortalized human embryonic kidney cell line HEK293, human colorectal carcinoma cell line HCT116, human lung carcinoma cell line A549, and mammary duct adenocarcinoma cell line (MCF-7) have become prevalent in vitro model systems. These were used for phenotype screening aimed at investigation of biological activity  of novel drug prototypes, obtained by organic synthesis or extracted from plants, marine algae and fungi. Many medicinal plants are used in ethnopharmacology, for example, in China, which has provided the basis for the contribution of this country researchers from to the publication activities.

Fig. 3 (fig. 3) demonstrates that compounds, representing the classes of heterocycles, glycosides, quinones and terpenes, appear to be the most abundant (71%) in the pool of all chemicals. Attention should be drawn to the terpenes and terpenoids, which is of high scientific priority and is considered an active “donor” of basic structures for synthesis of anticancer drug prototypes [17]. Furthermore, assessing the occurrence frequency of compound names in the papers' abstracts showed that it was possible to distinguish at least 29 most popular chemical structures (acridine, anthraquinone, dibenzyl, stilbene, phenanthridine, pyranocarbazole, benzopyran, imidazolo-pyridine, indole, indolizine, naphthyridine, phthalazine, chinazoline) used for organic synthesis of new derivatives with anticancer activity and other spectrum [18, 19]. Thus, indole, quinazoline, acridine and pyridine derivatives in total account for 35% of all ones.

The trends of publication activity over the 5-year period (2016–2020) performed for 150 compounds, including the names of synthetic (46%) and natural (54%) chemical scaffolds, is presented in fig. 4. It is reasonable to distinguish a discrete group of compounds (for example, derivatives of 1,6-naphthypyridone, sulfonylazaspirodienone, N-acyl-ophenylenediamine and scabioside C), in which the anticancer activity was found for the first time. Fig. 4 (fig. 4) also shows the reemerging scientific interest in dioscin, cinnoline, indolizine, nargenicin A1 derivatives and quercetin. It follows from the growing number of publications, issued in 2020, describing the new anticancer effects for these compounds. On the other hand, publication activity with a reference to derivatives of thiazolidinedione (troglitazone), nutlin, silybin, squamocin, and derivatives of benzodioxole, oleanolic acid, and hydroxamic acid declined gradually in 2018–2020 compared to 2016–2017 period.

In addition to assessment of dynamic changes in the papers number, citing the papers,  issued in 2019, was also analyzed. Fig. 5 (fig. 5) shows that, citation rate in 2019–2021 (minimum threshold: 1000 citations) was observed for studies reffering quinoxaline, naphthoquinone, berberine, valproic acid and docosahexaenoic acid derivatives, as well as on lapachone, luteolin and quercetin. The latter is the “cite leader”, which is mentioned 4200 times. However, based on the average number of citations per paper, issued in 2019, (minimum threshold: 15 citations), publications referring to crisosplenol D, 2-benzoyl-oxy-cinnamaldehyde, monastrol and lapachone ranked first in the context of new anticancer drug prototypes development.

The effective anticancer drug design is intrinsically linked with the related biomedical area of identifying the molecular targets. Efforts in this complementary area involve identification of mechanisms of action of drugs, which is critically important for differentiation of their on-target and off-target effects, as well as for design of highly selective compounds affecting the desired range of molecular targets. The vast majority of the clinically significant targets are the monomeric proteins or, more rarely, the oligomeric protein complexes [20]. Thus, the search for protein targets for a number of the priority anticancer drug prototypes (fig. 5), contained in the Reaxys database, has helped to find out molecular targets has been mapped for the other three compounds, except crisosplenol D. As it turns out, the 2-benzoyloxycinnamaldehyde structure similar, ((E)2-(3-oxoprop-1-en-1-yl) phenyl benzoate), inhibits cyclindependent kinase 4, CDK4 (Target ID: 820056979) enzymatic activity, glutathione reductase (Target ID: 820106479) and farnesyltransferase (Target ID: 470858527). However, the monastrol and lapachone derivatives exhibit more optimal druglike properties (compliance with the “rule of five” parameters [21]) compared to 2-benzoyl-oxy-cinnamaldehyde, and have a greater potential for identification of their molecular target. Reaxys database contains 31 and 117 records (accession on May 29, 2021) about the monastrol and lapachone targets, respectively. Among them, it can be found Aurora A protein kinase (Reaxys Target ID:818366617), M-phase inducer phosphatase 2 (Target ID: 820046846), DNA topoisomerase I (Target ID: 818289566), DNA topoisomerase II (Target ID: 824645880), glutathione S-transferase P (Target ID: 820104166), K-Ras GTPase variants (Target ID: 819104337) with G12C and Q61R substitutions, NAD(P)H quinone dehydrogenase 1 (Target ID: 820009294).

In table, the new experimental synthetic and natural compounds, exhibiting anticancer activity [2292] were shown with respect to molecular target mapping and the mechanisms of action studies. Scientific interest in such compounds follows from increased citation numbers and existence of patent potential. Furthermore, the newly identified chemical scaffolds, exhibiting a broad spectrum of anticancer activity, may later be used for design and synthesis of their derivatives. It is worth noting that papers on drug target mapping currently represent the great scintific demand and have a high citation rate in experimental pharmacology area.

DISCUSSION

Analysis of data presented in table revealed the following quantitative distribution of the new anticancer drug protein targets: protein kinases (38%), DNA topoisomerases I and II (10%), transcription factors (9%), clinically significant proteinprotein complexes (7%), cytoskeleton proteins (tubulin) (5%), and, finally, the rest of the diverse targets' group accounted for 40%. It is noteworthy that the latter group included phosphodiesterase 5A (PDE5А), Na+-glucose cotransporter 2 (SGLT2), and glutathione S-transferase (GSTP1) and dihydrofolate reductase (DHFR) enzymes, associated with chemotherapy drug metabolism and involved in the drugresistant tumor phenotype formation [93]. Therefore, blockade of such enzymes may be considered a complementary pharmacological strategy, enabling to decrease chemoresistance. Analysis of papers' abstracts made it possible to determine the mechanism of action of anticancer drug prototypes (table) which represent, mainly, competitive or noncompetitive reversible inhibition of the enzymes' catalytic activity.

Table (table) demonstrates scientific interest in low molecular weight compounds that capable of modulating the function of transcription factors  (for example, HIF-1a, FOXM1, ATF4). This is a group of intracellular nuclear proteins, which have been considered the extremely complicated molecular objects for drug design for a long time [94]. The search for new compounds disturbing signal transduction in the classic cancer-associated signaling pathways remains the traditional approach for carcinogenesis inhibition. Based on bibliometric analysis over 2020–2021, it should be pointed out that references to the PI3K, mTOR, AKT1, STAT3, HIF-1a and p53 signaling cascades were most often found among the pharmacologically targeted signaling pathways.

One more group can be seen in table, including the compounds (LQFM126, E7386, britanin) and miconazole, which have the potential to selectively modulate the protein complex affinity. This approach to drug prototype design is innovative, since it is based on the pharmacological targeting of conservative interfaces of protein-protein interactions (PPI) with low mutation rate [95, 96]. However, there is a number of polemical papers [97, 98], reporting the opposite, namely, predominant accumulation of mutations, associated with carcinogenesis, in the PPI interfaces of clinically significant protein complexes. This point is needed for in-depth study of the mutation factors' significance for drug targeted protein interfaces.

Then, we searched for information of involvement in carcinohenesis of the protein targets (listed in table) using Pathway Studio. The assessment was performed based on the presence of links between proteins, and terms “malignant transformation” and “cancer progression”. The results were selected on the availability of at least five papers, reporting the association of each protein target with each of these parameters. Thus, the visualization (fig. 6) has revealed many  evidence on the cancer involvement of majority of protein targets (table).

The table also demonstrates that, on the one hand, STAT3 has been identified as a new protein target for the wellknown anticancer drug napabucasin. On the other hand, studying the repositioning effects of miconazole, troglitazone, dexamethasone, vinpocetine and empagliflozin has made it possible to determine one more type of their biological activity, that is the anticancer activity. It can be assumed that such activity results from the modulating of signal transduction in the pro-oncogene pathways.

Pathway Studio can help to found out that the main type of interaction between the protein targets, was a posttranslational modification by means of phosphorylation. The identified binary protein-protein interactions, where the first protein of a pair phosphorylates the second protein, are as follows: EGFR/STAT3, MET/STAT3, GSK3B/HIF1A, ABL1/STAT3, EGFR/GSK3B, ABL1/TERT, EGFR/MET, PRKCA/

GSK3B, ABL1/STAT3, AURKA/GSK3B, PRKD1/EGFR, GSK3B/HSPA1A, GSK3B/NFKB1. It should be emphasized that the above mentioned interactions include proteins, which are involved in carcinogenesis regulation and are the targets for the approved anticancer drugs [99, 100].

Thus, bibliomertic analysis of papers, issued in 2020–2021, focused on the identification and development of the novel non-peptide anticancer drug prototypes revealed the following findings: i) a significant proportion (71%) of compounds belong to the chemical class of heterocycles, glycosides, quinones and terpenes; ii) indole, quinazoline, acridine and pyridine scaffolds were up to 35% from all used in organic synthesis of derivatives. Trend analysis of publication activity made it possible to define the scientific interest in the distinct groups of anticancer compounds and the progress in mapping their molecular targets. Among them, were proteins, associated with carcinogenesis, such as serine/threonine protein kinases, receptor tyrosine kinases, DNA topoisomerases and tubulins, which still remain the most studied anticancer drug targets. However, there are the increasing growth of the scientific interest in the design of inhibitors targeted to cancer-associated protein-protein complexes, transcription factors and metabolic enzymes, involved in the drug-resistant tumor phenotype.

CONCLUSIONS

Bibliometric approach with the using of Scopus, Reaxys, Pathway Studio and Cortellis databases together with VOS Viewer software enables prompt monitoring of the trends in the anticancer drug research and development (R&D) as well as defining the priorities in current scientific interest. At the same time, it can be said that there emerges a growing number a growing number of novel molecular modalities associated with carcinogenesis processes, which encorages the studies discovery of more efficient pharmacotherapeutic agents.

КОММЕНТАРИИ (0)