Personalized Medicine: Redefining Cancer Treatment

This is the exploratory data analysis for Personalised Medicine for redefining Cancer Treatment.

Personalized Medicine refers to the idea of analyzing an individual patient's tumor to determine what combination of drugs will work best for that paricular individual. For doing this, the thousands of genetic mutation in cancer tumor needs to be distinguished into a. the mutations that contribute to tumor growth (drivers) b.the mutation which are neutral and do not contribute to tumor growth (passengers). The interpretation of genetic mutations is being done is currently done manually.

In this project we are going to automate the classification of these drivers from the passengers.

2.1 Data Set Information

DATA SET CHARACTERISTICS: Multivariate

ATTRIBUTE CHARACTERISTICS: Integer and String

NUMBER OF INSTANCES: 3321

NUMBER OF ATTRIBUTES: 1.training_variants : 4 namely ID,Gene,Variation,Class

                   2.training_text : 2 namely ID,TEXT


LABEL: Class (1-9) denotes types of mutation

MISSING VALUES: Yes

About the given data: The data comes in 4 different files. Two csv files and two text files:

training/test variants: These are csv catalogues of the gene mutations together with the target value Class, which is the (manually) classified assessment of the mutation. The feature variables are Gene, the specific gene where the mutation took place, and Variation, the nature of the mutation. The test data of course doesn’t have the Class values. This is what we have to predict. These two files each are linked through an ID variable to another file each, namely:

training/test text: Those contain an extensive description of the evidence that was used (by experts) to manually label the mutation classes.

The text information holds the key to the classification problem and will have to be understood/modelled well to achieve a useful accuracy.

In [8]:
import pandas as pd
import numpy as np
train_varients=pd.read_csv("training_variants")
test_varients=pd.read_csv("test_variants")
train_text=pd.read_csv("training_text",sep="\|\|",engine="python",names=["ID","TEXT"],skiprows=1)
test_text=pd.read_csv("test_text",sep="\|\|",engine="python",names=["ID","TEXT"],skiprows=1)
In [9]:
print train_varients.head()
   ID    Gene             Variation  Class
0   0  FAM58A  Truncating Mutations      1
1   1     CBL                 W802*      2
2   2     CBL                 Q249E      2
3   3     CBL                 N454D      3
4   4     CBL                 L399V      4
In [10]:
print train_text.head()
   ID                                               TEXT
0   0  Cyclin-dependent kinases (CDKs) regulate a var...
1   1   Abstract Background  Non-small cell lung canc...
2   2   Abstract Background  Non-small cell lung canc...
3   3  Recent evidence has demonstrated that acquired...
4   4  Oncogenic mutations in the monomeric Casitas B...
In [11]:
# converting the train text to a list
test_text_list=test_text.TEXT.tolist()
pat1_text=test_text_list[1]
print type(pat1_text)
<type 'str'>
In [12]:
len(str(test_text_list[:1]).split(" "))
Out[12]:
7495

Unique Variations

In [13]:
unique_variation= set(train_varients.Variation)
print unique_variation
#print pd.unique(train_varients.Variation)
set(['Y62C', 'D399N', 'L858M', 'Q531*', 'E1586G', 'SRGAP3-RAF1 Fusion', 'H93Q', 'H93R', 'V564I', 'H93D', 'L1195V', 'K4E', 'T1623I', 'E203K', 'E2856A', 'P380R', 'R2505*', 'S786F', 'Y1463S', 'N581Y', 'G464E', 'S32I', 'N581S', 'MAGI3-AKT3 Fusion', 'A502_Y503dup', 'H697Y', 'Y53H', 'L2396F', 'R2450*', 'Y24C', 'N581D', 'S270L', 'D289del', 'V1188L', 'W557_K558del', 'L191H', 'H118P', 'EP300-MOZ Fusion', 'D603N', 'G480W', 'D289_D292del', 'L1122V', 'D351H', 'D603G', 'F460L', 'G17V', 'S860L', 'P86H', 'G1738R', 'C450_K451insMIEWMI', 'V411L', 'T1324N', 'G17A', 'V555_V559del', 'G776S', 'GPIAP1-PDGFRB Fusion', 'E598_Y599insGLVQVTGSSDNEYFYVDFREYE', 'D171G', 'P124Q', 'G165V', 'P124S', 'G829R', 'G165R', 'R496C', 'D835del', 'P95S', 'K558_E562del', 'R496H', 'R383*', 'G165E', 'R970C', 'P153H', 'D808N', 'G1743R', 'V118D', 'TRA-NKX2-1 Fusion', 'P106L', 'S562L', 'M504V', 'L387M', 'D600_L601insFREYEYD', 'M1008I', 'C630Y', 'Y801H', 'E719G', 'E719K', 'R283Q', 'K509I', 'I391M', 'Q227L', 'A750_E758delinsP', 'T599_V600insV', 'E746G', 'D837N', 'P179L', 'D162G', 'E746Q', 'E746V', 'P179R', 'R866C', 'RET-CCDC6 Fusion', 'K59del', 'V561_I562insER', 'E1935G', 'G423V', 'K11R', 'L559R', 'G423R', 'T790M', 'A1685S', 'V1804D', 'M224R', '560_561insER', 'C384R', 'E69K', 'V321M', 'H68Y', 'K666M', 'K666N', 'G35V', 'R108H', 'S215C', 'D814V', 'R108K', 'S215G', 'F876L', 'G35A', 'C44Y', 'N375S', 'E1735K', 'K666R', 'Y578C', 'K666T', 'M1328I', 'D1739G', 'R2505Q', 'A2643G', 'G75R', 'D1739V', 'D419del', 'P133T', 'IGH-MYC Fusion', 'L584F', 'Y155C', 'D1739Y', 'R2318Q', 'L424V', 'P278R', 'T77M', 'N510K', 'E144K', 'T77P', 'I28T', 'L424I', 'L158V', 'L1678P', 'P38L', 'Q79R', 'G271E', 'L158Q', 'L158P', 'R2602T', 'R698W', 'Y849S', 'S904F', 'C135Y', 'C135S', 'C135R', 'Y849C', 'D1349H', 'V1713A', 'S1486C', 'T529I', 'AGK-BRAF Fusion', 'T529N', 'T529M', 'I90T', 'S1036P', 'F71I', 'S371C', 'G505S', 'F311L', 'S752_I759del', 'K700R', 'K700E', 'T17A', 'F1200I', 'F28L', 'S33A', 'F119S', 'V2908G', 'Y1003*', 'N870S', 'Y1295A', 'L78T', 'S33Y', 'S102F', 'L388M', 'ETV6-FLT3 Fusion', 'G328E', 'G328V', 'A2717S', 'T160I', 'Overexpression', 'H2074N', 'N561D', 'G1656D', 'S427G', 'Y3098H', 'Y1003C', 'P287A', 'ESRP1-RAF1 Fusion', 'Y1003F', 'N841I', 'E1021K', 'H773insLGNP', 'P287S', 'T605M', 'P287T', 'C628Y', 'TRKAIII Splice Variant', 'Y98N', 'L2427R', 'R882L', 'L1584R', 'K2472T', 'R882C', 'R177Pfs*126', 'L861P', 'L861Q', 'L861R', 'G697C', '422_605trunc', 'S217T', 'R251Q', 'H412Y', 'T1354M', 'F384L', 'C61G', 'M1411T', 'I1018W', 'K513R', 'F384Y', 'F359C', 'D121G', 'D520N', 'I1018F', 'F384V', 'G60E', 'Q12Y', 'D493A', 'R373Q', 'T1151dup', 'N387K', 'Y34C', 'M1_E165DEL', 'L43V', 'R373H', 'A161T', 'A161S', 'Q331R', 'D631G', 'S217C', 'Q331H', 'D544H', 'I1680N', 'P387S', 'S24F', 'Y1230H', 'R1589P', 'DNA binding domain missense mutations', 'Y1230C', 'R482Q', 'M1400V', 'R167Q', 'N53A', 'K2950N', 'Q58_Q59insL', 'P1806A', 'E1384K', 'R1589H', 'KIF5B-PDGFRA Fusion', 'Y1278S', 'NPM-ALK Fusion', 'S1301R', 'L202F', 'A889P', 'G128V', 'E475K', 'A122*', 'A1519T', 'R217C', 'P2412T', 'G1788D', 'H396R', 'E275K', 'Q1785H', 'G1035S', 'M784V', 'G1788V', 'N334K', 'M1250T', 'R537P', 'V852I', 'H65Y', 'G17E', 'BIN2-PDGFRB Fusion', 'P47S', 'TMPRSS2-ETV5 Fusion', 'D816V', 'P47A', 'K1702E', 'PVT1-MYC Fusion', 'R132G', 'R132C', 'K50E', 'S1039F', 'T599R', 'S153R', 'L234fs', 'R132H', 'FIG-ROS1 Fusion', 'E501G', 'R132Q', 'G1738E', 'S1841N', 'N126D', 'I35S', 'E866K', 'D587H', 'T417I', 'V677I', 'R15K', 'R15S', 'G1125A', 'G370C', 'V344G', 'Y537C', 'V344A', 'P25L', 'Y537S', 'E75G', 'R121Q', 'L30F', 'L19F', 'N71S', 'V2969M', 'E1978*', 'P780L', 'R755S', 'G129R', 'G776delinsLC', 'S1651F', 'C334S', 'N71K', 'N71I', 'E868G', 'V769E', 'H870R', 'R987W', 'V769M', 'Q579R', 'PCM1-JAK2 Fusion', 'S768_D770dup', 'V1808A', 'R711*', 'P284L', 'A1374V', 'I867S', 'A1234T', 'Y42C', 'Q2416*', 'C248T', 'X1009_splice', 'V1809F', 'D842Y', 'V1809A', 'I122L', 'D842V', 'F161L', 'R1097H', 'D842I', 'Y599F', 'I122S', 'E598_Y599insDVDFREYE', 'A41T', 'L1951R', 'L1407P', 'Q58_E62del', 'V774A', 'C706F', 'Y640F', 'P124L', 'V774M', 'F134Y', 'Exon 1 mutations', 'K83E', 'V191I', 'R2304C', 'C134W', 'G469del', 'P219S', 'R715G', 'R321Q', 'AKAP9-BRAF Fusion', 'Y35C', 'K111E', 'Y35H', 'S464L', 'K111N', 'V384D', 'V506A', 'R369Q', 'R369W', 'E258V', 'T599dup', 'S1651P', 'N810Y', 'A648T', 'Q252H', 'S65L', 'F1695L', 'S65N', 'S784F', 'C569Y', 'M552_W557del', 'CEP85L-ROS1 Fusion', 'F1592S', 'G305W', 'S65A', 'G305R', 'G250E', 'R310H', 'S276L', 'A111P', 'S65W', 'P326L', 'Y1294A', 'A159T', 'K120E', 'D84Y', 'K409Q', 'K120N', 'K120M', 'V1671I', 'D84N', 'D84H', 'EWSR1-WT1 Fusion', 'E580*', 'D384N', 'D84G', 'E622Q', 'A347T', 'N319D', 'G774V', 'R276W', 'P142H', 'A883T', 'L1240V', '2010_2471trunc', 'S459del', 'A636P', 'N319T', 'R1446H', 'Truncating Mutations Upstream of Transactivation Domain', 'R1446C', 'F347L', 'F615S', 'R1189*', 'P40T', 'L345Q', 'E884K', 'R2430M', 'F568fs', 'KDR-PDGFRA Fusion', 'L210R', 'N826Y', 'E731K', 'P151S', 'C696Y', 'P151H', 'G163D', 'BCAN-NTRK1 Fusion', 'P151A', 'A197T', 'IGH-NKX2 Fusion', 'V592A', 'K550_V555delinsI', 'L747_P753del', 'K806A', 'Q22E', 'Q1396R', 'Q22K', 'D770_N771insNPG', 'A232V', 'S80R', 'V559_V560del', 'F400I', 'L321A', 'T244_I245insCPT', 'T37A', 'D277H', 'H68R', 'P1812R', 'P1812S', 'BCR-ABL1 Fusion', 'N549T', 'N239S', 'E76K', 'T37R', 'E622D', 'D252G', 'E76A', 'K128Q', 'Exon 11 mutations', 'R905G', 'G42R', 'V1378I', 'I448V', 'R905Q', 'R905W', 'Y253H', 'Y253F', 'R1076C', 'V369G', 'M1255I', 'Exon 19 deletion', 'R166Q', 'K28M', 'G60D', 'I21V', 'N659S', 'R552G', 'S217A', 'L747V', 'N217I', 'G60R', 'L747P', 'S170R', 'ERLIN2?FGFR1 Fusion', 'S703I', 'S760A', 'G129E', 'G129A', 'CPEB1-NTRK3 Fusion', 'E685V', 'R23A', 'S170N', 'D2723H', 'T785A', 'T286A', 'E41A', 'S1424C', 'M541L', 'T286I', 'Y598C', 'SND1-BRAF Fusion', 'T401I', 'PAX5-JAK2 Fusion', 'D32H', 'R465H', 'T1025S', 'D32N', 'D32A', 'R133*', 'R812A', 'D32Y', 'H650Q', 'YAP1-FAM118B Fusion', 'C157Y', 'Y553_Q556del', 'V289F', 'R18H', 'S3660L', 'H773_V774insH', 'G264S', 'D1270G', 'W345*', 'R133H', 'M918V', 'S45del', 'G796S', 'W183A', 'T19N', 'R583A', 'W24C', 'I836del', 'P34R', 'M351T', 'C443Y', 'S142I', 'Y375C', 'E78K', 'D92A', 'D92G', 'D92E', 'T1720A', 'A272V', 'D92H', 'D92N', 'G251C', 'S31R', 'A205T', 'R625L', 'M535I', 'R625H', 'R2520Q', 'R625D', 'R1625C', 'R625G', 'E525K', 'R625C', 'Q144R', 'G93W', 'F594L', 'Q1756fs', 'I653T', 'G52R', 'V1833M', 'G199R', 'V1833E', 'E1644G', 'N1647K', 'FGFR2-TACC3 Fusion', 'D323H', 'S425C', 'G31R', 'H643D', 'TPM3-NTRK1 Fusion', 'R1204W', 'R689S', 'R689Q', 'G311D', 'R1204G', 'Q72L', 'M90I', 'L704N', 'E563K', 'R337L', 'L384M', 'A634V', 'R337H', 'D194Y', 'K45Q', 'S256W', 'K45T', 'R337C', 'A634D', 'T654I', 'K45N', 'T654M', 'H410R', 'R337P', 'S256F', 'K830R', 'V391I', 'G325A', 'E317K', 'R552S', 'V1605del', 'E69G', 'D254N', 'R320Q', 'F1704S', 'N1178H', 'Q233*', 'C229R', 'G735S', 'A211D', 'D1071N', 'Y16C', 'R201C', 'K558delinsNP', 'L692F', 'S241L', 'L1196M', 'S241F', 'H297N', 'Y298A', 'P250L', 'D646Y', 'S241Y', 'S46N', 'L1196Q', 'S241T', 'R1231Q', 'R368C', 'Q689R', 'S326R', 'Y105C', 'M35R', 'C41Y', 'S335C', 'G35R', 'RUNX1-RUNX1T1 Fusion', 'Y551F', 'V1306I', 'Q984K', 'L248V', 'S297F', 'L607I', 'E355A', 'E17K', 'KIAA1549-BRAF Fusion', 'E322K', 'L507P', 'H570R', 'V14I', 'E709V', 'L1198F', 'E709Q', 'G12V', 'F346V', 'P1859R', 'G12R', 'G12S', 'E709G', 'L1198P', 'E709A', 'G12F', 'G12D', 'G12C', 'S186Y', 'G12A', 'L344P', 'L344R', 'C611Y', 'T352M', 'C554W', 'D130A', 'E746_T751delinsVA', 'N276S', 'L983F', 'A40E', 'D422N', 'E31K', 'R130A', 'Amplification', 'R130G', 'V1736G', 'K601N', 'R130K', 'G508S', 'R130L', 'R802*', 'X963_splice', 'E804G', 'S376F', 'V1688del', 'G207E', 'EWSR1-ETV4 Fusion', 'R479Q', 'R1835P', 'R154W', 'D1280V', 'M269R', 'Y234C', 'R479H', 'E40K', 'S453fs*', 'R479L', 'Y234H', 'E79K', 'T41A', 'Q201H', 'V430M', 'G70D', 'Q58L', 'V555_L576del', 'Q2858R', 'N238S', 'F367S', 'MPRIP-NTRK1 Fusion', 'R2842H', 'E77K', 'D1739E', 'S729C', 'P1749R', 'S858R', 'I1307K', 'C1156F', 'N1236K', 'R130*', 'S1002R', 'C121S', 'H1966Y', 'G591V', 'L32P', 'N535K', 'K320E', 'G637W', 'C1156Y', 'RUNX1-EVI1 Fusion', 'A113_splice', 'H214N', 'R640G', 'S1140G', 'W557G', 'K373E', 'P306H', 'H214R', 'I2675V', 'H214Q', 'G245D', 'G245A', 'TFG-ROS1 Fusion', 'I195T', 'Q477E', 'G245S', 'N372H', 'E1346K', 'R631C', 'L117P', 'I251S', 'E40N', 'E40L', 'C2060G', 'T771R', 'E40W', '963_D1010splice', 'K291E', 'E40Q', 'K342N', 'P491S', '1_2009trunc', 'C136R', 'I219V', 'G216R', 'R1040L', 'R1095H', 'N857S', 'I1616N', 'I1616T', 'T992I', 'N653H', 'Q59E', 'C228T', 'G1809R', 'L46R', 'D257N', 'D300H', 'R183P', 'K117R', 'D300N', 'I26N', 'L46F', 'E586K', 'T241P', 'M160V', 'G1809K', 'E1051K', 'R183G', 'S45P', 'S1297del', 'P253R', 'S1653F', 'S45Y', 'S10N', 'S45A', 'S45F', 'L702H', 'E606G', 'S240R', 'S1653P', 'G1123S', 'E1794D', 'M774_A775insAYVM', 'L601_K602insREYEYDL', 'D331G', 'G1123D', 'E579K', 'V794M', 'F212Y', 'H114Y', 'Q145H', 'T338I', 'E221Q', 'I68K', 'T1685I', 'D86N', 'K1062M', 'G469V', 'D835N', 'Y297A', '256_286trunc', 'G13R', 'Q1503P', 'G13V', 'A717G', 'S2215T', 'G469A', 'S2215Y', 'L272F', 'F341C', 'G13E', 'G13D', 'E330K', 'K558N', 'V1838E', 'I1807S', 'K128N', 'P42T', 'D820G', 'K507Q', 'G161V', 'M133T', 'Y375_K455del', 'E452K', 'D423N', 'K507A', 'F1061W', 'I2285V', 'R1276P', 'R1276Q', 'C634W', 'C634S', 'C634R', 'A532H', 'F1174L', 'F1174I', 'E483*', 'W802*', 'H1421Y', 'V773A', 'G881D', 'I767M', 'S1670A', 'C528S', 'Y803N', 'R378A', 'Y412F', 'F1524V', 'E14*', 'M1628V', 'S1172L', 'E632_L633del', 'S1715R', 'D153V', 'V559G', 'C64G', 'P1771R', 'D806H', 'F958V', 'G810S', 'X434_splice', 'R38H', 'A1789T', 'L493V', 'A34D', 'L493P', 'R1896M', 'A750P', 'G116S', 'R342W', 'N848K', 'I15T', 'L230P', 'W1502A', 'K935I', 'V555M', 'D108H', 'D108N', 'ACPP-PIK3CB Fusion', 'C39S', 'D842_H845del', 'W117C', 'C39Y', 'V1653M', 'R5Q', 'Y113*', 'G39E', 'R481G', 'G2430A', 'Fusions', 'D140G', 'ROS1-CD74 Fusion', 'T670I', 'S2670L', 'P33S', 'C630R', 'A1131T', 'A1020V', 'G382D', 'A1752P', 'EGFRvII', 'D835A', 'ETV6-PDGFRB Fusion', 'S501_A502dup', 'T1087I', 'D1352Y', 'P1139S', 'EGFRvIV', 'K292I', 'ZNF198-FGFR1 Fusion', 'P1771L', 'D845A', 'V777L', 'V777M', 'V777A', 'M1663K', 'G1202R', 'G623R', 'R267Q', 'L57del', 'F133L', 'G430C', 'Y806C', 'K648N', 'F1088Lfs*5', 'R561C', 'A151T', 'RANBP2-ALK Fusion', 'W1038C', 'D162H', 'I89T', 'G266R', 'G284R', 'A2770T', 'Y599_D600insPAPQIMSTSTLISENMNIA', 'M237K', 'I89N', 'G266E', 'D67N', 'T131A', 'M1K', 'S362L', 'R1758G', 'G701S', 'W80A', 'CAD-ALK Fusion', 'I642V', 'M1R', 'M1T', 'D67Y', 'M1V', 'S783P', 'G253C', 'GOLGA4-PDGFRB Fusion', 'I843del', 'I47F', 'T485K', 'R732Q', 'K62R', 'E627D', 'F1734S', 'A1708V', 'Truncating Mutations in the PEST Domain', 'F31I', 'D3170G', 'M1783T', 'R678Q', 'Q120E', 'YAP1-MAMLD1 Fusion', 'T875N', 'M1652T', 'E501K', 'A134D', 'P305L', 'M1652K', 'M1783L', 'M1775E', 'M1783I', 'R1515H', 'P449T', 'G244D', 'N542_E543del', 'DNA binding domain deletions', 'F102C', 'G244R', 'G244S', 'R725C', 'E565G', 'H1620R', 'A290T', 'M1775K', 'X475_splice', 'N82K', 'H845Y', 'E921K', 'G724S', 'F893L', 'K526E', 'G1201E', 'T1203K', 'L830R', 'R331P', 'K101M', 'S476I', 'A8S', 'PRKG2-PDGFRB Fusion', 'S2G', 'L747S', 'W535L', 'S250P', 'L747F', 'R506_K507insVLR', 'H538Q', 'P214L', 'A1708E', 'S59R', 'G1567D', 'A500T', 'Y646N', 'G2748D', 'H115N', 'M1775V', 'R182W', 'L749P', 'L1204F', 'H115R', 'L67P', 'D1692N', 'N1044K', 'EWSR1-FLI1 Fusion', 'SSBP2-JAK2 Fusion', 'E554_V559del', 'S2483N', 'S33C', 'M887I', 'SPAG9-JAK2 Fusion', 'K601E', 'M2327I', 'Y553N', 'L28P', 'EGFRvV', 'T389K', 'K601Q', 'S33F', 'Y3092C', 'A767_V769del', 'M37K', 'P114L', 'P114S', 'V379I', 'L996F', 'BCR-JAK2 Fusion', 'S869L', 'P848L', 'K499E', 'L481F', 'L1460P', 'H193P', 'A389T', 'H193N', 'E255K', 'R213Q', 'K38N', 'E2663V', 'V414A', 'E279K', 'STRN-ALK Fusion', 'E1099K', 'H1402Y', 'G1079D', 'E453A', 'D821N', 'C176F', 'N1125I', 'R879C', 'R1464*', 'L703P', 'T488_P492del', 'P577_D579del', 'E88K', 'N1228D', 'M713I', 'CD74-ROS1 Fusion', 'N2113S', 'R11K', 'EWSR1-DDIT3 Fusion', 'SEC16A1-NOTCH1 Fusion', 'Y236S', 'H36P', 'EWSR1-ETV1 Fusion', 'L550P', 'L399V', 'Y236D', 'Y236C', 'S1512I', 'Y98H', 'FAM131B-BRAF Fusion', 'F1245C', 'R1751P', 'R1751Q', 'I31M', 'K1026E', 'H1904R', 'V242F', 'I563_L576del', 'W509R', 'W742C', 'F1245V', 'DNMT3B7', 'C1483F', 'V1714G', 'E836K', 'C1483R', 'L790F', 'C1483W', 'S46I', 'C1483Y', 'E996K', 'C582F', 'D770_N771insVDSVDNP', 'ATG7-RAF1 Fusion', 'C1385', 'P480L', 'L410R', 'N564K', 'I157T', 'Y238A', 'L1267S', 'G1232D', 'N564D', 'A11_G12insGA', 'V654A', 'Y1853*', 'Exon 20 insertion', 'A2425T', 'D24Y', 'R724H', 'NSD1-NUP98 Fusion', 'V220F', 'R156C', 'Y1853C', 'T1365M', 'H773Y', 'R156H', 'I463S', 'E102_I103del', 'L469V', 'R139G', 'K310R', 'H773L', 'K603Q', 'V3079I', 'L108P', 'FGFR2-KIAA1967 Fusion', 'D1399Y', 'S1463F', 'D761Y', 'V1075F', 'D846Y', 'V211D', 'P2415del', 'I744_K745delinsKIPVAI', 'L747_T751delinsP', 'G87R', 'E116K', 'D594A', 'C18Y', 'D357Y', 'L64P', 'ATF7IP-JAK2 Fusion', 'S387N', 'D1420Y', 'L915M', 'S72R', '533_534del', 'A146T', 'A146V', 'R181C', 'P691S', 'N2436I', 'E664K', 'V560G', 'R181L', 'R315*', 'R181H', 'D92V', 'R1192P', 'R611Q', 'R611W', 'H123D', 'L1780P', 'D1067Y', 'D1067V', 'A148T', 'IGH-BCL2 Fusion', 'K125L', 'L188Q', 'V536M', 'Y842C', 'T319del', 'V536E', 'D1067A', 'H845_N848delinsP', 'H2428Q', 'FGFR2-FAM76A Fusion', 'Y652H', 'CEP85L-PDGFRB Fusion', 'L142P', 'S459F', 'G101W', 'D631A', 'S387Y', 'E626K', 'T341P', 'L726F', 'L726I', 'L2431P', 'E255V', 'R243A', 'E1705A', 'S891A', 'S492R', 'E1705K', 'C77F', 'K39N', 'A4419S', 'R1343L', 'L844R', 'L1600P', 'TEL-RUNX1 Fusion', 'V714M', 'H1106D', 'R389*', 'L833V', 'M2676T', 'A530T', 'A530V', 'A246P', 'G776_V777insYVMA', "3' Deletion", 'E768D', 'V579A', 'K590R', 'P395A', 'L2721H', 'S310F', 'Q56_V60del', 'I1349T', 'BRD4-NUT Fusion', 'R115L', 'S249C', 'D119N', 'Y139D', 'R201H', 'V343E', 'A627T', 'F12L', 'N45S', 'V343L', 'W383R', 'P838L', 'V851A', 'A36P', 'E1210K', 'I2500M', 'Y599_D600insGLYVDFREYEY', 'S846I', 'N219D', 'I2500F', 'R683K', 'G114R', 'Y472C', 'Exon 20 insertions/deletions', 'Y472H', 'G478C', 'R978*', 'W531C', 'T417_D419delinsRG', 'V557I', 'D1203N', 'E321K', 'S1290A', 'E321G', 'G751R', 'R487Q', 'R487W', 'H355M', 'SDC4-ROS1 Fusion', 'V274F', 'K753E', '534_536del', 'K753A', 'MYC-nick', 'Y371H', 'K753M', 'F1662S', 'P525L', 'Y371S', 'C456_R481del', 'A1022E', 'AR-V7', 'T798M', 'R2336H', 'T798I', 'G380R', 'K341A', 'M552_K558del', 'A1669S', 'L861F', 'R2784W', 'L1596H', 'V755I', 'W557_V559delinsC', 'E709K', 'W349C', 'KIF5B-RET Fusion', 'Y647C', 'V1398D', 'Y1248F', 'Q429*', 'P551_V555del', 'I491M', 'V559del', 'R177Q', 'S1473P', 'D2512Y', 'G936R', 'K659N', 'L597V', 'D2512G', 'L221R', 'Y237A', 'K659E', 'F57C', 'RANBP1-ALK Fusion', 'A149P', 'E137K', 'K442Nfs*', 'F57L', 'E79Q', 'R669G', 'F57V', 'E168D', 'E554_K558del', 'R669C', 'D65N', 'L145R', 'C238S', 'I1170S', 'I1170N', 'V1578del', 'T340A', 'H233N', 'C238F', 'L115R', 'F590G', 'T1691K', 'S222D', 'K459_S460delinsN', 'Delta-NTRK1 Fusion', 'E157G', 'I130M', 'F79S', 'D887N', 'L1947R', 'Exon 19 insertion', 'L118R', 'L118P', 'Y353L', 'R1200W', 'V597A', 'D2870A', 'Q472H', 'K181M', 'A767_V769dup', 'N588D', 'IGH-FGFR3 Fusion', 'S478P', 'V544_L545insAVLVLLVIVIISLI', 'S646F', 'R1275L', 'W1610G', 'H94Y', 'I1250T', 'G503V', 'N116H', 'A209T', 'S4F', 'T574_R588delinsL', 'H83Y', 'E746_A750delinsQ', 'S259A', 'H83N', 'S4P', 'K539L', 'C1767S', 'S252W', 'S645C', 'E2419K', 'D1818G', 'V659E', 'L622H', 'N1730S', 'V1741G', 'L49H', 'K428A', 'T244R', 'T3349A', 'E1799K', 'Q337*', 'D390Y', 'K558_V559del', 'N540S', 'L1854P', 'R897Q', 'L348F', 'H1746Q', 'G67W', 'V1676D', 'R206H', 'G67R', 'G67S', 'H396P', 'F351L', 'D927G', 'P449_L455del', 'L348S', 'H1746N', 'A19V', 'L785F', 'Q2384K', 'E326L', 'L251P', 'W1837R', 'W1291R', 'A57V', 'A677G', 'N81Y', 'V191D', 'L1301R', 'P551_E554del', 'S614R', 'W1837C', 'DNA binding domain insertions', 'R683S', 'V32G', 'R268A', 'G466V', 'G466R', 'L485_Q494del', 'G778_P780dup', 'G466E', 'R342P', 'R342Q', 'G466A', 'R922*', 'H191D', 'MSN-ROS1 Fusion', 'D560Y', 'A75P', 'D770_N771insD', 'L910P', 'V804L', 'Q1500P', 'K78I', 'E734Q', 'Q2223K', 'N550H', 'M53I', 'G2032R', 'L1904V', 'R420H', 'C396R', 'R420Q', 'R415G', 'Q61R', 'L1152R', 'FGFR1OP1-FGFR1 Fusion', 'L1152P', 'V742A', 'Q227R', 'T123A', 'G857E', 'G857A', 'N48K', 'K656E', 'Y791F', 'P539R', 'E81K', 'G602R', 'L225LI', 'F81V', 'L792R', 'R1753T', 'S243C', 'V1736A', 'R776C', 'D245V', 'T910M', 'R174*', 'R82P', 'H694R', 'E466K', 'P123M', 'H398Y', 'D609E', 'D1709E', 'D107Y', 'H123Y', 'D1709A', 'A1830T', 'T80A', 'R64P', 'L370fs', 'T2681R', 'NIN-PDGFRB Fusion', 'P428L', 'WDR48-PDGFRB Fusion', 'K125E', 'S243N', 'C242S', 'V126D', 'E1282V', 'R659P', 'S1088F', 'C242F', 'R659L', 'T196A', 'MIR143-NOTCH1 Fusion', 'R158L', 'Exon 20 insertions', 'T582P', 'R158H', 'V277D', 'G31A', 'T283A', 'R158C', 'M232T', 'G1286R', 'FGFR3 - BAIAP2L1 Fusion', 'R659*', 'I255F', 'L1657P', 'T58I', 'I538V', 'Y646S', 'C24Y', 'TMPRSS2-ETV4 Fusion', 'L536H', 'Y646H', 'R2842C', 'L536R', 'L536Q', 'L536P', 'Y646C', 'V217D', 'Y646F', 'G85R', 'Q546K', 'H662R', 'H662Q', 'E518K', 'H123Q', 'V658F', 'Q546E', 'V658A', 'Q79K', 'Q276*', 'Q546P', 'Q546R', 'Q79E', 'F691L', 'E746_S752delinsI', 'E746_S752delinsA', 'L485_P490del', 'R167W', 'T205A', 'F21A', 'C277R', 'L838V', 'C277Q', 'R1190W', 'C277W', 'E633K', 'F158C', 'M587V', 'P1502L', 'P48T', 'E875G', 'A126G', 'D594N', 'A126D', 'W731L', 'D594V', 'A126S', 'D594Y', 'K1299E', 'TP53BP1-PDGFRB Fusion', 'A126V', 'A504_Y505ins', 'V534E', 'D387V', 'A617T', 'I638F', 'Exon 19 deletion/insertion', 'G356A', 'I1171T', 'F129L', 'L783F', 'I1171N', 'A598T', 'R505L', 'V804M', 'R292A', 'C609Y', 'H1047Y', 'V804G', 'H1047R', 'L390F', 'D594G', 'E218*', 'H1047L', 'V2006L', 'L826P', 'ZC3H7B-BCOR Fusion', 'V2006I', 'G465E', 'K57E', 'D1546N', 'K57T', 'T576del', 'R258H', 'S653C', 'L866M', 'P1856S', 'P1856T', 'R258C', 'EWSR1-FEV Fusion', 'CASP8L', 'D594E', 'R88Q', 'K467T', 'Y280H', 'W406A', 'C44F', 'K525E', 'E554_I571del', 'T263P', 'KANK1-PDGFRB Fusion', 'G106_R108del', 'Hypermethylation', 'W131A', 'W131G', 'L485F', 'L188V', 'S56I', 'I18V', 'EBF1-PDGFRB Fusion', 'S1613C', 'R498L', 'S1613G', 'P48R', 'N382H', 'G20E', 'R117G', 'P48L', 'E50K', 'L747_E749del', 'STRN-PDGFRA Fusion', 'C1787S', 'D401N', 'FGFR2-CCDC6 Fusion', 'T1481fs', 'R976H', 'L82P', 'L412F', 'CIC-DUX4 Fusion', 'D106A', 'R100*', 'p61BRAF', 'E29V', 'TMPRSS2-ERG Fusion', 'C360R', 'TGFBR1*6A', 'S119N', 'I326V', 'H1918Y', 'R100T', 'G1194D', 'V1576E', 'V600D_K601insFGLAT', 'R100A', 'V559D', 'L2230V', 'V559C', 'D839G', 'V559A', 'S330A', 'K550_K558del', 'G665A', 'P262H', 'G123R', 'G123S', 'L358R', 'E490K', 'P261L', 'F53C', 'E311_K312del', 'C382R', 'L448P', 'D61N', 'Y373C', 'X1007_splice', 'V569_L576del', 'L747_A750delinsP', 'D171N', 'Y1035C', 'P463L', 'K642E', 'L747_A750del', 'R421*', 'R592H', 'TPR-NTRK1 Fusion', 'F328V', 'Y510C', 'Q157P', 'V1070E', 'L1273F', 'BCR-PDGFRA Fusion', 'L1593P', 'V45L', 'G44S', 'R418G', 'V356R', '550_592del', 'K289E', 'R175L', 'R836C', 'A864T', 'R175H', 'G909R', 'R175C', 'E872K', 'E135K', 'E23fs', 'E142D', 'HMGA2-RAD51B Fusion', 'S384F', 'P316L', 'R634W', 'N1878K', 'D595V', 'R647A', 'P336S', 'K1690N', 'K550_W557del', 'M1689R', 'S279Y', 'T779fs', 'R228K', 'N463S', 'E1356G', 'T417_D419delinsI', 'A614D', 'W279*', 'T674I', 'Q1756C', 'F317L', 'W24S', 'K376N', 'F248S', 'R658Q', 'G248V', 'E545Q', 'LMNA-NTRK1 Fusion', 'EGFR-PURB Fusion', 'S584L', 'R487*', 'E518A', 'N132K', 'E545G', 'K97M', 'E545A', 'R3052Q', 'L747_T751del', 'E545K', 'G322S', 'W368H', 'V648G', 'T733I', 'D74N', 'T1852S', 'V648I', 'E758G', 'R282W', 'D74Y', 'W257C', 'L52R', 'W257G', 'R3052W', 'Y553_K558del', 'M18K', 'A723D', 'K291Q', 'Q50P', 'P34L', 'Q395*', 'R601Q', 'CUX1-FGFR1 Fusion', 'A1200V', 'M664I', 'M18T', 'L147F', 'Q538P', 'D29H', 'Q50*', 'PTPRZ1-MET Fusion', 'H168R', 'N2875K', 'D29Y', 'H231R', 'H168N', 'L63P', 'L63V', 'D83V', 'S1164I', 'I290A', 'E552K', 'N546K', 'S1986Y', 'FLT3 internal tandem duplications', 'I347M', 'E40T', 'S1986F', 'V769_D770insGVV', 'S267_D273dup', 'F468C', 'V84L', 'D1778Y', 'R93W', 'D3095E', 'R93Q', 'A77T', 'P96Q', 'A77P', 'A77S', 'D1778H', 'Q324L', 'N198_F199delinsI', 'R200W', 'D1778N', 'R248W', 'R248Q', 'EZR-ROS1 Fusion', 'P81T', 'T507K', 'H773dup', 'K575M', 'R248C', 'R159G', 'R248L', 'R248K', 'D820A', 'L858Q', 'L858R', 'T50I', 'D557H', 'I32del', 'E219K', 'N345I', 'S463P', 'V561D', 'V561A', 'A60V', 'D537Y', 'E1682V', 'G464A', 'L23F', 'E380Q', 'K50L', 'T1720I', 'G464V', 'G464R', 'C136Y', 'M136R', 'D537E', 'E1682K', 'I111P', 'I111R', 'K2729N', 'R1563S', 'N1068fs*4', 'R161G', 'Q556_K558del', 'T1343I', 'I111A', 'I99M', 'E60L', 'S566_E571delinsR', 'R470C', 'BCR-FGFR1 Fusion', 'I111N', 'K288Q', 'G106D', 'R48W', 'R172S', 'K765R', 'R24C', 'TRIM24-BRAF Fusion', 'R172K', 'R172M', 'G106V', 'R453C', 'W237_Y242del', 'R24P', 'V248D', 'R172G', 'R1060H', 'R110P', 'D473G', 'Y63C', 'A859_L883delinsV', 'V60E', 'R80L', 'R80C', 'V60M', 'E459K', 'G23D', 'N296I', 'T117M', '596_619splice', 'R80P', 'R544W', 'V319D', 'R110L', 'D287H', 'R544S', 'S1841A', 'E439del', 'GIT2-PDGFRB Fusion', 'T73I', 'S1498N', 'R957Q', 'A39P', 'D101Y', 'D402Y', 'S1841R', 'T574insTQLPYD', 'CHTOP-NTRK1 Fusion', 'G1706A', 'P70R', 'L576P', 'G1706E', 'V11A', 'Y406H', 'F808L', 'F1761S', 'N655K', 'T878S', 'F1761I', 'K975E', 'Y27S', 'V1180L', 'L128F', 'T878A', 'E542Q', 'G719S', 'K375A', 'E542V', 'E612_F613insGYVDFREYEYDLKWEFRPRENLEF', 'H878Y', 'R961W', 'G719A', 'R183Q', 'G719C', 'G719D', 'E542G', 'T241M', 'N826S', 'E542K', 'G376R', 'N1387A', 'K413E', 'V271L', 'Q22R', 'V271A', 'E839K', 'V600R', 'T47D', 'V600K', 'V600M', 'V2098I', 'V600E', 'V600D', 'V600G', 'S921R', 'R143Q', 'Y591D', 'V750E', 'K641N', 'K641R', 'Y513A', 'C105F', 'I162M', 'R873Q', 'FGFR1-TACC1 Fusion', 'R79P', 'L283_D294del', 'A883F', 'S310Y', 'E285K', 'L209F', 'K117N', 'L146R', 'Q249E', 'K420A', 'LIMA1-ROS1 Fusion', 'E285V', 'Y68H', 'C275S', 'S36Y', 'V1565I', 'R680*', 'S1303N', 'F154L', 'R844H', 'Y68D', 'EGFR-RAD51 Fusion', 'G334R', 'Q1554H', 'D641N', 'D1853N', 'F958S', 'I834V', 'D641G', 'A120S', 'P417A', 'E143K', 'Q809R', 'W719*', 'EWSR1-YY1 Fusion', 'D60N', 'EGFR-KDD', 'Y1414C', 'X1008_splice', 'R671Q', 'K128T', 'I204T', 'I42V', 'E172K', 'K1452N', 'S123T', 'Q816*', 'R290H', 'G34V', 'BCOR-RARA Fusion', 'C456_N468del', 'R290G', 'D186A', 'F2108L', 'V560D', 'V560E', 'R267P', 'S462Y', 'L1224F', 'FGFR2?PPHLN1 Fusion', 'G434R', 'L1574P', 'L755S', 'L755P', 'E207K', 'P1675L', 'C620R', 'Y384A', 'K483E', 'V842I', 'D2312V', 'K483M', 'R1699W', 'R134Q', 'T730S', 'M117I', 'R361C', 'H78Q', 'H1805P', 'Q347_A348del', 'P577_W582delinsPYD', 'F877L', 'H105R', 'K1434I', 'Y555C', 'M117V', 'C71Y', 'A391E', 'A1843P', 'BCOR-CCNB3 Fusion', 'V422del', 'D816G', 'P1776S', 'D816E', 'A633T', 'W742L', 'A633V', 'MKRN1-BRAF Fusion', 'E717K', 'P2417A', 'N1026S', 'V487_P492delinsA', 'S768I', 'S206C', 'L165P', 'ATF7IP-PDGFRB Fusion', 'E365K', 'Y418_D419del', 'T1977R', 'A1099T', 'M1293A', 'E1214K', 'G1269S', 'L617F', 'Copy Number Loss', 'L617M', 'G1269A', 'A459V', 'Promoter Mutations', 'S151A', 'I408V', 'P95L', 'C618R', 'R287A', 'P95H', 'Y1703H', 'H875Y', 'C311R', 'L274P', 'P173S', 'R2888C', 'R201Q', 'R505C', 'P4309A', 'E267G', 'V705M', 'R249W', 'V35M', 'R249S', 'V705E', 'H1382Y', 'F354L', 'L597R', 'L597S', 'L597Q', 'L1705P', 'ARv567es', 'V194M', 'D572A', 'R249M', 'SPTBN1-PDGFRB Fusion', 'D1733G', 'I103N', 'E70K', 'CUL1-BRAF Fusion', 'R796G', 'V270A', 'Y901C', 'E49K', 'N233Y', 'G785S', 'R441P', 'G863D', 'T80R', 'E812K', 'W563L', 'W398V', 'M244V', 'M391R', 'Exon 9 mutations', 'T167A', 'R776H', 'G863S', 'G596C', 'A750_E758del', 'T80K', 'S1497A', 'Y1235D', 'S1025C', 'S765P', 'M253K', 'N676S', 'Q1064R', 'V354E', 'N676K', 'G914R', 'C24R', 'N78S', 'G31V', 'N676D', 'T24A', 'A23E', 'S37Y', 'L455M', 'A598V', 'R834Q', 'R173P', 'T783A', 'FUS-ERG Fusion', 'C712R', 'K218T', 'R173H', 'E946*', 'R49L', 'R173C', 'R49C', 'C324Y', 'P798L', 'A2034V', 'D61Y', 'A121P', 'A121V', 'F53S', 'G186R', 'N987I', 'V1092I', 'H206D', 'KIF5B-ALK Fusion', 'A121E', 'D737N', 'F53L', 'V197E', 'Wildtype', 'S432L', 'C250T', 'I122V', 'F522C', 'D717V', 'E161del', 'K129E', 'A919V', 'P2273S', 'R47Q', 'P551_W557delinsL', 'M299I', 'E1660G', 'L181P', 'V143A', 'I562M', 'R1598P', 'N1100Y', 'T1685A', 'G375C', 'S37C', 'L485_P490delinsY', 'A59G', 'EZR-ERBB4 Fusion', 'L485_P490delinsF', 'P596L', 'G375P', 'H179R', 'H179Q', 'H179Y', 'I852M', 'M1689T', 'S505N', 'Y599_D600insSTDNEYFYVDFREYEY', 'Y823D', 'H179N', 'L770V', 'H179L', 'Y285C', 'I853T', 'W24R', 'H284P', 'A18D', 'T630I', 'S2215F', 'T315I', 'A707T', 'R304*', 'N553S', 'Q56P', 'H284N', 'S567L', 'A298T', 'R264C', 'T1977S', 'A263V', 'L747_P753delinsS', 'R304W', 'D325A', 'E124Q', 'Q545A', 'P1637L', 'L929S', 'CEP110-FGFR1 Fusion', 'F341V', 'T1977K', 'T1977I', 'G1529R', 'V104M', 'FGFR2-BICC1 Fusion', 'E139D', 'G469E', 'E636K', 'R1262A', 'S37F', 'D617G', 'S840_N841insGS', 'S37A', 'R280T', 'R280A', 'R202S', 'L611_E612insCSSDNEYFYVDFREYEYDLKWEFPRENL', 'D473H', 'P94A', 'R280K', 'R886W', 'V465M', 'Y426A', 'P655R', 'G13C', 'E1060A', 'K517R', 'CD74-NTRK1 Fusion', 'N480del', 'T599_V600insEAT', 'FGFR2-AHCYL1 Fusion', 'M737I', 'C49Y', 'G419V', 'Y87N', 'Y65C', 'S247Y', 'E2014K', 'L330R', 'P648S', 'S1206Y', 'Y87C', 'R833C', 'K745_A750del', 'A95D', 'P648L', 'S247F', 'C1365Y', 'Y835F', 'R1391S', 'E746_A750del', 'S227F', 'R1594Q', 'S227A', 'P753S', 'G1596V', 'R1391G', 'A763_Y764insFQEA', 'KIAA1509-PDGFRB Fusion', 'V843I', 'K52R', 'ESR1-YAP1 Fusion', 'P169H', 'N659K', 'C1265S', 'C47G', 'V1665M', 'K382E', 'R2336P', 'E330G', 'D820E', 'C47S', 'G776delinsVC', 'V726M', 'N505I', 'V894I', 'K335I', 'E82G', 'G853D', 'R174C', 'F1088Sfs*2', 'P130S', 'Y163C', 'L535P', 'PAX8-PPAR? Fusion', 'R273C', 'R273G', 'I1766S', 'G831E', 'P83L', 'C482R', 'R273H', 'R273L', 'E362H', 'EWSR1-ATF1 Fusion', 'ETV6-NTRK3 Fusion', 'LRIG3-ROS1 Fusion', 'G14V', 'N387P', 'S768_V769insVAS', 'K56N', 'H61R', 'V716M', 'L246V', 'H61D', 'A1170V', 'M980T', 'G333S', 'P2476L', 'D404G', 'E462G', 'A1065T', 'S1206R', 'ESR1-CCDC170 Fusion', 'F74S', 'G2101A', 'M374V', 'K745M', 'W1782C', 'R574fs', 'T413N', 'P531S', 'F123I', 'D1344H', 'K650E', 'KDELR2-ROS1 Fusion', 'C634Y', 'K650M', 'R177*', 'K650N', 'K650T', 'F170I', 'C141Y', 'K650Q', 'E1836K', 'K650R', 'EWSR1-ERG Fusion', 'L461V', 'P375S', 'A829P', 'R170W', 'R170Q', 'C481S', 'C381A', 'V173L', 'N822Y', 'E5K', 'S1206C', 'V173E', 'W1837G', 'Y570H', 'D324N', 'T74P', 'L52F', 'R1608S', 'P291Qfs*51', 'A1701P', 'T3211K', 'V765A', 'Deletion', 'G81D', 'S34Y', 'K78A', 'T725M', 'F156L', 'R1726G', 'R1209W', 'G81R', 'G81S', 'T34_A289del', 'L180P', 'H597Y', 'S34F', 'P1709L', 'R1093*', 'A1823T', 'N822H', 'T1691I', 'V155F', 'K162D', 'E1286V', 'SLC34A2-ROS1 Fusion', 'V155A', 'E571K', 'K83N', 'V665A', 'Q110R', 'S121A', 'S121C', 'T1151M', 'N551K', 'G1803A', 'V109G', 'R265S', 'N549K', 'Y64A', 'R370C', 'N549S', 'EWSR1-CREB1 Fusion', 'S746fs', 'R265C', 'S1655F', 'Q1826H', 'L576del', 'S23R', 'G2274V', 'P168T', 'D2723G', 'H174R', 'V471F', 'Q367P', 'S226D', 'Y174N', 'Y35N', 'IGL-MYC Fusion', 'D2665G', 'S257W', 'S196N', 'D661Y', 'D661V', 'L708P', 'G325R', 'V1804A', 'T2250A', 'S451E', 'S451F', 'K65M', 'S1733F', 'L57V', 'L193F', 'TMPRSS2-ETV1 Fusion', 'H132Y', 'A171V', 'M134L', '981_1028splice', 'K1436Q', 'Q209L', 'D842_M844del', 'G1763V', 'R87P', 'G127E', 'G127N', 'D326N', 'R272C', 'R514C', 'EP300-MLL Fusion', 'V370D', 'D935N', 'T468M', 'R87L', 'P44L', 'V294M', 'T599I', 'P44A', 'D450E', 'I1183T', 'D450H', 'A1066V', 'L611V', 'R201W', 'P186S', 'T710A', 'S1715C', 'R1088C', 'V839G', 'P654L', 'S1715N', 'S259F', 'Exon 2 mutations', 'R203C', 'V1673D', 'V1673F', 'A72V', 'S259P', 'A72S', 'W557R', 'A339V', 'K22A', 'D835H', 'NFASC-NTRK1 Fusion', 'D835E', 'E746_T751insIP', 'C125S', 'E265K', 'L265P', 'MLL-TET1 Fusion', 'L325F', 'D835Y', 'T1219I', 'P1819S', 'L37P', 'P26S', 'N181S', 'C27A', 'N235D', 'R841K', 'Y179C', 'N771_H773dup', 'R841Q', 'L1019V', 'V272L', 'R1627', 'C124N', 'Y220S', 'T131I', 'HIP1-PDGFRB Fusion', 'R140Q', 'T131L', 'E709_T710delinsD', 'T131S', 'C809G', 'Y220C', 'C124S', 'C124R', 'T28I', 'R162*', 'T82A', 'L1026F', 'Q635E', 'W515L', 'W515K', 'L63F', 'N56T', 'I33del', 'G1770V', 'N486_P490del', 'IGK-MYC Fusion', 'V128del', 'L97R', 'G101S', 'L582F', 'Y532C', 'Y846C', 'R683T', 'F1888L', 'L668F', 'F1888I', 'F241S', 'H701P', 'PAPSS1-BRAF Fusion', 'A1789S', 'R683G', 'M199del', 'F1888V', 'L112P', 'L112R', 'H1094R', 'H1094Y', 'Q96P', 'K147E', 'Q61L', 'TEL-JAK2 Fusion', 'Q61K', 'Q61H', 'F594_R595insSDNEYFYVDF', 'E106G', 'K82T', 'H1094L', 'Y364H', 'K2411T', 'R838Q', 'D579del', 'D2033N', 'Y572C', 'N1102Y', 'L1433S', 'T844M', 'E190K', 'G373R', 'Y69H', 'C91A', 'N454D', 'L2865V', 'R69C', 'L2106P', 'E541K', 'C91S', 'T75M', 'D84V', 'Y1703S', 'L1844R', 'W2626C', 'G1128A', 'I49S', '385_418del', 'K125R', 'G1128S', 'K125M', 'I668V', 'E160*', 'S35Q', 'S273C', 'I289M', 'X582_splice', 'S100F', 'E281K', 'CCND1-IGH Fusion', 'A727V', 'P490_Q494del', 'V1075Yfs*2', 'D1810A', 'W308C', 'L838P', 'D258N', 'R339W', 'ETV6-PDGFRA Fusion', 'D350G', 'Q164R', 'I290R', 'P704S', 'G118D', 'N564_Y578del', 'Q510E', 'D770_P772dup', 'A1459P', 'I559_D560insDKRMNS', 'V1643A', 'Promoter Hypermethylation', 'E286K', 'G596R', 'S502T', 'G596V', 'H1862L', 'K189N', 'R2659T', 'R661W', 'S68W', 'K79E', 'D96N', 'C39R', 'R2659K', 'R306S', 'G1971E', 'C278F', 'P278A', 'P278L', 'E120Q', 'T1700A', 'R112G', 'E1552del', 'L362R', 'P278S', 'H492R', 'K508M', 'R282Q', 'N1819S', 'N1819Y', 'Y599_D600insEYEYEYEY', 'W714*', 'W1718L', 'M1043V', 'M1043I', 'R571W', 'R348*', 'E3002K', 'Exon 13 deletion', 'R324L', 'T195I', 'R262T', 'C620Y', 'PDE4DIP-PDGFRB Fusion', 'H133Q', 'V509A', 'I2627F', 'Y32S', 'E239A', 'P1311T', 'R2502H', 'E1250K', 'A349P', 'D513Y', 'R2502C', 'W1718C', 'K5N', 'L239R', 'R380A', 'V550E', 'D1778G', 'G660D', 'S214A', 'S214C', 'R1699Q', 'Y772_A775dup', 'C378R', 'D816Y', 'D816F', 'R1699L', 'G34E', 'I843_D846del', 'H1686R', 'H1686Q', 'D816A', 'D816N', 'S214T', 'D816H', 'R20Q', 'Y149D', 'TFG-NTRK1 Fusion', 'C420R', 'Y1045W', 'L585I', 'Y130C', 'Q2405Rfs*17', 'E1322*', 'K618T', 'Y112H', 'A59T', 'BTBD1-NTRK3 Fusion', 'FGFR2-MGEA5 Fusion', 'W603_E604insDREYEYDLKW', 'V197L', 'P573_D579del', 'P85L', 'Y1045*', 'S308A', 'R497H', 'R515G', 'Y489C', 'N517S', 'Y931C', 'K398A', 'R2973C', 'R99P', 'K607T', 'P29S', 'R109*', 'V348L', 'A41P', 'S723F', 'P29L', 'P152L', 'V710A', 'F537_K539delinsL', 'D769A', 'A1752V', 'A2351G', 'E82D', 'P531L', 'D769H', 'P531A', 'L232LI', 'V157D', 'R748G', 'E82V', 'D769Y', 'N659R', 'A546D', 'R71G', 'Truncating Mutations', 'Q1811R', 'E746_T751delinsA', 'K292T', 'K379E', 'P1614S', 'P81L', 'N822I', 'A1843T', 'N234I', 'V299G', 'CCDC6-ROS1 Fusion', 'H41R', 'P577S', 'D408Y', 'Epigenetic Silencing', 'C1697R', 'V299L', 'D408E', 'TRB-NKX2-1 Fusion', 'D1091N', 'R2418G', 'D408H', 'Q579_L581del', 'R462E', 'W1456R', 'R428A', 'E35*', 'S70fsX93', 'V1810G', 'W557_V560del', 'E846K', 'T2722R', 'D1010Y', 'K179M', 'H876Q', 'E127G', 'S428F', 'D1010H', 'D1010N', 'A728V', 'Y40A', 'P286H', 'S1722F', 'P286R', 'V1534M', 'N1333Gfs*', 'D1029Y', 'EGFRvIII', 'I279P', 'S1101N', 'V995M', 'G309A', 'W290_I291delinsC', 'F133V', 'V157F', 'E1071W', 'T150I', 'R462C', 'I168F', 'I151S', 'L122R', 'R462I', 'L1764P', 'D1384V', 'Q60K', 'R164Q', 'R246K', 'G309E', 'EWSR1-NR4A3 Fusion'])

Unique Genes

In [14]:
unique_genes=set(train_varients.Gene)
print unique_genes
print len(unique_genes)
set(['ERRFI1', 'CCNE1', 'STK11', 'MEN1', 'FAM58A', 'AKT1', 'AKT2', 'AKT3', 'ARAF', 'RB1', 'IKBKE', 'NKX2-1', 'ROS1', 'AXL', 'RARA', 'RAD51D', 'BRCA1', 'PIK3CA', 'PIK3CB', 'MAP3K1', 'EIF1AX', 'INPP4B', 'WHSC1L1', 'GATA3', 'FGFR2', 'GLI1', 'CDH1', 'PPP6C', 'MYC', 'YAP1', 'BTK', 'NF2', 'CDKN2B', 'ERBB3', 'IGF1R', 'CDKN2A', 'PIK3R1', 'ERBB4', 'BARD1', 'IDH2', 'IDH1', 'NUP93', 'RET', 'BRD4', 'PMS1', 'MDM2', 'SETD2', 'MDM4', 'FGFR3', 'RAF1', 'FGFR1', 'MYD88', 'CCND1', 'ARID1B', 'ARID1A', 'CCND2', 'B2M', 'TCF7L2', 'KIT', 'FOXA1', 'PTEN', 'FAT1', 'RUNX1', 'WHSC1', 'APC', 'CCND3', 'CTCF', 'KDM5C', 'IL7R', 'DNMT3B', 'BRCA2', 'FOXP1', 'SDHC', 'CDKN1B', 'CDKN1A', 'RRAS2', 'CARM1', 'RIT1', 'PTPN11', 'RICTOR', 'KDM5A', 'XPO1', 'MYCN', 'PPM1D', 'SRSF2', 'ASXL1', 'TSC2', 'RASA1', 'ASXL2', 'JUN', 'PIK3R3', 'PIK3R2', 'H3F3A', 'JAK1', 'MSH2', 'FLT1', 'CHEK2', 'CARD11', 'CTLA4', 'TCF3', 'STAG2', 'ARID2', 'PBRM1', 'RNF43', 'VEGFA', 'HRAS', 'RHOA', 'FGF4', 'FGF3', 'PAX8', 'KEAP1', 'ETV1', 'EPAS1', 'MGA', 'TP53', 'GNAQ', 'ETV6', 'DDR2', 'MPL', 'CBL', 'PAK1', 'MAP2K2', 'MEF2B', 'SHQ1', 'PRDM1', 'NFE2L2', 'NSD1', 'CREBBP', 'AGO2', 'PDGFRB', 'PDGFRA', 'PMS2', 'MAP2K1', 'PPP2R1A', 'SMAD4', 'PIK3CD', 'JAK2', 'ATM', 'SMAD2', 'SMAD3', 'SMO', 'POLE', 'ATR', 'NTRK2', 'PIM1', 'ABL1', 'BRIP1', 'NTRK3', 'IKZF1', 'FLT3', 'NCOR1', 'TSC1', 'STAT3', 'NPM1', 'NFKBIA', 'BCL10', 'FGF19', 'RBM10', 'FANCC', 'FANCA', 'HLA-B', 'KDM6A', 'HLA-A', 'MAPK1', 'FBXW7', 'TGFBR2', 'TGFBR1', 'FUBP1', 'TET1', 'ERCC4', 'TET2', 'RXRA', 'MTOR', 'BCOR', 'DUSP4', 'ATRX', 'EP300', 'RAD51C', 'RAD51B', 'HIST1H1C', 'KNSTRN', 'DICER1', 'ARID5B', 'SOS1', 'VHL', 'ESR1', 'FOXO1', 'MET', 'SHOC2', 'EZH2', 'CDK4', 'KDR', 'CDK6', 'RAD50', 'CDK8', 'RHEB', 'NTRK1', 'GNAS', 'CIC', 'ERBB2', 'ACVR1', 'CDKN2C', 'ERCC2', 'ERCC3', 'SF3B1', 'HNF1A', 'MSH6', 'PTCH1', 'CTNNB1', 'LATS2', 'LATS1', 'ERG', 'CASP8', 'MAP2K4', 'ELF3', 'SMARCA4', 'CEBPA', 'XRCC2', 'BCL2L11', 'MYOD1', 'AXIN1', 'ALK', 'RAD54L', 'NRAS', 'MLH1', 'MED12', 'KLF4', 'AURKA', 'AURKB', 'RYBP', 'TERT', 'RAD21', 'KMT2C', 'KMT2B', 'KMT2A', 'DNMT3A', 'SMARCB1', 'KMT2D', 'SPOP', 'TMPRSS2', 'RAB35', 'SRC', 'CDK12', 'AR', 'TP53BP1', 'EPCAM', 'RAC1', 'KRAS', 'BAP1', 'NF1', 'PTPRT', 'SOX9', 'NOTCH1', 'NOTCH2', 'U2AF1', 'PTPRD', 'FGFR4', 'BRAF', 'GNA11', 'FOXL2', 'EGFR', 'SDHB', 'EWSR1', 'BCL2'])
264

Merging the two train data set and building into one

In [15]:
train = pd.merge(train_varients, train_text, on='ID')
print train.head()
   ID    Gene             Variation  Class  \
0   0  FAM58A  Truncating Mutations      1   
1   1     CBL                 W802*      2   
2   2     CBL                 Q249E      2   
3   3     CBL                 N454D      3   
4   4     CBL                 L399V      4   

                                                TEXT  
0  Cyclin-dependent kinases (CDKs) regulate a var...  
1   Abstract Background  Non-small cell lung canc...  
2   Abstract Background  Non-small cell lung canc...  
3  Recent evidence has demonstrated that acquired...  
4  Oncogenic mutations in the monomeric Casitas B...  
In [16]:
# reading the numbers!
print("there are ", len(train["Variation"]),"rows for the training set")
print("there are ",len(unique_variation), " different values for variations")
print("there are ",len(unique_genes), " different values for genes")
('there are ', 3321, 'rows for the training set')
('there are ', 2996, ' different values for variations')
('there are ', 264, ' different values for genes')
In [17]:
print("Train variant shape : ",train_varients.shape)
print("Train text shape : ",train_text.shape)
('Train variant shape : ', (3321, 4))
('Train text shape : ', (3321, 2))
In [18]:
#(Q) What are the column names in our dataset?
print (train.columns)
Index([u'ID', u'Gene', u'Variation', u'Class', u'TEXT'], dtype='object')
In [19]:
#checking for null values in the merged data
train.TEXT.isnull().values.any()
Out[19]:
False
In [20]:
# counting
train["Class"].value_counts()
Out[20]:
7    953
4    686
1    568
2    452
6    275
5    242
3     89
9     37
8     19
Name: Class, dtype: int64
In [21]:
%matplotlib inline
import matplotlib.pyplot as plt
import seaborn as sns


sns.countplot(x="Class", data=train_varients)
plt.ylabel('Frequency')
plt.xlabel('Class Count')
plt.title("Frequency of Classes")
plt.show()

Findings:

Class levels 3, 8, 9 are notably having least frequency or occurance.

Class Levels 5, 6 are of comparable, medium-low frequency

Class Levels 1, 2, 4 are of comparable, medium-high frequency

Class Level 7 is clearly the most frequent one

In [22]:
# we add a new column in the dataframe called 'LenText' which contains the length of each text field
train.loc[:,'LenText'] =(train['TEXT'].str.len()).astype(int)
print train.head()
   ID    Gene             Variation  Class  \
0   0  FAM58A  Truncating Mutations      1   
1   1     CBL                 W802*      2   
2   2     CBL                 Q249E      2   
3   3     CBL                 N454D      3   
4   4     CBL                 L399V      4   

                                                TEXT  LenText  
0  Cyclin-dependent kinases (CDKs) regulate a var...    39765  
1   Abstract Background  Non-small cell lung canc...    36831  
2   Abstract Background  Non-small cell lung canc...    36831  
3  Recent evidence has demonstrated that acquired...    36308  
4  Oncogenic mutations in the monomeric Casitas B...    41427  
In [23]:
train.groupby('Class')['LenText'].mean().plot(kind='bar',title='Length of text for each class',figsize=(10,6))
plt.show()

Observation:

For Class 9 mutation, the description of the text is highest

For Class 7 and 8, the description length is similar

For Class 1,2 and 4, the description length is almost similar

For Class 3,5 and 6, the description length is least

In [24]:
# creating a new column called Text Count
train.loc[:, 'Text_count']  = train["TEXT"].apply(lambda x: len(x.split()))
train.head()
Out[24]:
ID Gene Variation Class TEXT LenText Text_count
0 0 FAM58A Truncating Mutations 1 Cyclin-dependent kinases (CDKs) regulate a var... 39765 6089
1 1 CBL W802* 2 Abstract Background Non-small cell lung canc... 36831 5722
2 2 CBL Q249E 2 Abstract Background Non-small cell lung canc... 36831 5722
3 3 CBL N454D 3 Recent evidence has demonstrated that acquired... 36308 5572
4 4 CBL L399V 4 Oncogenic mutations in the monomeric Casitas B... 41427 6202
In [25]:
# plot for word count
train.groupby('Class')['Text_count'].mean().plot(kind='bar',title='Average Word count for each class',figsize=(10,6))
plt.show()

This has similar distribution to that of the character count

In [26]:
# box plot for class and word count
plt.figure(figsize=(12,8))
sns.boxplot(x='Class', y='Text_count', data=train)
plt.xlabel('Class', fontsize=12)
plt.ylabel('Text - Number of words', fontsize=12)
plt.show()
In [27]:
print max(train['Text_count'])
76708

This plot tells us that class 3,6 is distinguishable from rest of the classes

Finding null values in text field

In [28]:
count_grp = train.groupby('Class')['Text_count']
count_grp.describe()
Out[28]:
count mean std min 25% 50% 75% max
Class
1 568.0 9444.545775 6513.469240 1.0 4970.00 7286.5 12868.25 52969.0
2 452.0 9306.261062 7624.222077 116.0 4184.25 6810.0 12209.50 61923.0
3 89.0 6751.123596 3724.401696 1737.0 4283.00 5572.0 7415.00 27371.0
4 686.0 8978.188047 7276.221057 53.0 4566.00 6351.0 11521.75 43893.0
5 242.0 7504.409091 3890.285827 183.0 5245.00 6451.0 9513.50 24214.0
6 275.0 7143.058182 3827.958077 1.0 4505.50 6587.0 7626.00 24597.0
7 953.0 11437.771249 10105.042389 1.0 4871.00 8254.0 14619.00 76708.0
8 19.0 10810.105263 5645.073662 2111.0 5586.00 11237.0 15535.00 20612.0
9 37.0 12798.567568 10208.668344 1146.0 4942.00 10917.0 15797.00 45126.0
In [29]:
train[train['Text_count']==1.0]
Out[29]:
ID Gene Variation Class TEXT LenText Text_count
1109 1109 FANCA S1088F 1 null 4 1
1277 1277 ARID5B Truncating Mutations 1 null 4 1
1407 1407 FGFR3 K508M 6 null 4 1
1639 1639 FLT1 Amplification 6 null 4 1
2755 2755 BRAF G596C 7 null 4 1
In [30]:
train[train['Text_count']==1.0].count()
Out[30]:
ID            5
Gene          5
Variation     5
Class         5
TEXT          5
LenText       5
Text_count    5
dtype: int64

So we can conclude that there are some fields where there is no values.

In [33]:
gene_class={}
for i in range(1,10):
    gene_class[i]=set()
    for index, row in train.iterrows():
        if(i==(row['Class'])):
                gene_class[i].add(row['Gene'])
print gene_class
{1: set(['ERRFI1', 'STK11', 'FAM58A', 'B2M', 'RB1', 'RAD51D', 'BRCA1', 'MAP3K1', 'INPP4B', 'NTRK2', 'GATA3', 'HNF1A', 'CDH1', 'NF2', 'CDKN2B', 'CDKN2C', 'IGF1R', 'CDKN2A', 'NF1', 'BARD1', 'NUP93', 'RET', 'PMS2', 'SETD2', 'FGFR2', 'FGFR3', 'FGFR1', 'ARID1B', 'ARID1A', 'TCF7L2', 'FOXA1', 'PTEN', 'FAT1', 'RUNX1', 'APC', 'PTPN11', 'CTCF', 'BRCA2', 'FOXP1', 'CDKN1B', 'CDKN1A', 'FUBP1', 'CARM1', 'XRCC2', 'ASXL1', 'ASXL2', 'PIK3R3', 'PIK3R2', 'PIK3R1', 'JAK1', 'CHEK2', 'CTLA4', 'STAG2', 'ARID2', 'PBRM1', 'RNF43', 'RAD21', 'RHOA', 'MGA', 'TP53', 'ETV6', 'CBL', 'SHQ1', 'PRDM1', 'CREBBP', 'AGO2', 'PDGFRB', 'PDGFRA', 'SMAD4', 'ATM', 'SMAD2', 'SMAD3', 'POLE', 'ATR', 'BRIP1', 'TSC2', 'FLT3', 'NCOR1', 'TSC1', 'RASA1', 'RBM10', 'FANCC', 'FANCA', 'HLA-B', 'KDM6A', 'HLA-A', 'FBXW7', 'TGFBR2', 'TGFBR1', 'KMT2D', 'TET1', 'ERCC4', 'TET2', 'BCOR', 'ATRX', 'EP300', 'RAD51C', 'ERCC2', 'HIST1H1C', 'KNSTRN', 'DICER1', 'ARID5B', 'KEAP1', 'RAD50', 'CIC', 'PPP2R1A', 'MSH2', 'ERCC3', 'EGFR', 'MSH6', 'VHL', 'FOXO1', 'SOX9', 'CASP8', 'MAP2K4', 'ELF3', 'SMARCA4', 'CEBPA', 'BCL2L11', 'AXIN1', 'CDK12', 'MLH1', 'MED12', 'GLI1', 'RYBP', 'KMT2C', 'KMT2A', 'DNMT3A', 'SMARCB1', 'PIM1', 'SPOP', 'AR', 'TP53BP1', 'NPM1', 'BCL10', 'PTPRT', 'ERBB2', 'NOTCH1', 'NOTCH2', 'PTPRD', 'BAP1', 'PTCH1']), 2: set(['AKT1', 'AKT2', 'AKT3', 'ARAF', 'IKBKE', 'NKX2-1', 'ROS1', 'AXL', 'PIK3CA', 'PIK3CB', 'NTRK1', 'NTRK2', 'FGFR3', 'MYC', 'FGFR1', 'ERBB2', 'IGF1R', 'ERBB4', 'FGFR2', 'GATA3', 'KRAS', 'CCND1', 'PTPN11', 'CCND2', 'KIT', 'FOXA1', 'CCND3', 'KDM5C', 'HRAS', 'NFKBIA', 'RIT1', 'XPO1', 'MYCN', 'JAK2', 'PIK3R2', 'CARD11', 'RHOA', 'ETV1', 'EPAS1', 'TP53', 'ETV6', 'DDR2', 'MPL', 'CBL', 'KDM5A', 'MEF2B', 'NFE2L2', 'NSD1', 'BTK', 'AGO2', 'PDGFRB', 'PDGFRA', 'RET', 'SMO', 'ABL1', 'NTRK3', 'RAF1', 'FLT3', 'STAT3', 'FGF19', 'KDR', 'FBXW7', 'SRC', 'MTOR', 'EP300', 'YAP1', 'ALK', 'SOS1', 'ESR1', 'MET', 'GNAS', 'ACVR1', 'TGFBR2', 'ERCC2', 'EGFR', 'CTNNB1', 'MAP2K2', 'MAP2K1', 'ERG', 'MAP2K4', 'MYOD1', 'NRAS', 'MED12', 'KLF4', 'AURKB', 'TERT', 'DNMT3B', 'KMT2A', 'DNMT3A', 'PIM1', 'TMPRSS2', 'AR', 'NOTCH1', 'NOTCH2', 'BRAF', 'EWSR1']), 3: set(['PDGFRA', 'EGFR', 'SMAD3', 'AKT1', 'SOX9', 'CASP8', 'FLT3', 'TSC1', 'HNF1A', 'CHEK2', 'FBXW7', 'BRCA1', 'ERBB4', 'TMPRSS2', 'NUP93', 'MTOR', 'PMS2', 'RAD51C', 'KRAS', 'TP53', 'ALK', 'ESR1', 'CBL', 'KIT', 'PTEN', 'BRAF']), 4: set(['STK11', 'MEN1', 'B2M', 'NTRK1', 'CDH1', 'ERBB2', 'CDKN2A', 'PIK3R1', 'IDH1', 'RET', 'PMS1', 'PMS2', 'SETD2', 'GATA3', 'PTPN11', 'KIT', 'PTEN', 'FAT1', 'RUNX1', 'APC', 'KDM5C', 'BRCA2', 'CDKN1B', 'SRSF2', 'TSC2', 'PPP6C', 'NF1', 'CHEK2', 'TCF3', 'PBRM1', 'RHOA', 'TP53', 'ETV6', 'CBL', 'MEF2B', 'PRDM1', 'BTK', 'PDGFRB', 'PDGFRA', 'SMAD4', 'ATM', 'SMAD2', 'SMAD3', 'POLE', 'BRIP1', 'IKZF1', 'TSC1', 'CREBBP', 'RBM10', 'FANCA', 'FBXW7', 'TGFBR2', 'TGFBR1', 'ERCC4', 'TET2', 'FOXL2', 'EP300', 'KNSTRN', 'DICER1', 'VHL', 'LATS1', 'KEAP1', 'SHOC2', 'CDK6', 'RAD50', 'BRCA1', 'MSH2', 'ERCC3', 'EGFR', 'MSH6', 'LATS2', 'MAP2K1', 'SOX9', 'MAP2K4', 'SMARCA4', 'CDK12', 'MLH1', 'AURKA', 'TERT', 'KMT2C', 'SPOP', 'CDKN2B', 'EPCAM', 'BAP1', 'PTPRT', 'NOTCH1', 'CASP8', 'RAD54L', 'PTPRD', 'BRAF', 'SDHC', 'SDHB']), 5: set(['KMT2C', 'PDGFRA', 'SMAD4', 'ERCC2', 'ATM', 'EGFR', 'SMAD2', 'SMO', 'AKT1', 'MAP2K1', 'BRCA2', 'RB1', 'ELF3', 'NTRK2', 'TSC1', 'CDK12', 'PIK3CA', 'MAP3K1', 'BRCA1', 'MLH1', 'RET', 'PIK3R2', 'KRAS', 'MSH2', 'FBXW7', 'ERBB2', 'ERBB3', 'STK11', 'PIM1', 'IDH1', 'TET2', 'AR', 'MTOR', 'MSH6', 'PMS2', 'FGFR2', 'FGFR3', 'FGFR1', 'TP53', 'ALK', 'VHL', 'NF2', 'NOTCH1', 'CBL', 'PTEN', 'BRAF', 'PTCH1', 'CREBBP']), 6: set(['PDGFRB', 'PPP2R1A', 'SMAD4', 'CDKN1B', 'CDKN1A', 'EGFR', 'SMAD2', 'SMO', 'VHL', 'MAP2K1', 'BRCA2', 'WHSC1L1', 'RARA', 'BRIP1', 'ROS1', 'FLT1', 'XPO1', 'BRCA1', 'PIK3CA', 'KMT2B', 'MAP3K1', 'NRAS', 'MLH1', 'FGFR2', 'JAK2', 'PIK3R2', 'DUSP4', 'MYC', 'CHEK2', 'MSH2', 'KDR', 'NF2', 'ERBB2', 'ERBB3', 'KMT2A', 'CDKN2A', 'ERBB4', 'PIM1', 'TET2', 'RXRA', 'AR', 'FOXL2', 'FGF4', 'FGF3', 'FGFR3', 'KRAS', 'TP53', 'RB1', 'MAP2K2', 'DDR2', 'PTPN11', 'CBL', 'KIT', 'MET', 'RUNX1', 'BRAF']), 7: set(['AKT1', 'AKT2', 'AKT3', 'ARAF', 'NKX2-1', 'ROS1', 'PIK3CA', 'PIK3CB', 'PIK3CD', 'NTRK1', 'NTRK3', 'MYC', 'KRAS', 'ERBB2', 'ERBB3', 'ERBB4', 'IDH1', 'BRD4', 'FGFR4', 'FGFR2', 'FGFR3', 'FGFR1', 'CCND1', 'WHSC1', 'PTPN11', 'KIT', 'FOXA1', 'PTEN', 'RUNX1', 'CCND3', 'IL7R', 'HRAS', 'RRAS2', 'RIT1', 'RICTOR', 'MYCN', 'PPM1D', 'RAF1', 'CCNE1', 'JUN', 'JAK2', 'PIK3R2', 'JAK1', 'CDK4', 'MYD88', 'CARD11', 'VEGFA', 'RHOA', 'EZH2', 'PAX8', 'ETV1', 'GNAQ', 'ESR1', 'DDR2', 'MPL', 'PAK1', 'MEF2B', 'NFE2L2', 'BTK', 'PDGFRB', 'PDGFRA', 'EIF1AX', 'RET', 'SMO', 'ABL1', 'TSC2', 'FLT3', 'STAT3', 'MAPK1', 'KDR', 'SRC', 'NFKBIA', 'TET1', 'MTOR', 'EP300', 'YAP1', 'ALK', 'SOS1', 'ETV6', 'MET', 'SOX9', 'MDM2', 'CDK6', 'CDK8', 'RHEB', 'GNAS', 'PPP2R1A', 'ACVR1', 'EGFR', 'CTNNB1', 'MAP2K2', 'MAP2K1', 'ERG', 'RAB35', 'NRAS', 'MDM4', 'MED12', 'AURKA', 'TERT', 'RAC1', 'TMPRSS2', 'AR', 'NOTCH1', 'CASP8', 'BRAF', 'GNA11', 'EWSR1', 'BCL2']), 8: set(['ERBB2', 'DNMT3B', 'BCOR', 'SF3B1', 'IDH1', 'EGFR', 'RAD51B', 'AKT1', 'H3F3A', 'IDH2', 'RHOA', 'PTPN11']), 9: set(['CIC', 'TP53', 'U2AF1', 'IDH2', 'IDH1', 'SF3B1', 'MED12', 'AKT2', 'EZH2', 'CTCF'])}
In [34]:
unique_gene_count={}

for k,v in sorted(gene_class.items()):
    unique_gene_count[k]=len(v)
print unique_gene_count
{1: 142, 2: 96, 3: 26, 4: 92, 5: 48, 6: 56, 7: 108, 8: 12, 9: 10}
In [35]:
plt.figure()
plt.bar(unique_gene_count.keys(),unique_gene_count.values())
plt.title('Number of Unique genes per Class')
plt.show()
In [36]:
# unique variation per class
variation_class={}
for i in range(1,10):
    variation_class[i]=set()
    for index, row in train.iterrows():
        if(i==(row['Class'])):
                variation_class[i].add(row['Variation'])
print variation_class
{1: set(['E70K', 'G508S', 'W1610G', 'R802*', 'A209T', 'V1688del', 'G207E', 'R1835P', 'M253K', 'N238S', 'V1714G', 'R2450*', 'Y24C', 'I1307K', 'D520N', 'T24A', 'L455M', 'K218T', 'G637W', 'C324Y', 'H214N', 'L348F', 'G67R', 'G17E', 'K1702E', 'L348S', 'H1746N', 'H214R', 'H214Q', 'G245D', 'G245A', 'W1291R', 'I195T', 'W1837C', 'K129E', 'P153H', 'W1837G', 'G1743R', 'I251S', 'E40W', 'D1709E', 'K291E', 'V143A', 'R283Q', '1_2009trunc', 'P179L', 'R1040L', 'P179R', 'D1739Y', 'L46R', 'D257N', 'R183Q', 'R183P', 'D300N', 'M1689T', 'K11R', 'G423R', 'H179N', 'R183G', 'H284P', 'R174C', 'E606G', 'H284N', 'S240R', 'R337L', 'A298T', 'S215C', 'Q545A', 'V1736G', 'D1739E', 'V242F', 'N510K', 'P133T', 'T1685A', 'D617G', 'R280T', 'R280A', 'R202S', 'Y297A', 'A707T', 'L370fs', 'S567L', 'Q1503P', 'A717G', 'Y426A', 'F341C', 'V1838E', 'L330R', 'G271E', 'I1807S', 'K517R', 'S1088F', 'R2602T', 'R698W', 'E330G', 'M133T', 'C242F', 'C135S', 'R659L', 'D423N', 'K507A', 'A211D', 'R158C', 'R315*', 'S1036P', 'I255F', 'L1657P', 'Y835F', 'M1775E', 'S142I', 'G1596V', 'R378A', 'T17A', 'Y412F', 'F119S', 'M1628V', 'K507Q', 'K428A', 'S102F', 'R2842C', 'L388M', 'L493V', 'R1896M', 'C277R', 'C277Q', 'C277W', 'R273C', 'R273G', 'D108H', 'D108N', 'L481F', 'G1656D', 'V1653M', 'R1515H', 'E3002K', 'V1741G', 'Fusions', 'M1R', 'T605M', 'S1164I', 'Y98N', 'R292A', 'F74S', 'K292T', 'V804G', 'A1843T', 'A1843P', 'E218*', 'T413N', 'L826P', 'R251Q', 'G480W', 'C141Y', 'L1407P', 'E1836K', 'D121G', 'L461V', 'S297F', 'D493A', 'Y280H', 'W406A', 'L43V', 'R373H', 'A161T', 'D609E', 'Q331R', 'W131A', 'Q331H', 'M237K', 'V536M', 'G266E', 'P291Qfs*51', 'I642V', 'Q58_Q59insL', 'Deletion', 'T1481fs', 'R976H', 'R732Q', 'A122*', 'F1734S', 'G665A', 'P1709L', 'G1788D', 'R647A', 'P305L', 'M1652K', 'G244D', 'R3052W', 'E2663V', 'A290T', 'A347T', 'G1803A', 'A636P', 'R265S', 'L448P', 'L234fs', 'D171G', 'S746fs', 'R331P', 'P463L', 'N296I', 'D2723G', 'R421*', 'M713I', 'D2723H', 'F328V', 'S196N', 'L708P', 'G1567D', 'G2748D', 'R182W', 'L1584R', 'A1708V', 'E75G', 'V356R', 'L193F', 'H132Y', 'E1978*', 'R755S', 'R175C', 'G1763V', 'R987W', 'R514C', 'P316L', 'V370D', 'M1689R', 'S279Y', 'T779fs', 'D450E', 'D450H', 'V1809F', 'W279*', 'S1715C', 'Q1756C', 'Y599F', 'V1673D', 'V1673F', 'F248S', 'R658Q', 'C706F', 'F134Y', 'R631C', 'T1219I', 'Y598C', 'V648G', 'P26S', 'R841K', 'W257C', 'V272L', 'R369Q', 'R11K', 'R369W', 'E258V', 'K291Q', 'R1088C', 'L550P', 'R601Q', 'A648T', 'R162*', 'Q538P', 'G1125A', 'S65A', 'R1751P', 'R310H', 'S276L', 'H1904R', 'A111P', 'A159T', 'E1660G', 'G101S', 'H168N', 'D3095E', 'S459del', 'N82K', 'Y238A', 'R1446H', 'Truncating Mutations Upstream of Transactivation Domain', 'R1446C', 'R248Q', 'F568fs', 'P151H', 'R248L', 'E106G', 'R248K', 'V220F', 'R156C', 'D557H', 'E219K', 'L1854P', 'F400I', 'H68R', 'G87R', 'L64P', 'X582_splice', 'P38L', 'K288Q', 'E664K', 'R181L', 'K765R', 'R453C', 'W237_Y242del', 'R1076C', 'R110P', 'S217T', 'Q510E', 'S217A', 'S217C', 'L188V', '596_619splice', 'R80P', 'R5Q', 'V319D', 'R110L', 'I2675V', 'S1841A', 'S1498N', 'R2659T', 'G129A', 'S1841N', 'K79E', 'S1841R', 'P70R', 'C278F', 'P278A', 'F1761I', 'R243A', 'R282W', 'R282Q', 'A4419S', 'L844R', 'R812A', 'F615S', 'D300H', 'Exon 13 deletion', 'N1387A', 'K420A', 'R262T', 'I2627F', 'P1311T', 'R583A', 'A349P', 'D513Y', 'I162M', 'R380A', 'E285K', 'Y384A', 'F1704S', 'I279P', 'V157F', 'R1699L', 'L146R', 'H1686R', 'V1565I', 'Q144R', 'Q1554H', 'G114R', 'Y112H', 'V1833M', 'G199R', 'Y472H', 'V1833E', 'V197L', 'R689Q', 'S723F', 'R487Q', 'R487W', 'P152L', 'V274F', 'A1752V', 'R2336P', 'R337H', 'L535P', 'S256W', 'R290G', 'V197E', 'D186A', 'D1709A', 'R337P', 'S256F', 'R267Q', 'K341A', 'Truncating Mutations', 'Q1811R', 'Y98H', 'R201W', 'R487*', 'W349C', 'V299G', 'C229R', 'Epigenetic Silencing', 'R249S', 'S241L', 'S241F', 'Y298A', 'P250L', 'V1810G', 'A149P', 'S206C', 'E137K', 'K442Nfs*', 'S428F', 'L607I', 'W1718L', 'H233N', 'R290H', 'T1691K', 'T1691I', 'Y1703H', 'Y1703S', 'D887N', 'E1071W', 'R2784W', 'P4309A', 'T150I', 'R462C', 'L118R', 'L344R', 'L118P', 'Y353L', 'I151S', 'D130A', 'L1705P', 'A40E']), 2: set(['Amplification', 'CUL1-BRAF Fusion', 'R1275L', 'L485_Q494del', 'Q227L', 'S151A', 'G623R', 'EWSR1-ETV4 Fusion', 'M244V', 'R833C', 'E81K', 'C2060G', 'S453fs*', 'G863S', 'Q227R', 'S645C', 'Y1463S', 'N581Y', 'F359C', 'N581S', 'G914R', 'N581D', 'N676D', 'T783A', 'D289_D292del', 'K558_V559del', 'V561A', 'A2034V', 'H396R', 'H396P', 'F53C', 'V569_L576del', 'GPIAP1-PDGFRB Fusion', 'P34R', 'P261L', 'K558_E562del', 'TFG-ROS1 Fusion', 'Q477E', 'D808N', 'TRA-NKX2-1 Fusion', 'P106L', 'L387M', 'P551_W557delinsL', 'E40L', 'G466R', 'V1671I', 'E40T', 'E40Q', 'P491S', 'E746G', 'MSN-ROS1 Fusion', 'L485_P490delinsY', 'N857S', 'E746Q', 'L485_P490delinsF', 'P596L', 'T73I', 'G2032R', 'S505N', 'T241P', 'K117N', 'E1051K', 'G857E', 'T315I', 'N463S', 'N553S', 'E69K', 'V487_P492delinsA', 'H68Y', 'MIR143-NOTCH1 Fusion', 'F876L', 'E579K', 'V794M', 'E139D', 'G469V', 'D473H', 'D473G', 'Y553_K558del', 'G13R', 'NIN-PDGFRB Fusion', 'G13V', 'G469E', 'A504_Y505ins', 'G13E', 'G13C', 'K558N', 'V773A', 'D820A', 'D820G', 'D820E', 'T599_V600insEAT', 'W557G', 'N676S', 'C634W', 'W802*', 'F311L', 'S752_I759del', 'T58I', 'TMPRSS2-ETV4 Fusion', 'P753S', 'KIAA1509-PDGFRB Fusion', 'S33F', 'ATG7-RAF1 Fusion', 'Q79R', 'Y1003*', 'V726M', 'G328E', 'T205A', 'E633K', 'D594G', 'P691S', 'C482R', 'ACPP-PIK3CB Fusion', 'EWSR1-ATF1 Fusion', 'D594Y', 'TP53BP1-PDGFRB Fusion', 'Y1003C', 'Fusions', 'P287S', 'L232LI', 'K1452N', 'G382D', 'L861F', 'G2101A', 'E265K', 'R882C', 'L861P', 'P531S', 'V2006L', 'K292I', 'G465E', 'KDELR2-ROS1 Fusion', 'F384L', 'K650M', 'V777A', 'K650N', 'K650T', 'F384Y', 'K650Q', 'K650R', 'F384V', 'A829P', 'Y806C', 'EWSR1-FEV Fusion', 'Y34C', 'K467T', 'G106_R108del', 'N564_Y578del', 'CAD-ALK Fusion', 'W557_V559delinsC', 'GOLGA4-PDGFRB Fusion', 'E554_I571del', 'G128V', 'Truncating Mutations in the PEST Domain', 'R678Q', 'YAP1-MAMLD1 Fusion', 'P2412T', 'E275K', 'V559C', 'R537P', 'V559A', 'E571K', 'T674I', 'BIN2-PDGFRB Fusion', 'S121A', 'TMPRSS2-ETV5 Fusion', 'E490K', 'PVT1-MYC Fusion', 'Y64A', 'R370C', 'EWSR1-CREB1 Fusion', 'K526E', 'L747V', 'L747S', 'M541L', 'Y570H', 'L747F', 'M918V', 'V344A', 'G434R', 'SSBP2-JAK2 Fusion', 'R836C', 'E554_V559del', 'SPAG9-JAK2 Fusion', 'V769M', 'Y553N', 'P44L', 'P284L', 'A1374V', 'A767_V769del', 'M37K', 'Q2416*', 'V379I', 'L611V', 'T710A', 'E255V', 'Exon 2 mutations', 'E255K', 'V774A', 'V414A', 'E279K', 'STRN-ALK Fusion', 'EGFR-PURB Fusion', 'V774M', 'M664I', 'E453A', 'D821N', 'A919V', 'G469del', 'L37P', 'T733I', 'E758G', 'T488_P492del', 'P577_D579del', 'S464L', 'K111N', 'C809G', 'CEP85L-ROS1 Fusion', 'A633T', 'T28I', 'Q252H', 'Q635E', 'PTPRZ1-MET Fusion', 'W515K', 'I563_L576del', 'G250E', 'H231R', 'N486_P490del', 'I290A', 'K409Q', 'S1986Y', 'P490_Q494del', 'S1986F', 'D384N', 'Y532C', '2010_2471trunc', 'R93Q', 'A77T', 'N826Y', 'A2425T', 'K575M', 'Q61H', 'NSD1-NUP98 Fusion', 'BCAN-NTRK1 Fusion', 'IGH-NKX2 Fusion', 'L858Q', 'T50I', 'H773Y', 'D579del', 'K806A', 'N345I', 'S463P', 'L858M', 'G503V', 'Q22K', 'D837N', 'K603Q', 'FGFR2-KIAA1967 Fusion', 'D761Y', 'P2415del', 'D277H', 'I111P', 'I111R', 'T37A', 'Q556_K558del', 'I111A', 'WDR48-PDGFRB Fusion', 'E281K', '533_534del', 'A146V', 'Overexpression', 'G106D', 'R48W', 'L838P', 'Y253H', 'G13D', 'D387V', 'I290R', 'M1255I', 'E746V', 'A859_L883delinsV', 'V60E', 'E459K', 'PRKG2-PDGFRB Fusion', 'D287H', 'R544S', 'GIT2-PDGFRB Fusion', 'H2428Q', 'G596V', 'V852I', 'CEP85L-PDGFRB Fusion', 'D101Y', 'CHTOP-NTRK1 Fusion', 'G1971E', 'L726F', 'N655K', 'E542Q', 'C77F', 'G719C', 'G719D', 'H1106D', 'YAP1-FAM118B Fusion', 'V289F', 'A627T', 'PDE4DIP-PDGFRB Fusion', 'V2098I', 'A530T', 'A530V', 'E1935G', 'E768D', 'K641N', 'M351T', 'M552_K558del', 'L283_D294del', 'S214A', 'D119N', 'C378R', 'D816E', 'MKRN1-BRAF Fusion', 'D816A', 'LIMA1-ROS1 Fusion', 'T574_R588delinsL', 'P838L', 'S36Y', 'SPTBN1-PDGFRB Fusion', 'Q2405Rfs*17', 'N219D', 'I653T', 'A59T', 'FGFR2-MGEA5 Fusion', 'D1203N', 'R671Q', 'L384M', '534_536del', 'E82D', 'P531L', 'P531A', 'V804L', 'E82V', 'T798M', 'F2108L', 'V560E', 'R267P', 'L1224F', 'Truncating Mutations', 'N822Y', 'V1605del', 'V765A', 'G123S', 'K483E', 'T574insTQLPYD', 'K483M', 'Y647C', 'R134Q', 'CCDC6-ROS1 Fusion', 'P577_W582delinsPYD', 'V299L', 'F317L', 'TRB-NKX2-1 Fusion', 'D1091N', 'L692F', 'W1456R', 'L703P', 'L221R', 'W557_V560del', 'Q249E', 'I852M', 'K659E', 'P2417A', 'P780L', 'D1010Y', 'M552_W557del', 'F57L', 'S335C', 'ATF7IP-PDGFRB Fusion', 'F57V', 'S904F', 'E554_K558del', 'D1010H', 'D1010N', 'L248V', 'Q22R', 'E355A', 'V1578del', 'D1029Y', 'E709V', 'E709Q', 'E709G', 'G12F', 'L1947R', 'G12C', 'E267G', 'G12A', 'R462E', 'L122R', 'L597R', 'L597S', 'Q472H', 'D2033N']), 3: set(['D324N', 'T167A', 'N870S', 'L362R', 'C248T', 'I1183T', 'R2505Q', 'E839K', 'R2505*', 'V271L', 'G831E', 'D1270G', 'R133H', 'R1097H', 'T244R', 'G264S', 'V534E', 'G591V', 'N132K', 'P2476L', 'L52F', 'A121V', 'N181S', 'R1464*', 'T1087I', 'A1099T', 'Wildtype', 'A19V', 'S432L', 'S1651F', 'G829R', 'T131S', 'G334R', 'A1200V', 'I834V', 'S562L', 'G305W', 'G305R', 'P85L', 'I31M', 'R170Q', 'V557I', 'I562M', 'V348L', 'I42V', 'R2430M', 'E996K', 'Q59E', 'P142H', 'D67Y', 'M160V', 'F1888V', 'K830R', 'A41P', 'A41T', 'L202F', 'S153R', 'T1977S', 'Y364H', 'D1071N', 'R246K', 'G1035S', 'P47S', 'N454D', 'D2512Y', 'D2512G', 'T75M', 'A633V', 'Y237A', 'R1231Q', 'Y105C', 'K125R', 'S2215T', 'I18V', 'C628Y', 'G1286R', 'T417I', 'M737I', 'H570R', 'T196A', 'R1060H', 'V369G', 'I21V', 'L1204F', 'S186Y', 'I122V', 'R462I', 'V194M', 'G1706A']), 4: set(['R130A', 'R130G', 'R130K', 'R796G', 'Y901C', 'S376F', 'H93Q', 'H93R', 'R479Q', 'F170I', 'R154W', 'H93D', 'F241S', 'Y234C', 'R479H', 'R479L', 'Y234H', 'T1623I', 'G70D', 'S786F', 'S765P', 'F367S', 'V354E', 'P1749R', 'S858R', 'N78S', 'R272C', 'S270L', 'A23E', 'L191H', 'Q337*', 'H118P', 'D390Y', 'R130*', 'R173P', 'D603N', 'L32P', 'R173H', 'E946*', 'D351H', 'R173C', 'R49C', 'RUNX1-EVI1 Fusion', 'R897Q', 'A121P', 'G17V', 'P86H', 'G67W', 'G67R', 'G67S', 'G1738R', 'K373E', 'V411L', 'D927G', 'G17A', 'H206D', 'A121E', 'K120M', 'G165V', 'E326L', 'G165R', 'W1837R', 'R496H', 'A57V', 'G165E', 'L1301R', 'G245S', 'T468M', 'E161del', 'DNA binding domain insertions', 'V118D', 'L117P', 'CASP8L', 'R268A', 'K342N', 'H83Y', 'R342P', 'D24Y', 'R922*', 'H191D', 'L1780P', 'D560Y', 'R470C', 'G375P', 'H179R', 'M53I', 'H179Q', 'H179Y', 'C396R', 'R420Q', 'G423V', 'G1809K', 'P48T', 'H179L', 'G52R', 'A1685S', 'I853T', 'N48K', 'S1653F', 'M224R', 'P94A', 'S10N', 'H83N', 'R304*', 'E69G', 'S1653P', 'V321M', 'F81V', 'L792R', 'G35R', 'Copy Number Loss', 'R304W', 'C44F', 'R1753T', 'S215G', 'G35A', 'D331G', 'C44Y', 'T910M', 'K4E', 'R82P', 'D1739G', 'H114Y', 'Q145H', 'K125M', 'H398Y', 'R201Q', 'G75R', 'D1739V', 'D171N', 'T1685I', 'D86N', 'L181P', 'D107Y', 'H123Y', 'Y155C', 'L424V', 'E483*', 'R64P', 'F154L', 'R280K', 'F341V', 'T77P', 'S243C', 'L424I', 'E330K', 'V126D', 'L158Q', 'L158P', 'V1713A', 'F1088Sfs*2', 'C135R', 'C71Y', 'G419V', 'Y87N', 'R158L', 'S1715N', 'R158H', 'R1276P', 'V277D', 'R264C', 'Y87C', 'A532H', 'A95D', 'P648L', 'R181C', 'R659*', 'R1391S', 'S227F', 'M1775K', 'E77K', 'S227A', 'R1391G', 'I122S', 'K52R', 'V217D', 'R640G', 'P169H', 'G85R', 'C47G', 'K382E', 'W308C', 'C64G', 'C24R', 'D194Y', 'X434_splice', 'A1789T', 'A34D', 'R905Q', 'R167W', 'P130S', 'Y163C', 'D258N', 'L230P', 'M1K', 'W1502A', 'F158C', 'R130L', 'T160I', 'P1502L', 'P83L', 'E875G', 'A126G', 'R273L', 'S387Y', 'G20E', 'W117C', 'T1700A', 'R112G', 'G14V', 'R337C', 'A126V', 'A617T', 'H61R', 'D140G', 'H61D', 'M1R', 'G333S', 'D404G', 'A598T', 'M1T', 'R882L', 'A1752P', 'Y65C', 'K745M', 'R177Pfs*126', 'R574fs', 'H412Y', 'D845A', 'F133V', 'C61G', 'R1276Q', 'T576del', 'R80C', 'F133L', 'R373Q', 'G430C', 'N81Y', 'P48R', 'F1088Lfs*5', 'D162G', 'M1_E165DEL', 'C381A', 'D162H', 'V173L', 'A161S', 'G266R', 'V173E', 'Hypermethylation', 'W131G', 'S56I', 'S24F', 'E286K', 'DNA binding domain missense mutations', 'T1365M', 'T117M', 'R482Q', 'C176F', 'R167Q', 'R117G', 'P48L', 'E50K', 'Deletion', 'M1V', 'K78I', 'Y489C', 'R321Q', 'T1324N', 'K78A', 'I47F', 'R100*', 'A889P', 'G186R', 'L180P', 'H597Y', 'I49S', 'R1093*', 'TGFBR1*6A', 'R100T', 'S80R', 'A134D', 'V155A', 'K62R', 'H65Y', 'K83N', 'V665A', 'DNA binding domain deletions', 'G123R', 'F102C', 'G244R', 'G244S', 'P47A', 'H1620R', 'R1594Q', 'X475_splice', 'R320Q', 'C27A', 'L142P', 'E921K', 'L23F', 'T1203K', 'L830R', 'E866K', 'M117I', 'H174R', 'Q367P', 'R15K', 'Y1248F', 'Y510C', 'H41R', 'P214L', 'A1708E', 'A205T', 'G325R', 'L1273F', 'Q1756fs', 'S70fsX93', 'T131A', 'R418G', 'H115R', 'L67P', 'R121Q', 'L57V', 'V155F', 'N71S', 'M134L', 'K289E', 'R175L', 'R175H', 'K1436Q', 'G909R', 'F893L', 'N71K', 'N71I', 'R87P', 'V769E', 'G127E', 'T167A', 'G127N', 'R383*', 'G1788V', 'R711*', 'P336S', 'R87L', 'Y42C', 'P114L', 'P114S', 'R228K', 'M35R', 'E1356G', 'L996F', 'A614D', 'P186S', 'H193P', 'I122L', 'H193N', 'S1715R', 'R213Q', 'Y113*', 'L325F', 'K97M', 'K83E', 'C134W', 'P219S', 'D74N', 'N1125I', 'P278S', 'N235D', 'D74Y', 'N551K', 'E23fs', 'E88K', 'R1627', 'C124N', 'Y220S', 'T131I', 'Y236S', 'T131L', 'Q50P', 'Q395*', 'Y220C', 'C124S', 'C124R', 'Y236D', 'Y236C', 'M18T', 'T82A', 'S65L', 'Q50*', 'H168R', 'N2875K', 'I33del', 'Y846C', 'G1770V', 'S65W', 'P326L', 'D252G', 'K120E', 'R139G', 'D84Y', 'Y69H', 'T401I', 'K120N', 'D84V', 'L158V', 'D84N', 'V128del', 'D84H', 'L97R', 'E580*', 'D84G', 'G101W', 'R133*', 'C1385', 'P480L', 'L410R', 'N564K', 'P96Q', 'Q96P', 'M199del', 'G1232D', 'N198_F199delinsI', 'L112P', 'N564D', 'L112R', 'R248W', 'L63P', 'P40T', 'L345Q', 'V710A', 'P81T', 'L210R', 'Y1853*', 'P151S', 'C696Y', 'P81L', 'G163D', 'R159G', 'P151A', 'A197T', 'F21A', 'Y1853C', 'I32del', 'C136R', 'R156H', 'T844M', 'A60V', 'D537Y', 'C91A', 'L108P', 'R69C', 'D1399Y', 'S1463F', 'C91S', 'C136Y', 'M136R', 'D537E', 'R161G', 'R265C', '385_418del', 'N239S', 'K125L', 'D357Y', 'I668V', 'E160*', 'S35Q', 'T37R', 'K125E', 'S100F', 'K162D', 'R905G', 'I1766S', 'V1075Yfs*2', 'D1810A', 'H123Q', 'V1736A', 'R181H', 'R611Q', 'R611W', 'H123D', 'Q164R', 'R24P', 'R273H', 'R80L', 'P704S', 'R166Q', 'G23D', 'L188Q', 'I559_D560insDKRMNS', 'A126D', 'Promoter Hypermethylation', 'S2G', 'S170R', 'E439del', 'G129E', 'S68W', 'S459F', 'A39P', 'D402Y', 'R2659K', 'E685V', 'S170N', 'G129R', 'F808L', 'D92N', 'E41A', 'P278L', 'F1761S', 'A126S', 'G1809R', 'E1552del', 'P278R', 'E1705A', 'H492R', 'E1705K', 'C569Y', 'R465H', 'Y27S', 'R1343L', 'W714*', 'TEL-RUNX1 Fusion', 'R348*', 'R389*', 'G376R', 'S1655F', 'W345*', 'R905W', 'N1333Gfs*', 'W183A', 'E239A', 'T19N', "3' Deletion", 'P395A', 'R415G', 'C105F', 'R873Q', 'R79P', 'D92A', 'H105R', 'D92G', 'D92E', 'R1699W', 'D92H', 'R1699Q', 'P95L', 'V343E', 'G251C', 'H1686Q', 'F12L', 'N45S', 'E285V', 'R20Q', 'C275S', 'V851A', 'A36P', 'R680*', 'R1625C', 'Y68D', 'R200W', 'G93W', 'E1322*', 'F347L', 'A120S', 'P417A', 'Y174N', 'G478C', 'W719*', 'R978*', 'R497H', 'R515G', 'S425C', 'R1209W', 'T319del', 'Y1414C', 'R1204W', 'R689S', 'E321K', 'R99P', 'K128T', 'R109*', 'K128Q', 'R1204G', 'K128N', 'L704N', 'G1738E', 'Q816*', 'D769A', 'A634V', 'K753A', 'Y371H', 'K753M', 'P525L', 'Y371S', 'A1022E', 'R2336H', 'P173S', 'R177*', 'V391I', 'Truncating Mutations', 'P1675L', 'K379E', 'G1706E', 'C384R', 'D254N', 'L344P', 'N234I', 'V1398D', 'Q233*', 'R361C', 'R1189*', 'H78Q', 'Q429*', 'D408Y', 'S119N', 'C1697R', 'Y16C', 'D408E', 'M117V', 'D408H', 'Q579_L581del', 'R177Q', 'E35*', 'C242S', 'L399V', 'S241Y', 'S46N', 'S241T', 'S46I', 'D92V', 'L1764P', 'H876Q', 'C41Y', 'RUNX1-RUNX1T1 Fusion', 'Y551F', 'D326N', 'Q984K', 'N1026S', 'V343L', 'C39Y', 'P286H', 'L145R', 'C238S', 'S1722F', 'P286R', 'T340A', 'C238F', 'W1718C', 'K459_S460delinsN', 'R287A', 'P95H', 'Y68H', 'E157G', 'V995M', 'R505L', 'L274P', 'V157D', 'Y149D', 'G751R', 'R505C', 'P95S', 'V705M', 'R249W', 'I168F', 'V705E', 'R1200W', 'D2870A', 'N276S', 'R249M', 'D422N']), 5: set(['E31K', 'V1808A', 'V1665M', 'H870R', 'V2908G', 'S478P', 'L209F', 'Y1295A', 'N480del', 'S1140G', 'P1771R', 'E1586G', 'I1250T', 'L128F', 'N372H', 'R441P', 'V294M', 'K39N', 'A1234T', 'Y42C', 'L668F', 'A1789S', 'Q324L', 'D32Y', 'S1172L', 'V714M', 'R571W', 'P130S', 'C1767S', 'K310R', 'E501G', 'P848L', 'A1066V', 'V430M', 'S3660L', 'R1608S', 'V1809A', 'F1662S', 'N1730S', 'T47D', 'D1778N', 'E362H', 'A389T', 'H133Q', 'V509A', 'L49H', 'K975E', 'T1720A', 'G39E', 'E518K', 'R481G', 'R143Q', 'N1236K', 'S384F', 'E1250K', 'V750E', 'A2643G', 'M374V', 'C712R', 'L57del', 'H1402Y', 'Q110R', 'M980T', 'K320E', 'E462G', 'S2670L', 'P33S', 'S1512I', 'P798L', 'A1020V', 'W368H', 'R115L', 'R715G', 'S860L', 'K2950N', 'F1695L', 'A272V', 'V648I', 'S1297del', 'W1782C', 'L390F', 'L1844R', 'G697C', 'R841Q', 'S362L', 'L246V', 'V384D', 'R866C', 'K2729N', 'V777M', 'F384L', 'R496C', 'M1663K', 'K513R', 'C1787S', 'L585I', 'V191D', 'N810Y', 'D1853N', 'P1856S', 'E1346K', 'K618T', 'V191I', 'L147F', 'E143K', 'E1644G', 'W531C', 'R1751Q', 'R170W', 'N1647K', 'S308A', 'N550H', 'D631G', 'L1267S', 'H643D', 'I391M', 'R2973C', 'E190K', 'E5K', 'M1008I', 'S1290A', 'T352M', 'R1095H', 'I31M', 'D2665G', 'I347M', 'D631A', 'Q1500P', 'Q1826H', 'P387S', 'N653H', 'C582F', 'S1613C', 'N319D', 'R1589P', 'H701P', 'D1778Y', 'R420H', 'K45Q', 'N382H', 'G464E', 'A1830T', 'N319T', 'D1778G', 'P1806A', 'P1771L', 'R1589H', 'S462Y', 'T123A', 'A1669S', 'G857A', 'V755I', 'P1614S', 'T485K', 'S1301R', 'E627D', 'K82T', 'S1613G', 'C360R', 'A1823T', 'T730S', 'R217C', 'A263V', 'E1794D', 'H1805P', 'I130M', 'I491M', 'I463S', 'E1286V', 'P1637L', 'D245V', 'Q120E', 'M1783I', 'S330A', 'E1682V', 'E475K', 'I219V', 'E466K', 'P123M', 'T338I', 'L2865V', 'H297N', 'L358R', 'E541K', 'D646Y', 'R1262A', 'T1151M', 'T1720I', 'E1682K', 'I326V', 'P1812R', 'Q689R', 'R1563S', 'S1497A', 'E116K', 'T1343I', 'P428L', 'L165P', 'V1534M', 'I99M', 'S273C', 'L272F', 'D65N', 'E1060A', 'E1214K', 'A8S', 'E2856A', 'A727V', 'E1282V', 'P42T', 'R592H', 'E120Q', 'V677I', 'L507P', 'M1783L', 'S1473P', 'N1819S', 'S1101N', 'R1726G', 'V716M', 'D1349H', 'T529I', 'A500T', 'I2285V', 'F346V', 'P1859R', 'T529N', 'T529M', 'P25L', 'H1421Y', 'D1692N', 'V1378I', 'H1862L', 'S760A', 'F354L', 'Q1785H', 'S1486C', 'R2520Q', 'Q60K', 'M1775V', 'P1819S', 'D1733G']), 6: set(['C1265S', 'Amplification', 'M887I', 'S1424C', 'E142D', 'F1524V', 'L2396F', 'N1102Y', 'H94Y', 'E135K', 'G785S', 'K508M', 'V894I', 'N116H', 'S4F', 'E812K', 'R498L', 'K375A', 'W398V', 'N1819Y', 'R276W', 'I26N', 'D1280V', 'D96N', 'S4P', 'E542G', 'I15T', 'R1190W', 'P380R', 'W2626C', 'K413E', 'M587V', 'D1818G', 'V839G', 'P654L', 'D1546N', 'S921R', 'V1643A', 'R2842H', 'C24Y', 'Y803N', 'K376N', 'R71G', 'Y53H', 'L1951R', 'D3170G', 'K1299E', 'M2676T', 'T3349A', 'N387P', 'K38N', 'H2074N', 'G248V', 'P375S', 'R2502H', 'P1856T', 'S1002R', 'E518A', 'R2502C', 'G356A', 'I638F', 'Y513A', 'M1411T', 'D603G', 'L28P', 'A1131T', 'R2304C', 'E78K', 'T2250A', 'M1400V', 'P655R', 'H1746Q', 'S59R', 'K2472T', 'L2721H', 'W563L', 'T1852S', 'H1966Y', 'Y179C', 'D935N', 'D1352Y', 'E221Q', 'L1019V', 'D737N', 'T582P', 'D1344H', 'S31R', 'R2318Q', 'V506A', 'S222D', 'L785F', 'N2113S', 'D806H', 'D1384V', 'I1018W', 'Y1045W', 'A1170V', 'S1651P', 'R258H', 'Q201H', 'Q58L', 'L622H', 'R725C', 'I1018F', 'R970C', 'R258C', 'L1026F', 'G60D', 'Deletion', 'M299I', 'T771R', 'L63F', 'Y1045*', 'C630Y', 'E719G', 'G216R', 'E719K', 'N517S', 'W509R', 'Y1294A', 'I89T', 'Y3092C', 'E60L', 'N56T', 'Q2384K', 'W1038C', 'A75P', 'E172K', 'T3211K', 'D544H', 'T630I', 'K56N', 'L582F', 'N588D', 'R683T', 'K753E', 'S1670A', 'V84L', 'K607T', 'R18H', 'R1758G', 'K45T', 'G701S', 'R2418G', 'I1349T', 'I157T', 'K45N', 'L559R', 'T654M', 'D1778H', 'S783P', 'L770V', 'V1804A', 'R561C', 'Truncating Mutations', 'V1804D', 'L82P', 'Y931C', 'A2770T', 'A2351G', 'D2312V', 'E29V', 'G602R', 'M1783T', 'S326R', 'T2681R', 'H1918Y', 'M1652T', 'L115R', 'K2411T', 'R838Q', 'E501K', 'E124Q', 'G1194D', 'K1434I', 'N375S', 'E1735K', 'M784V', 'Q1396R', 'N1878K', 'K1690N', 'T2722R', 'F212Y', 'N987I', 'Y3098H', 'S869L', 'G464A', 'V3079I', 'R428A', 'R3052Q', 'S121C', 'K50L', 'G936R', 'L929S', 'P1776S', 'L2106P', 'I68K', 'V1075F', 'E636K', 'N1228D', 'V211D', 'T654I', 'E717K', 'L584F', 'P1812S', 'V109G', 'K50E', 'S1039F', 'T77M', 'S1733F', 'V465M', 'R669C', 'S308A', 'S1025C', 'K101M', 'D1420Y', 'L915M', 'S72R', 'I111N', 'G2274V', 'Y40A', 'V1306I', 'P168T', 'G42R', 'L1904V', 'R659P', 'C528S', 'S226D', 'G1529R', 'Q2858R', 'R961W', 'A2717S', 'Q12Y', 'M1293A', 'V248D', 'V344G', 'F1061W', 'S247Y', 'T1354M', 'M18K', 'P648S', 'H115N', 'A148T', 'L749P', 'I90T', 'F79S', 'R2888C', 'K65M', 'S247F', 'I89N', 'C1365Y', 'N2436I', 'V2969M', 'H1382Y', 'R164Q', 'R661W', 'C39S', 'S1303N', 'C39R', 'C554W', 'L983F', 'S2483N', 'V11A', 'Y406H', 'MYC-nick']), 7: set(['I103N', 'A767_V769dup', 'Amplification', 'G1123D', 'K601N', 'Y62C', 'IGH-FGFR3 Fusion', 'V270A', 'V544_L545insAVLVLLVIVIISLI', 'S646F', 'D399N', 'E49K', 'Q531*', 'N233Y', 'SRGAP3-RAF1 Fusion', 'X963_splice', 'G863D', 'T80R', 'E804G', 'E79Q', 'I2500M', 'V564I', 'E746_A750delinsQ', 'M391R', 'Exon 9 mutations', 'M269R', 'K656E', 'R776H', 'L1195V', 'D770_N771insNPG', 'A750_E758del', 'T80K', 'E203K', 'E2419K', 'A459V', 'Y1235D', 'V659E', 'V555_L576del', 'S32I', 'Q1064R', 'MAGI3-AKT3 Fusion', 'MPRIP-NTRK1 Fusion', 'A502_Y503dup', 'N676K', 'S729C', 'H697Y', 'F594L', 'G31V', 'F958S', 'G31R', 'D289del', 'V1188L', 'C1156F', 'S37Y', 'A598V', 'R834Q', 'C121S', 'EP300-MOZ Fusion', 'N535K', 'S259P', 'R49L', 'N540S', 'C1156Y', 'F460L', 'A113_splice', 'R634W', 'D61Y', 'F53S', 'p61BRAF', 'V1676D', 'R206H', 'BTBD1-NTRK3 Fusion', 'V1576E', 'M1250T', 'C450_K451insMIEWMI', 'V1092I', 'P449_L455del', 'KIF5B-ALK Fusion', 'D61N', 'V555_V559del', 'G776S', 'F53L', 'D595V', 'C618R', 'P124Q', 'P124S', 'C250T', 'D835del', 'L251P', 'W603_E604insDREYEYDLKW', 'D717V', 'S614R', 'A883F', 'P124L', 'A18D', 'D153V', 'P34L', 'R683S', 'V32G', 'P2273S', 'G464R', 'R47Q', 'M504V', 'G466V', 'E40K', 'D600_L601insFREYEYD', 'S501_A502dup', '963_D1010splice', 'Y801H', 'G778_P780dup', 'C630R', 'G466E', 'R342Q', 'N1100Y', 'A750_E758delinsP', 'T599_V600insV', 'A232V', 'G375C', 'A59G', 'EZR-ERBB4 Fusion', 'D770_N771insD', 'IGH-BCL2 Fusion', 'L910P', 'I1616T', 'E734Q', 'T992I', 'Q2223K', 'C228T', 'RET-CCDC6 Fusion', 'K59del', 'T241M', 'K117R', 'L46F', 'E586K', 'Y823D', 'L1152R', 'FGFR1OP1-FGFR1 Fusion', 'L1152P', 'V742A', 'L747_T751delinsP', 'T790M', 'Y285C', 'EGFR-KDD', 'W24R', 'T80A', 'P253R', 'S45Y', 'K525E', 'P539R', 'S45A', 'S45F', 'L702H', 'T73I', 'Q56P', 'V321M', 'K539L', 'G1123S', 'T1977R', 'R108H', 'M774_A775insAYVM', 'V14I', 'R108K', 'D325A', 'R776C', 'L601_K602insREYEYDL', 'CEP110-FGFR1 Fusion', 'R174*', 'Y578C', 'T1977K', 'M1328I', 'T1977I', 'H694R', '550_592del', 'V104M', 'FGFR2-BICC1 Fusion', 'E79K', 'D419del', 'F522C', 'T41A', 'S37F', 'K1062M', 'S37C', 'S840_N841insGS', 'S37A', 'D835N', 'L611_E612insCSSDNEYFYVDFREYEYDLKWEFPRENL', 'S2215F', '256_286trunc', 'ATF7IP-JAK2 Fusion', 'R886W', 'E144K', 'G469A', 'I28T', 'S2215Y', 'G13C', 'CD74-NTRK1 Fusion', 'G13D', 'E330K', 'L1678P', 'Y849S', 'M90I', 'E452K', 'C49Y', 'E76A', 'Y849C', 'S1206R', 'Exon 20 insertions', 'L1593P', 'E2014K', 'C634Y', 'G31A', 'T283A', 'AGK-BRAF Fusion', 'C634R', 'K745_A750del', 'S1206C', 'M232T', 'F1174I', 'F71I', 'S371C', 'G505S', 'FGFR3 - BAIAP2L1 Fusion', 'T58I', 'G881D', 'I538V', 'I767M', 'G1201E', 'L536H', 'A763_Y764insFQEA', 'L536R', 'F1200I', 'L536P', 'V843I', 'ESR1-YAP1 Fusion', 'F28L', 'Q546K', 'E14*', 'L1600P', 'D67N', 'E632_L633del', 'V658F', 'Q546E', 'V658A', 'Q79K', 'Q276*', 'L78T', 'C47S', 'G776delinsVC', 'S33Y', 'Q546R', 'F958V', 'Q79E', 'G810S', 'K335I', 'F691L', 'E746_S752delinsI', 'ETV6-FLT3 Fusion', 'R38H', 'E746_S752delinsA', 'L493P', 'G853D', 'Y375_K455del', 'A750P', 'G116S', 'R342W', 'N848K', 'G328V', 'D814V', 'PAX8-PPAR? Fusion', 'K935I', 'I448V', 'D594A', 'S387N', 'Overexpression', 'V555M', 'G118D', 'D594N', 'N561D', 'W731L', 'D661Y', 'ETV6-NTRK3 Fusion', 'LRIG3-ROS1 Fusion', 'E1799K', 'S427G', 'S222D', 'D835H', 'S768_V769insVAS', 'P287A', 'ESRP1-RAF1 Fusion', 'Y1003F', 'N841I', 'ROS1-CD74 Fusion', 'Exon 19 deletion/insertion', 'T670I', 'I1171T', 'F129L', 'I1171N', 'P287T', 'S584L', 'A1065T', 'K398A', 'TRKAIII Splice Variant', 'L2427R', 'ESR1-CCDC170 Fusion', 'C456_R481del', 'C609Y', 'EGFRvII', 'ETV6-PDGFRB Fusion', 'H1047R', 'A146V', 'FUS-ERG Fusion', 'L861Q', 'L861R', '422_605trunc', 'P1139S', 'EGFRvIV', 'R552S', 'ZNF198-FGFR1 Fusion', 'V2006I', 'K650E', 'V777L', 'P551_E554del', 'G774V', 'G380R', 'G1202R', 'K57T', 'S653C', 'L866M', 'EWSR1-ERG Fusion', 'W557_K558del', 'G60E', 'L1122V', 'Exon 1 mutations', 'S1206Y', 'C634S', 'K648N', 'FGFR2-AHCYL1 Fusion', 'T1151dup', 'R170W', 'RANBP2-ALK Fusion', 'R88Q', 'C481S', '560_561insER', 'T263P', 'KANK1-PDGFRB Fusion', 'L755P', 'F1174L', 'G284R', 'S243N', 'Y599_D600insPAPQIMSTSTLISENMNIA', 'I1680N', 'K117N', 'Y1230H', 'EBF1-PDGFRB Fusion', 'T74P', 'NPM-ALK Fusion', 'Y1230C', 'A1701P', 'N53A', 'W80A', 'E1021K', 'L783F', 'L747_E749del', 'E1384K', 'STRN-PDGFRA Fusion', 'G253C', 'D401N', 'G81D', 'KIF5B-PDGFRA Fusion', 'Y1278S', 'I843del', 'L412F', 'T725M', 'D106A', 'F156L', 'G81R', 'G81S', 'T34_A289del', 'F31I', 'TMPRSS2-ERG Fusion', 'A1519T', 'Y791F', 'T875N', 'L19F', 'SLC34A2-ROS1 Fusion', 'V600D_K601insFGLAT', 'V559G', 'N334K', 'L2230V', 'D839G', 'F351L', 'P449T', 'L1574P', 'K550_K558del', 'N542_E543del', 'P262H', 'D816V', 'E565G', 'E311_K312del', 'H845Y', 'N549K', 'T599R', 'W257G', 'Y373C', 'G724S', 'X1007_splice', 'N549S', 'FIG-ROS1 Fusion', 'L747_A750delinsP', 'G60R', 'N549T', 'L576del', 'Y1035C', 'N126D', 'P306H', 'S476I', 'S23R', 'K642E', 'CUX1-FGFR1 Fusion', 'L747_A750del', 'V471F', 'E746_T751delinsA', 'E598_Y599insGLVQVTGSSDNEYFYVDFREYE', 'IGL-MYC Fusion', 'S250P', 'S257W', 'R506_K507insVLR', 'R15S', 'D661V', 'G370C', 'V1070E', 'Y537C', 'L536Q', 'R844H', 'BCR-PDGFRA Fusion', 'V45L', 'Y537S', 'S451E', 'S451F', 'C18Y', 'L30F', 'N1044K', 'F537_K539delinsL', 'A11_G12insGA', 'TMPRSS2-ETV1 Fusion', 'L576P', 'A171V', 'EWSR1-FLI1 Fusion', 'A864T', '981_1028splice', 'G776delinsLC', 'D842_M844del', 'V35M', 'E868G', 'E872K', 'K601E', 'S33A', 'M2327I', 'EP300-MLL Fusion', 'Q579R', 'PCM1-JAK2 Fusion', 'S768_D770dup', 'L617F', 'EGFRvV', 'T389K', 'V559D', 'K601Q', 'L239R', 'I867S', 'P44A', 'X1009_splice', 'T417_D419delinsI', 'BCR-JAK2 Fusion', 'K499E', 'D842Y', 'L1460P', 'S259A', 'S259F', 'D842V', 'F161L', 'E746_A750del', 'D842I', 'R203C', 'E598_Y599insDVDFREYE', 'A72V', 'N505I', 'Q58_E62del', 'A72S', 'W557R', 'A339V', 'K22A', 'E545Q', 'NFASC-NTRK1 Fusion', 'D835E', 'E746_T751insIP', 'D835A', 'L265P', 'MLL-TET1 Fusion', 'G1079D', 'E545G', 'D835Y', 'E545A', 'L747_T751del', 'E545K', 'G322S', 'SND1-BRAF Fusion', 'AKAP9-BRAF Fusion', 'R879C', 'D594V', 'R841K', 'Y35C', 'N771_H773dup', 'L52R', 'K111E', 'Q546P', 'Y35H', 'Y35N', 'D29H', 'CD74-ROS1 Fusion', 'EWSR1-DDIT3 Fusion', 'A723D', 'SEC16A1-NOTCH1 Fusion', 'HIP1-PDGFRB Fusion', 'T599dup', 'E709_T710delinsD', 'H36P', 'EWSR1-ETV1 Fusion', 'PAX5-JAK2 Fusion', 'Q22E', 'S65N', 'W515L', 'S784F', 'FAM131B-BRAF Fusion', 'F1592S', 'F1245C', 'E836K', 'K1026E', 'D29Y', 'EWSR1-YY1 Fusion', 'W742C', 'F1245V', 'D83V', 'C1483F', 'E552K', 'N546K', 'F123I', 'D32N', 'FLT3 internal tandem duplications', 'C1483R', 'L790F', 'C1483W', 'V769_D770insGVV', 'D770_P772dup', 'C1483Y', 'S502T', 'F1888L', 'Y599_D600insEYEYEYEY', 'F1888I', 'D770_N771insVDSVDNP', 'E846K', 'PAPSS1-BRAF Fusion', 'R93W', 'A883T', 'L1240V', 'Delta-NTRK1 Fusion', 'R683G', 'A77P', 'A77S', 'R683K', 'H1094R', 'M1043I', 'P573_D579del', 'K509I', 'E884K', 'V654A', 'EZR-ROS1 Fusion', 'H1094Y', 'T507K', 'E731K', 'L63V', 'Exon 20 insertion', 'K147E', 'Q61L', 'H773dup', 'Q61K', 'R248C', 'LMNA-NTRK1 Fusion', 'R724H', 'F594_R595insSDNEYFYVDF', 'G466A', 'H1094L', 'A728V', 'L858R', 'K550_V555delinsI', 'Y64A', 'L747_P753del', 'L469V', 'S249C', 'N1178H', 'V561D', 'L1433S', 'Q22K', 'H773L', 'G373R', 'T599I', 'E380Q', 'V559_V560del', 'Q209L', 'G464V', 'D846Y', 'L321A', 'T244_I245insCPT', 'C620R', 'G1128A', 'I744_K745delinsKIPVAI', 'BCR-ABL1 Fusion', 'T195I', 'IGK-MYC Fusion', 'G1128S', 'IGH-MYC Fusion', 'E76K', 'N1068fs*4', 'I289M', 'N387K', 'S566_E571delinsR', 'I1616N', 'BCR-FGFR1 Fusion', 'CCND1-IGH Fusion', 'Y599_D600insSTDNEYFYVDFREYEY', 'A146T', 'Exon 11 mutations', 'D594E', 'V560G', 'R24C', 'TRIM24-BRAF Fusion', 'R1192P', 'L485_P490del', 'L838V', 'G106V', 'ETV6-PDGFRA Fusion', 'Y253F', 'D350G', 'H773insLGNP', 'Y63C', 'Exon 19 deletion', 'D1067Y', 'D1067V', 'EWSR1-WT1 Fusion', 'V60M', 'N659S', 'R552G', 'S267_D273dup', 'A1459P', 'N217I', 'Y842C', 'R544W', 'V536E', 'D1067A', 'L747P', 'G596R', 'ERLIN2?FGFR1 Fusion', 'S703I', 'FGFR2-FAM76A Fusion', 'Y652H', 'K189N', 'D587H', 'Y640F', 'I35S', 'G596C', 'CPEB1-NTRK3 Fusion', 'D842_H845del', 'R306S', 'R23A', 'E626K', 'T341P', 'T785A', 'C334S', 'T286A', 'L726I', 'T286I', 'T878S', 'L2431P', 'E322K', 'S33C', 'S891A', 'TPR-NTRK1 Fusion', 'F468C', 'T878A', 'D32H', 'G719S', 'T1025S', 'E542V', 'D32A', 'V600E', 'E612_F613insGYVDFREYEYDLKWEFRPRENLEF', 'H878Y', 'M1043V', 'G719A', 'H650Q', 'V561_I562insER', 'V592A', 'E102_I103del', 'N826S', 'E542K', 'C157Y', 'Y553_Q556del', 'R324L', 'L833V', 'H773_V774insH', 'V271A', 'C620Y', 'H538Q', 'V600K', 'S45del', 'V600M', 'G796S', 'Y32S', 'V1180L', 'K57E', 'A246P', 'V600D', 'V600G', 'G776_V777insYVMA', 'I836del', 'Y591D', 'S310Y', 'K641R', 'C443Y', 'S310F', 'I408V', 'Q56_V60del', 'V579A', 'FGFR1-TACC1 Fusion', 'BRD4-NUT Fusion', 'Y375C', 'V550E', 'G660D', 'S214C', 'Y139D', 'Y772_A775dup', 'Y555C', 'D816Y', 'D816F', 'D816G', 'G34E', 'I843_D846del', 'D816N', 'A151T', 'S214T', 'D816H', 'W383R', 'M535I', 'TFG-NTRK1 Fusion', 'K5N', 'C420R', 'E1210K', 'E525K', 'EGFR-RAD51 Fusion', 'Y599_D600insGLYVDFREYEY', 'Y130C', 'S846I', 'D641N', 'R201C', 'FGFR2-CCDC6 Fusion', 'I2500F', 'D641G', 'Y472C', 'E1099K', 'Exon 20 insertions/deletions', 'Q809R', 'V560D', 'S45P', 'R1598P', 'T417_D419delinsRG', 'FGFR2-TACC3 Fusion', 'D323H', 'W24S', 'TPM3-NTRK1 Fusion', 'X1008_splice', 'P29S', 'E321G', 'I204T', 'Q72L', 'H355M', 'P29L', 'S123T', 'E563K', 'N659K', 'G34V', 'KDR-PDGFRA Fusion', 'Y572C', 'E82G', 'D769H', 'V804M', 'C456_N468del', 'A634D', 'R748G', 'AR-V7', 'S252W', 'D769Y', 'N659R', 'H410R', 'D60N', 'A546D', 'L1596H', 'FGFR2?PPHLN1 Fusion', 'G325A', 'L755S', 'E317K', 'E207K', 'G12C', 'H1047Y', 'W535L', 'TEL-JAK2 Fusion', 'N822I', 'N822H', 'V842I', 'W24C', 'KIF5B-RET Fusion', 'L485F', 'Q61H', 'G735S', 'G2430A', 'P577S', 'Q347_A348del', 'P551_V555del', 'F877L', 'K558delinsNP', 'A391E', 'V422del', 'V559del', 'L1196M', 'C125S', 'H1047L', 'K659N', 'W742L', 'L1196Q', 'R368C', 'F57C', 'V600R', 'RANBP1-ALK Fusion', 'S768I', 'E127G', 'R669G', 'E168D', 'E365K', 'Y418_D419del', 'K550_W557del', 'G1269S', 'Q61R', 'H845_N848delinsP', 'L225LI', 'E17K', 'KIAA1549-BRAF Fusion', 'P34R', 'L617M', 'I1170S', 'G1269A', 'L747_P753delinsS', 'I1170N', 'ARv567es', 'Promoter Mutations', 'F590G', 'EGFRvIII', 'R201H', 'L1198F', 'G12V', 'H875Y', 'G12R', 'G12S', 'C311R', 'SDC4-ROS1 Fusion', 'L1198P', 'E709A', 'W290_I291delinsC', 'G12D', 'Exon 19 insertion', 'E709K', 'G12A', 'C611Y', 'L597V', 'C382R', 'V597A', 'L597Q', 'E746_T751delinsVA', 'D572A', 'G309A', 'G309E', 'EWSR1-NR4A3 Fusion']), 8: set(['ZC3H7B-BCOR Fusion', 'G35R', 'K700R', 'G35V', 'Fusions', 'G311D', 'K28M', 'K179M', 'BCOR-RARA Fusion', 'S492R', 'DNMT3B7', 'HMGA2-RAD51B Fusion', 'R132Q', 'E40N', 'R172M', 'T798I', 'K590R', 'R172G', 'BCOR-CCNB3 Fusion']), 9: set(['R132G', 'H662R', 'H662Q', 'R132H', 'A677G', 'Q157P', 'R625L', 'S34Y', 'R625H', 'R132C', 'R625D', 'R172S', 'R625G', 'CIC-DUX4 Fusion', 'R140Q', 'G161V', 'R625C', 'C135Y', 'R172K', 'R339W', 'S34F', 'K666M', 'K666N', 'G44S', 'R100A', 'K666R', 'K666T', 'R957Q', 'E622D', 'Y646S', 'Y646H', 'K700E', 'K181M', 'Y646N', 'Y646C', 'Y646F', 'E622Q'])}
In [37]:
unique_var_count={}

for k,v in sorted(variation_class.items()):
    unique_var_count[k]=len(v)
print unique_var_count
{1: 423, 2: 399, 3: 89, 4: 669, 5: 242, 6: 265, 7: 889, 8: 19, 9: 37}
In [38]:
plt.figure()
plt.bar(unique_var_count.keys(),unique_var_count.values())
plt.title('Number of Unique variation per Class')
plt.show()
In [39]:
unique_gene_variation={}
for gene in unique_genes:
    data=train.loc[train.Gene==gene]
    unique_gene_variation[gene]=data.Variation.unique()
print unique_gene_variation
print len(unique_gene_variation)
{'ERRFI1': array(['Deletion'], dtype=object), 'ASXL2': array(['Truncating Mutations'], dtype=object), 'STK11': array(['L67P', 'M136R', 'G163D', 'W308C', 'K78A', 'D194Y', 'R304W',
       'Deletion', 'Truncating Mutations', 'H174R', 'A205T', 'K78I',
       'F354L'], dtype=object), 'MEN1': array(['Deletion'], dtype=object), 'FAM58A': array(['Truncating Mutations'], dtype=object), 'AKT1': array(['E267G', 'L321A', 'Q59E', 'E40K', 'K179M', 'L362R', 'V270A',
       'R370C', 'L52R', 'E49K', 'E17K', 'Q79E', 'G311D', 'L78T', 'Q79K',
       'V271A', 'P42T', 'K39N', 'W80A', 'D32Y', 'T195I', 'D325A', 'K189N',
       'N53A', 'D323H', 'R23A', 'L202F', 'C77F'], dtype=object), 'AKT2': array(['D32H', 'K181M', 'I289M', 'R170W', 'R368C', 'G161V', 'Fusions',
       'H355M', 'Amplification', 'E17K', 'D399N'], dtype=object), 'AKT3': array(['MAGI3-AKT3 Fusion', 'E17K', 'Amplification', 'Fusions'], dtype=object), 'ARAF': array(['N217I', 'S214A', 'S214T', 'Q347_A348del', 'F351L', 'S214C', 'G322S'], dtype=object), 'MLH1': array(['L550P', 'R265S', 'R100*', 'M587V', 'A111P', 'P654L', 'R265C',
       'R659P', 'G101S', 'V716M', 'E172K', 'V506A', 'R659*', 'Q689R',
       'R755S', 'T117M', 'G67R', 'W714*', 'R217C', 'I219V', 'R725C',
       'P648S', 'P648L', 'Truncating Mutations', 'A120S', 'V384D', 'R487*',
       'G67W', 'L622H', 'L582F', 'K618T', 'L559R', 'R659L', 'L749P', 'T82A'], dtype=object), 'IKBKE': array(['Amplification'], dtype=object), 'NKX2-1': array(['IGH-NKX2 Fusion', 'TRB-NKX2-1 Fusion', 'Amplification', 'Fusions',
       'TRA-NKX2-1 Fusion', 'A339V'], dtype=object), 'ROS1': array(['S1986F', 'Amplification', 'D2033N', 'S1986Y', 'EZR-ROS1 Fusion',
       'L1947R', 'ROS1-CD74 Fusion', 'LIMA1-ROS1 Fusion', 'V2098I',
       'G2032R', 'G1971E', 'KDELR2-ROS1 Fusion', 'SLC34A2-ROS1 Fusion',
       'L1951R', 'MSN-ROS1 Fusion', 'Fusions', 'LRIG3-ROS1 Fusion',
       'SDC4-ROS1 Fusion', 'TFG-ROS1 Fusion', 'G2101A',
       'CEP85L-ROS1 Fusion', 'CD74-ROS1 Fusion', 'C2060G',
       'CCDC6-ROS1 Fusion', 'E1935G', 'FIG-ROS1 Fusion'], dtype=object), 'AXL': array(['Amplification'], dtype=object), 'RARA': array(['R276W'], dtype=object), 'KMT2D': array(['Deletion', 'Truncating Mutations'], dtype=object), 'EP300': array(['Fusions', 'Y1414C', 'R1627', 'C1385', 'EP300-MOZ Fusion', 'D1399Y',
       'EP300-MLL Fusion', 'Deletion', 'P1502L'], dtype=object), 'TSC1': array(['M224R', 'F158C', 'K82T', 'S35Q', 'L117P', 'H68R', 'R246K', 'T417I',
       'G305W', 'N198_F199delinsI', 'V220F', 'H105R',
       'Truncating Mutations', 'G305R', 'V128del', 'H206D', 'G1035S',
       'L191H', 'L180P', 'R1097H'], dtype=object), 'PIK3CA': array(['R38H', 'I31M', 'G106D', 'K111E', 'R88Q', 'R108H', 'M1043I',
       'E542G', 'G106V', 'D350G', 'E545Q', 'V344A', 'E78K', 'N1068fs*4',
       'R93Q', 'N1044K', 'E116K', 'V344G', 'T1025S', 'C420R', 'K111N',
       'Q60K', 'N345I', 'E545K', 'E365K', 'E81K', 'Q546K', 'Q546P',
       'E542K', 'H701P', 'R93W', 'H1047R', 'Q546E', 'A1066V',
       'Amplification', 'P449_L455del', 'H1047L', 'P449T', 'P539R',
       'A1020V', 'G914R', 'E542Q', 'Q546R', 'E579K', 'D1029Y', 'R115L',
       'G106_R108del', 'G118D', 'E545A', 'H1047Y', 'E453A', 'C378R',
       'E545G', 'I391M', 'E542V', 'M1043V'], dtype=object), 'PIK3CB': array(['N553S', 'E552K', 'ACPP-PIK3CB Fusion', 'R48W', 'Fusions', 'D1067V',
       'D1067Y', 'E633K', 'E1051K', 'D1067A'], dtype=object), 'MAP3K1': array(['T1481fs', 'Truncating Mutations', 'E1286V', 'T779fs'], dtype=object), 'EIF1AX': array(['A113_splice'], dtype=object), 'INPP4B': array(['Truncating Mutations'], dtype=object), 'WHSC1L1': array(['Amplification'], dtype=object), 'GATA3': array(['Truncating Mutations', 'Amplification', 'R177Pfs*126'], dtype=object), 'FGFR2': array(['R251Q', 'K517R', 'E731K', 'Y375C', 'K659N', 'Fusions', 'T730S',
       'FGFR2-MGEA5 Fusion', 'V755I', 'G271E', 'L770V', 'R203C', 'E565G',
       'FGFR2-AHCYL1 Fusion', 'A648T', 'N549S', 'P253R', 'G701S', 'T341P',
       'FGFR2?PPHLN1 Fusion', 'N549K', 'S252W', 'FGFR2-BICC1 Fusion',
       'V248D', 'K526E', 'L617F', 'I642V', 'M535I', 'K525E', 'E636K',
       'FGFR2-TACC3 Fusion', 'FGFR2-KIAA1967 Fusion', 'S267_D273dup',
       'C278F', 'A389T', 'L617M', 'W290_I291delinsC', 'C382R', 'K641N',
       'FGFR2-FAM76A Fusion', 'FGFR2-CCDC6 Fusion', 'E219K', 'K641R',
       'K310R', 'D101Y', 'N549T', 'M391R', 'V564I', 'K659E', 'E475K'], dtype=object), 'HNF1A': array(['T196A', 'Truncating Mutations', 'P291Qfs*51'], dtype=object), 'CDH1': array(['R732Q', 'Deletion', 'A617T', 'D254N', 'A634V', 'A298T', 'T340A',
       'D257N', 'Epigenetic Silencing'], dtype=object), 'PIK3R3': array(['Truncating Mutations'], dtype=object), 'MYC': array(['T73I', 'PVT1-MYC Fusion', 'IGL-MYC Fusion', 'MYC-nick', 'Fusions',
       'IGH-MYC Fusion', 'IGK-MYC Fusion', 'Amplification'], dtype=object), 'HIST1H1C': array(['S102F'], dtype=object), 'PIK3R2': array(['G373R', 'K376N', 'Amplification', 'N561D', 'D557H', 'A727V',
       'A171V', 'S273C'], dtype=object), 'BRCA1': array(['E1735K', 'R1753T', 'C44Y', 'R1835P', 'Truncating Mutations',
       'G1803A', 'A1830T', 'G1743R', 'I68K', 'C64G', 'S4P', 'K50L',
       'Y105C', 'L63F', 'L1407P', 'L1657P', 'F79S', 'D1778H', 'D1546N',
       'M1689T', 'D96N', 'W1837C', 'Deletion', 'P1856S', 'F1761I',
       'A1823T', 'I1807S', 'K38N', 'S1651P', 'L246V', 'V191I', 'T37R',
       'D1818G', 'S72R', 'Y179C', 'C39Y', 'K45N', 'L52F', 'H1746N',
       'P1749R', 'H1746Q', 'I21V', 'W1718C', 'R170W', 'T77M', 'V1713A',
       'V1833M', 'S1101N', 'T1691I', 'L147F', 'H1421Y', 'E362H', 'R170Q',
       'S1715C', 'E1794D', 'L358R', 'S308A', 'V1741G', 'R1699L', 'V1804A',
       'E23fs', 'F1734S', 'C61G', 'D245V', 'P142H', 'L1705P', 'H1686R',
       'P1776S', 'M1783T', 'S1841R', 'V1534M', 'W1782C', 'G1706A',
       'G1788V', 'W1837G', 'A1708E', 'S1512I', 'M1783I', 'E1682V',
       'M1783L', 'G1706E', 'R496C', 'S153R', 'N810Y', 'K1702E', 'A1789S',
       'R866C', 'V11A', 'D1739V', 'M1775E', 'D67Y', 'D1344H', 'V271L',
       'S1841N', 'F1761S', 'E1586G', 'S1841A', 'I89N', 'L82P', 'P798L',
       'E29V', 'M1775K', 'E1060A', 'I90T', 'I15T', 'V1736G', 'L1854P',
       'E1836K', 'F1704S', 'P1812S', 'H1402Y', 'N550H', 'Q1756fs',
       'G1738R', 'S1497A', 'N1647K', 'Q1785H', 'H41R', 'R1589P', 'L165P',
       'C24R', 'L1844R', 'C1767S', 'K65M', 'V1809A', 'Y1703H', 'C27A',
       'S1655F', 'N1819Y', 'S1486C', 'P1859R', 'Y1853*', 'E1682K', 'K56N',
       'S1651F', 'V1714G', 'D1778N', 'Y1703S', 'A1669S', 'W1837R',
       'V1833E', 'P1806A', 'M18T', 'M1411T', 'S1140G', 'Y1853C', 'T1700A',
       'T1852S', 'H1805P', 'D1778Y', 'T1685A', 'D1733G', 'Q1811R',
       'D1739G', 'S1297del', 'I89T', 'C1697R', 'L1764P', 'P1637L',
       'G1770V', 'A1752V', 'G1656D', 'G1763V', 'I31M', 'V1688del',
       'D1739Y', 'E1660G', 'S1715R', 'F1662S', 'M1628V', 'S186Y', 'S4F',
       'R1699W', 'L28P', 'N132K', 'V1838E', 'C24Y', 'E60L', 'G1738E',
       'G1788D', 'S59R', 'V1809F', 'M1689R', 'D1739E', 'M1400V', 'M1008I',
       'E1282V', 'E1214K', 'V1665M', 'C39R', 'M1775V', 'R1589H', 'I26N',
       'N1819S', 'V1808A', 'H1862L', 'T1720I', 'M1R', 'T47D', 'P1771L',
       'V1736A', 'C1787S', 'V1810G', 'N1236K', 'R1699Q', 'S1722F',
       'A1789T', 'R1758G', 'K45Q', 'D1692N', 'I1766S', 'L1267S', 'E143K',
       'T1720A', 'L1780P', 'I42V', 'Q12Y', 'W1718L', 'P1856T', 'R1751Q',
       'A1752P', 'R1726G', 'V1653M', 'N1730S', 'E1644G', 'H1686Q',
       'Q1826H', 'R71G', 'S1164I', 'V1804D', 'S1301R', 'C44F', 'V191D',
       'Q1756C', 'R841Q', 'M1652K', 'R1751P', 'C47G', 'C360R', 'M1663K',
       'S1715N', 'K50E', 'A1708V', 'E1250K', 'S1613C', 'L668F', 'P1614S',
       'A1843P', 'C39S', 'P1812R', 'V1378I', 'P1771R', 'T1685I', 'T1691K',
       'F1695L', 'K45T', 'S1473P', 'M1652T', 'E1346K', 'R133H', 'A1843T',
       'D1778G', 'S1613G', 'M18K'], dtype=object), 'ERBB3': array(['S846I', 'A1131T', 'G284R', 'P262H', 'V714M', 'T389K',
       'Amplification', 'Q809R', 'R669C', 'A232V', 'V104M'], dtype=object), 'IGF1R': array(['M1255I', 'A1374V', 'G1125A'], dtype=object), 'BRCA2': array(['P168T', 'R2336P', 'T2250A', 'A2351G', 'L2396F', 'M784V', 'V894I',
       'G2748D', 'R2502C', 'S1424C', 'R2418G', 'K2729N', 'K1690N',
       'Y3092C', 'I2675V', 'A2643G', 'L2106P', 'N1878K', 'T3349A',
       'C1265S', 'T582P', 'T1354M', 'L1904V', 'R2304C', 'S384F', 'E2663V',
       'R3052W', 'S142I', 'N2436I', 'N319T', 'R2659T', 'E3002K', 'G1194D',
       'Q2858R', 'K2411T', 'D1352Y', 'V1306I', 'N1102Y', 'F1524V',
       'R2502H', 'K1434I', 'K2472T', 'S1670A', 'Y3098H', 'L2721H', 'Y42C',
       'R1190W', 'C1365Y', 'A2717S', 'N372H', 'R2520Q', 'S1172L', 'Q2384K',
       'P1819S', 'R2842H', 'P655R', 'L1019V', 'R2888C', 'N588D', 'N517S',
       'K2950N', 'R2784W', 'I2627F', 'S2483N', 'D2312V', 'D2723H',
       'Deletion', 'L929S', 'V2908G', 'R18H', 'S196N', 'A1170V', 'R2336H',
       'D806H', 'V1643A', 'H1966Y', 'T630I', 'E2856A', 'S1733F', 'E462G',
       'I1349T', 'D3095E', 'R2973C', 'D935N', 'D2665G', 'D3170G', 'R2602T',
       'R2842C', 'C554W', 'N1228D', 'V3079I', 'H1918Y', 'L209F', 'G602R',
       'I2285V', 'S326R', 'V2969M', 'R2659K', 'L2865V', 'N987I', 'D1420Y',
       'T2681R', 'S869L', 'K513R', 'N56T', 'W2626C', 'S206C', 'A2770T',
       'D2723G', 'T2722R', 'H2074N', 'G2274V', 'R2318Q', 'P375S', 'R174C',
       'A75P', 'N2113S', 'D1280V', 'T3211K', 'R3052Q', 'S2670L', 'M2676T',
       'Q1396R', 'K607T', 'G1529R'], dtype=object), 'ARID1A': array(['Truncating Mutations'], dtype=object), 'ERBB4': array(['E452K', 'EZR-ERBB4 Fusion', 'R306S', 'E563K', 'V348L', 'T244R',
       'Y285C', 'R544W', 'M887I', 'D595V', 'Fusions', 'N181S', 'R838Q',
       'G785S', 'G936R', 'E317K', 'E542K', 'E872K', 'K935I', 'E836K'], dtype=object), 'BARD1': array(['Deletion'], dtype=object), 'IDH2': array(['R140Q', 'R172S', 'R172G', 'R172K', 'R172M'], dtype=object), 'IDH1': array(['Y139D', 'R132C', 'R100A', 'G123R', 'R132H', 'R132G', 'R132Q',
       'P33S', 'A134D', 'V294M', 'H133Q', 'I130M', 'I99M', 'G70D', 'R49C',
       'E190K'], dtype=object), 'U2AF1': array(['S34F', 'Q157P', 'S34Y'], dtype=object), 'NUP93': array(['G591V', 'Exon 13 deletion', 'K442Nfs*'], dtype=object), 'KLF4': array(['K409Q'], dtype=object), 'SMAD2': array(['R321Q', 'P305L', 'F346V', 'V319D', 'W368H', 'Deletion', 'D450E',
       'Y406H', 'K420A', 'T413N', 'S276L', 'D300H', 'K375A', 'N387P',
       'Y426A', 'R428A', 'D300N', 'R310H', 'D450H'], dtype=object), 'BRD4': array(['Amplification', 'BRD4-NUT Fusion'], dtype=object), 'PMS1': array(['Q233*'], dtype=object), 'MDM2': array(['Amplification'], dtype=object), 'SETD2': array(['Deletion', 'Truncating Mutations', 'R1625C'], dtype=object), 'MDM4': array(['Amplification'], dtype=object), 'FGFR3': array(['G382D', 'A500T', 'K650E', 'D617G', 'K650T', 'D641N',
       'IGH-FGFR3 Fusion', 'N540S', 'D641G', 'Y647C', 'K650Q', 'S371C',
       'E627D', 'K650R', 'R248C', 'K508M', 'V677I', 'G637W',
       'FGFR3 - BAIAP2L1 Fusion', 'D646Y', 'C582F', 'H643D', 'R669G',
       'I538V', 'E466K', 'F384L', 'Amplification', 'G375C', 'G697C',
       'V555M', 'Y373C', 'S249C', 'A391E', 'K650M', 'K650N', 'G370C',
       'N653H', 'Fusions', 'G380R'], dtype=object), 'FGFR1': array(['FGFR1OP1-FGFR1 Fusion', 'A263V', 'CEP110-FGFR1 Fusion',
       'ZNF198-FGFR1 Fusion', 'E664K', 'Amplification',
       'FGFR1-TACC1 Fusion', 'K656E', 'Fusions', 'BCR-FGFR1 Fusion',
       'ERLIN2?FGFR1 Fusion', 'N546K', 'CUX1-FGFR1 Fusion'], dtype=object), 'JAK2': array(['L611V', 'K539L', 'D544H', 'Y1045*', 'SPAG9-JAK2 Fusion',
       'SSBP2-JAK2 Fusion', 'W1038C', 'Y931C', 'PCM1-JAK2 Fusion',
       'BCR-JAK2 Fusion', 'R683T', 'F537_K539delinsL', 'L983F',
       'N542_E543del', 'PAX5-JAK2 Fusion', 'L1026F', 'I1018F',
       'ATF7IP-JAK2 Fusion', 'S1039F', 'S1025C', 'V1075F', 'T875N',
       'R683S', 'Y1045W', 'C618R', 'Amplification', 'F1061W', 'H538Q',
       'R683G', 'I1018W', 'TEL-JAK2 Fusion', 'R683K', 'Fusions'], dtype=object), 'CCND1': array(['D289del', 'D289_D292del', 'CCND1-IGH Fusion',
       'Truncating Mutations', 'Amplification', '256_286trunc', 'T286A',
       'T286I', 'C47S', 'P287A', 'L283_D294del', 'P287S', 'P287T'], dtype=object), 'ARID1B': array(['Deletion', 'Truncating Mutations'], dtype=object), 'PTPN11': array(['Deletion', 'D61Y', 'T73I', 'T507K', 'E69K', 'E76K', 'G503V',
       'Q79R', 'R498L', 'A72V', 'D106A', 'Q510E', 'M504V', 'T468M', 'V45L',
       'E76A', 'P491S', 'K590R', 'F71I', 'A72S', 'G464A', 'Y63C', 'S502T',
       'D61N', 'Y62C', 'E139D'], dtype=object), 'CCND2': array(['Amplification'], dtype=object), 'B2M': array(['M1R', 'M1K', 'M1V', 'M1T', 'Deletion', 'Truncating Mutations'], dtype=object), 'TCF7L2': array(['Truncating Mutations'], dtype=object), 'KIT': array(['D816E', 'K558N', 'L576del', 'D816F', 'A829P', 'D816N',
       'I563_L576del', 'T670I', 'E839K', 'K642E', 'K550_K558del',
       'P577_D579del', 'N822H', 'Y578C', 'Y553_K558del', 'C809G',
       'P551_E554del', 'T417_D419delinsRG', 'T417_D419delinsI', 'H697Y',
       'N655K', 'D816Y', 'F522C', 'C443Y', 'K558delinsNP', 'W557R',
       'V559C', 'Y553_Q556del', 'M541L', 'P577_W582delinsPYD', 'D820G',
       'E554_K558del', 'P551_V555del', 'S501_A502dup', 'D419del',
       'Amplification', 'T574_R588delinsL', 'V559_V560del', 'M552_K558del',
       'K509I', 'W557G', 'L576P', 'A504_Y505ins', 'V559del', 'V559D',
       'D816V', 'Q556_K558del', 'P551_W557delinsL', 'Exon 11 mutations',
       'V560G', 'D814V', 'D816H', 'N822Y', 'I653T', 'N463S',
       'N564_Y578del', 'D579del', 'V560D', 'D737N', 'V852I', 'V559G',
       'D816G', 'A502_Y503dup', 'D820E', 'K558_V559del',
       'Exon 9 mutations', 'S476I', 'V569_L576del', 'N505I', 'Y553N',
       'E554_V559del', 'D572A', 'N822I', 'P838L', 'D60N', '550_592del',
       'W557_V559delinsC', 'P573_D579del', 'W557_K558del', 'E554_I571del',
       'K558_E562del', 'D820A', 'W557_V560del', 'Y823D', 'Y418_D419del',
       'K550_W557del', 'Y570H', 'R796G', 'V555_V559del', 'V560E',
       'K550_V555delinsI', 'V654A', 'E490K', 'R634W', 'V555_L576del',
       'M552_W557del', 'D816A', 'V559A', 'T574insTQLPYD'], dtype=object), 'FOXA1': array(['G87R', 'L388M', 'Deletion', 'S453fs*', 'L455M', 'F400I',
       'Amplification', 'M253K'], dtype=object), 'PTEN': array(['D92G', 'C124N', 'C124S', 'L112R', 'A34D', 'E157G', 'D107Y',
       'A121P', 'H118P', 'A121V', 'C136Y', 'R130*', 'T401I', 'L181P',
       'L112P', 'G129R', 'Y68H', 'P38L', 'Y174N', 'T131L', 'R15S', 'Y155C',
       'S227F', 'C71Y', 'K125R', 'S10N', 'R173C', 'D162H', 'I32del',
       'F347L', 'P95S', 'M199del', 'N276S', 'T131A', 'S362L', 'V217D',
       'G129A', 'T167A', 'A126G', 'G165E', 'P169H', 'N48K', 'G165R',
       'Deletion', 'R130K', 'R15K', 'H123D', 'G127N', 'L325F', 'G20E',
       'S170R', 'K125M', 'C105F', 'D92A', 'A121E', 'A126V', 'Y65C',
       'C136R', 'T131S', 'F341V', 'A126S', 'R130G', 'H93Q', 'A126D',
       'G251C', 'F241S', 'I168F', 'P95L', 'D92E', 'K125L', 'C124R', 'L23F',
       'D162G', 'G129E', 'K128N', 'K289E', 'I122V', 'Y68D', 'V343E',
       'R159G', 'T131I', 'F21A', 'R130L', 'H123Q', 'V369G', 'G127E',
       'K342N', 'I33del', 'Y16C', 'D252G', 'H61D', 'L108P', 'R173P',
       'D331G', 'G165V', 'K62R', 'I122L', 'R161G', 'T160I', 'R130A',
       'A39P', 'D24Y', 'T319del', 'L345Q', 'S170N', 'K125E', 'I122S',
       'K128T', 'V343L', 'Y27S', 'M134L', 'Q110R', 'M35R', 'F154L', 'P96Q',
       'D326N', 'H61R', 'H93R', 'D92V', 'D92H', 'R173H', 'H93D', 'H123Y',
       'D92N', 'K128Q', 'Truncating Mutations'], dtype=object), 'FAT1': array(['Deletion', 'A4419S', 'P4309A', 'Truncating Mutations'], dtype=object), 'RUNX1': array(['R177*', 'Y113*', 'R139G', 'K83N', 'R177Q', 'R166Q', 'P173S',
       'R201Q', 'S70fsX93', 'W279*', 'Truncating Mutations', 'R174*',
       'D171G', 'Amplification', 'RUNX1-EVI1 Fusion', 'TEL-RUNX1 Fusion',
       'H78Q', 'G42R', 'RUNX1-RUNX1T1 Fusion', 'D171N', 'A122*', 'Fusions',
       'R80C', 'K83E'], dtype=object), 'WHSC1': array(['E1099K'], dtype=object), 'APC': array(['I1307K', 'N1026S', 'Deletion', 'R640G', 'A290T'], dtype=object), 'KDM5A': array(['Amplification'], dtype=object), 'CTCF': array(['H284N', 'H284P', 'Truncating Mutations', 'R339W', 'Deletion'], dtype=object), 'KDM5C': array(['D402Y', 'P480L', 'V1075Yfs*2', 'Overexpression', 'M1_E165DEL'], dtype=object), 'IL7R': array(['T244_I245insCPT'], dtype=object), 'KMT2C': array(['S3660L', 'A1685S', 'Truncating Mutations', 'Deletion'], dtype=object), 'FOXP1': array(['Truncating Mutations', 'R514C'], dtype=object), 'CDKN1B': array(['Deletion', 'P133T', 'V109G', 'Truncating Mutations'], dtype=object), 'CDKN1A': array(['Deletion', 'Truncating Mutations', 'S31R'], dtype=object), 'RRAS2': array(['Q72L'], dtype=object), 'ERG': array(['TMPRSS2-ERG Fusion', 'EWSR1-ERG Fusion', 'FUS-ERG Fusion',
       'Fusions'], dtype=object), 'CARM1': array(['S217C', 'S217T', 'S217A'], dtype=object), 'RIT1': array(['A77S', 'A77P', 'A77T', 'M90I'], dtype=object), 'BCL2': array(['IGH-BCL2 Fusion'], dtype=object), 'RICTOR': array(['Amplification'], dtype=object), 'XPO1': array(['C528S', 'E571K'], dtype=object), 'MYCN': array(['Amplification', 'P44L'], dtype=object), 'PPM1D': array(['422_605trunc'], dtype=object), 'FUBP1': array(['Truncating Mutations'], dtype=object), 'SRSF2': array(['P95H'], dtype=object), 'ASXL1': array(['Deletion'], dtype=object), 'TSC2': array(['S1036P', 'G1567D', 'L410R', 'V299G', 'H1620R',
       'Truncating Mutations', 'L1584R', 'Q1554H', 'T1623I', 'H597Y',
       'E1552del', 'T1203K', 'V1673F', 'V705M', 'P1675L', 'G1596V',
       'R905W', 'R905Q', 'V1673D', 'R611W', 'L830R', 'S1653P', 'R905G',
       'S1653F', 'V769E', 'Q1503P', 'Deletion', 'A889P', 'W1610G', 'R462C',
       'R1200W', 'L792R', 'L493V', 'L493P', 'C696Y', 'L826P', 'A614D',
       'L146R', 'R611Q', 'S1498N', 'F615S', 'Y598C', 'V705E', 'L844R',
       'L448P', 'P1709L', 'E75G'], dtype=object), 'CCNE1': array(['Amplification', 'Overexpression'], dtype=object), 'JUN': array(['Amplification'], dtype=object), 'PPP6C': array(['H114Y', 'P186S', 'S270L', 'R264C'], dtype=object), 'NF2': array(['T352M', 'E106G', 'K413E', 'L46R', 'A211D', 'Truncating Mutations',
       'L535P', 'Q538P', 'K79E', 'Q324L', 'L64P'], dtype=object), 'PIK3R1': array(['T576del', 'G376R', 'X475_splice', 'L370fs', 'E218*', 'Deletion',
       'I559_D560insDKRMNS', 'N564D', 'W237_Y242del', 'E137K', 'E160*',
       'Q579_L581del', 'X582_splice', 'D560Y', 'K379E', 'R574fs',
       'X434_splice', 'K459_S460delinsN', 'K288Q', 'R162*', 'R348*',
       'R262T', 'N564K', 'E439del'], dtype=object), 'JAK1': array(['S646F', 'F958S', 'R724H', 'K1026E', 'Y652H', 'S703I', 'L783F',
       'V658F', 'A723D', 'R879C', 'A634D', 'Truncating Mutations',
       'G1079D', 'L910P', 'Y1035C', 'F958V', 'S729C', 'K648N'], dtype=object), 'MYD88': array(['M232T', 'S243N', 'L265P'], dtype=object), 'CHEK2': array(['K373E', 'I157T', 'E161del', 'P85L', 'S428F', 'R117G', 'E321K'], dtype=object), 'TMPRSS2': array(['TMPRSS2-ETV1 Fusion', 'TMPRSS2-ETV5 Fusion', 'TMPRSS2-ETV4 Fusion',
       'T75M', 'Fusions', 'M160V'], dtype=object), 'CTLA4': array(['T17A'], dtype=object), 'TCF3': array(['N551K'], dtype=object), 'STAG2': array(['Truncating Mutations'], dtype=object), 'ARID2': array(['Deletion', 'Truncating Mutations'], dtype=object), 'RAD50': array(['L234fs', 'L1273F', 'Deletion'], dtype=object), 'RNF43': array(['Truncating Mutations'], dtype=object), 'EGFR': array(['L747_T751delinsP', 'S752_I759del', 'I491M', 'D770_P772dup',
       'G719A', 'G735S', 'N771_H773dup', 'K467T', 'E746_T751insIP',
       'D770_N771insD', 'G724S', 'K745_A750del', 'EGFRvIII', 'V765A',
       'EGFRvII', 'L858M', 'H870R', 'E758G', 'H773L', 'T710A', 'L747P',
       'E709V', 'D761Y', 'K806A', 'H773Y', 'V851A', 'EGFR-RAD51 Fusion',
       'D770_N771insNPG', 'E746_A750del', 'A864T', 'E746Q',
       'Exon 19 deletion/insertion', 'EGFRvV', 'A859_L883delinsV', 'E709Q',
       'E804G', 'A750_E758del', 'V769_D770insGVV', 'G796S', 'R836C',
       'P691S', 'G857E', 'G719S', 'L747V', 'T783A', 'L692F',
       'A750_E758delinsP', 'V726M', 'E746V', 'L747_P753delinsS',
       'Exon 19 deletion', 'S768I', 'C620Y', 'S921R', 'N826Y',
       'H773insLGNP', 'H773_V774insH', 'L747_A750delinsP', 'C628Y',
       'Exon 20 insertion', 'Exon 19 insertion', 'L704N', 'L861Q', 'L747F',
       'N826S', 'L747_E749del', 'Amplification', 'L858Q', 'L747_T751del',
       'R108K', 'V774A', 'S645C', 'L858R', 'EGFR-KDD', 'L747_A750del',
       'V774M', 'H773dup', 'E746G', 'L861P', 'K745M', 'E746_T751delinsA',
       'L747S', 'S492R', 'S768_D770dup', 'D837N', 'L861R', 'G719D',
       'E709_T710delinsD', 'A763_Y764insFQEA', 'S784F', 'Y801H',
       'I744_K745delinsKIPVAI', 'V742A', 'E746_A750delinsQ',
       'L747_P753del', 'T725M', 'L703P', 'T790M', 'A767_V769dup', 'E866K',
       'EGFRvIV', 'S768_V769insVAS', 'E746_T751delinsVA', 'E734Q', 'R776H',
       'T263P', 'T785A', 'A750P', 'L838V', 'P753S', 'A767_V769del',
       'G863S', 'I853T', 'L833V', 'R324L', 'V769M', 'W731L', 'E884K',
       'T34_A289del', 'G719C', 'E330K', 'S464L', 'R841K',
       'D770_N771insVDSVDNP', 'E746_S752delinsA', 'R776C', 'E868G',
       'P848L', 'L861F', 'G810S', 'EGFR-PURB Fusion', 'E709G', 'E709K',
       'P596L', 'E709A', 'D587H', 'G863D', 'L838P', 'E746_S752delinsI',
       'V843I', 'G465E'], dtype=object), 'VEGFA': array(['Amplification'], dtype=object), 'RAD21': array(['Deletion', 'Truncating Mutations'], dtype=object), 'RHOA': array(['T37A', 'G17V', 'G17E', 'G14V', 'E40Q', 'L57V', 'R5Q',
       'Amplification', 'T19N', 'Y42C', 'E40T', 'Overexpression', 'E40L',
       'G17A', 'Y34C', 'E40W', 'E40N'], dtype=object), 'FGF4': array(['Amplification'], dtype=object), 'FGF3': array(['Amplification'], dtype=object), 'PAX8': array(['PAX8-PPAR? Fusion'], dtype=object), 'ETV1': array(['TMPRSS2-ETV1 Fusion', 'Fusions', 'EWSR1-ETV1 Fusion'], dtype=object), 'EPAS1': array(['534_536del', 'P531L', 'A530V', 'P531S', 'A530T', 'Y532C', 'P531A',
       '533_534del'], dtype=object), 'MGA': array(['Deletion', 'Truncating Mutations'], dtype=object), 'TP53': array(['S215G', 'R273L', 'R202S', 'H179Q', 'K120E', 'C277Q', 'R290H',
       'R337P', 'P278S', 'Y234H', 'C277W', 'S241T', 'V173E', 'Q331R',
       'H115N', 'P219S', 'R249W', 'V274F', 'R175C', 'R248W', 'S121A',
       'Y220S', 'P153H', 'R181C', 'R337C', 'K120N', 'G245D', 'R175H',
       'R181L', 'G244S', 'S240R', 'R273C', 'E221Q', 'Q331H', 'Y220C',
       'R342P', 'C242F', 'C242S', 'C135S', 'M237K', 'V197L', 'S121C',
       'S241F', 'DNA binding domain deletions', 'R248K', 'H214R', 'L330R',
       'P151S', 'Truncating Mutations', 'R110L', 'C176F', 'H179Y', 'C277R',
       'L344R', 'R267Q', 'R156H', 'D186A', 'C275S', 'E285K',
       'DNA binding domain insertions', 'V157D', 'Y236S', 'R337H', 'R213Q',
       'R280K', 'R110P', 'R249S', 'H168N', 'R290G', 'H193P', 'N239S',
       'L348S', 'M133T', 'H179L', 'G244R', 'P316L', 'H297N', 'V272L',
       'G244D', 'G245S', 'L348F', 'Y163C', 'H179N', 'N235D', 'H168R',
       'E285V', 'I255F', 'R158H', 'A161T', 'P278A', 'R249M', 'F341C',
       'K292T', 'G266R', 'H179R', 'C141Y', 'V143A', 'E326L', 'C229R',
       'K291Q', 'Q144R', 'R248Q', 'P250L', 'A347T', 'V157F', 'I251S',
       'F134Y', 'P152L', 'G199R', 'I162M', 'C135R',
       'DNA binding domain missense mutations', 'A161S', 'G334R', 'V173L',
       'R158L', 'H193N', 'R175L', 'P47S', 'R280A', 'R280T', 'K292I',
       'Y236C', 'R248L', 'R156C', 'Y236D', 'T150I', 'L344P', 'H233N',
       'P278L', 'K291E', 'E258V', 'S241Y', 'C238F', 'F212Y', 'R282W',
       'R267P', 'H214Q', 'T123A', 'S215C', 'R282Q', 'R273H', 'G245A',
       'C135Y', 'R283Q', 'R181H', 'P151H', 'F328V', 'Y234C', 'P151A',
       'R273G', 'V197E', 'E286K', 'K120M', 'I195T', 'G356A', 'R158C',
       'H214N', 'C238S', 'L145R', 'P278R', 'R337L', 'G266E'], dtype=object), 'GNAQ': array(['Q209L'], dtype=object), 'ETV6': array(['R418G', 'R369Q', 'Fusions', 'P214L', 'R369W', '385_418del',
       'ETV6-FLT3 Fusion', 'ETV6-NTRK3 Fusion'], dtype=object), 'DDR2': array(['L239R', 'T654M', 'G253C', 'L63V', 'G505S', 'G774V', 'I638F',
       'Amplification', 'T654I'], dtype=object), 'MPL': array(['S505N', 'W515K', 'W515L', 'P106L'], dtype=object), 'CBL': array(['W802*', 'Q249E', 'N454D', 'L399V', 'V391I', 'V430M', 'Deletion',
       'Y371H', 'C384R', 'P395A', 'K382E', 'R420Q', 'C381A', 'P428L',
       'D390Y', 'Truncating Mutations', 'Q367P', 'M374V', 'Y371S', 'H94Y',
       'C396R', 'G375P', 'S376F', 'P417A', 'H398Y'], dtype=object), 'PAK1': array(['Amplification'], dtype=object), 'MEF2B': array(['N81Y', 'D83V', 'Y69H', 'E77K', 'K4E', 'Amplification'], dtype=object), 'SHQ1': array(['Deletion'], dtype=object), 'PRDM1': array(['Y149D', 'P48R', 'C569Y', 'Deletion', 'P48T'], dtype=object), 'NFE2L2': array(['E79K', 'E82G', 'T80A', 'E79Q', 'G81R', 'W24R', 'L30F', 'W24C',
       'G81D', 'T80K', 'V32G', 'D29Y', 'G31A', 'D29H', 'G81S', 'I28T',
       'G31V', 'G31R', 'E82V', 'E82D', 'T80R', 'W24S'], dtype=object), 'NSD1': array(['Fusions', 'NSD1-NUP98 Fusion'], dtype=object), 'BTK': array(['R544S', 'C481S', 'Y551F'], dtype=object), 'AGO2': array(['R812A', 'R647A', 'R583A', 'Amplification', 'Q545A'], dtype=object), 'PDGFRB': array(['KANK1-PDGFRB Fusion', 'R987W', 'NIN-PDGFRB Fusion',
       'GPIAP1-PDGFRB Fusion', 'GOLGA4-PDGFRB Fusion',
       'WDR48-PDGFRB Fusion', 'TP53BP1-PDGFRB Fusion', 'Fusions',
       'PRKG2-PDGFRB Fusion', 'EBF1-PDGFRB Fusion', 'ATF7IP-PDGFRB Fusion',
       'Amplification', 'E1071W', 'PDE4DIP-PDGFRB Fusion',
       'BIN2-PDGFRB Fusion', 'R561C', 'CEP85L-PDGFRB Fusion',
       'GIT2-PDGFRB Fusion', 'SPTBN1-PDGFRB Fusion', 'HIP1-PDGFRB Fusion',
       'KIAA1509-PDGFRB Fusion', 'ETV6-PDGFRB Fusion', 'V665A'], dtype=object), 'PDGFRA': array(['C450_K451insMIEWMI', 'A633T', 'R481G', 'H845Y', 'P130S',
       'KDR-PDGFRA Fusion', 'N870S', 'Q579R', 'R748G', 'R841K',
       'C456_N468del', 'N659K', 'H570R', 'Y555C', 'S584L', 'E996K',
       'V536E', 'S478P', 'I843_D846del', 'H650Q', 'G829R', 'A633V',
       'V561_I562insER', 'I562M', 'Y375_K455del', 'V561A', 'D842_M844del',
       'K830R', 'D842_H845del', 'S566_E571delinsR', 'D842I', 'N659R',
       'KIF5B-PDGFRA Fusion', 'Y849S', 'I834V', 'D1071N', 'G853D', 'D842V',
       'T674I', 'P577S', 'E311_K312del', 'Amplification', 'W349C', 'D846Y',
       'N848K', 'I843del', 'L507P', 'N659S', 'V561D', 'C456_R481del',
       'H845_N848delinsP', 'STRN-PDGFRA Fusion', 'D842Y', 'Y849C',
       'BCR-PDGFRA Fusion', '560_561insER', 'ETV6-PDGFRA Fusion', 'V658A',
       'V544_L545insAVLVLLVIVIISLI', 'F808L'], dtype=object), 'PMS2': array(['G857A', 'E5K', 'E41A', 'R20Q', 'G207E', 'Truncating Mutations',
       'R802*', 'R421*', 'I668V', 'I18V', 'S46I', 'L585I', 'S46N', 'T485K',
       'E541K', 'R315*'], dtype=object), 'LATS2': array(['Deletion'], dtype=object), 'PPP2R1A': array(['W257G', 'S256W', 'P179R', 'R183P', 'R182W', 'R258H', 'S256F',
       'R258C', 'P179L', 'R183G', 'W257C', 'R183Q'], dtype=object), 'SMAD4': array(['D537E', 'W509R', 'K507Q', 'D537Y', 'G508S', 'R378A', 'R441P',
       'R361C', 'L43V', 'Truncating Mutations', 'R497H', 'R515G', 'W398V',
       'G419V', 'R420H', 'Y513A', 'V465M', 'D493A', 'V370D', 'R100T',
       'P130S', 'Deletion', 'K507A', 'E330G', 'K428A', 'R496H', 'E330K',
       'C324Y', 'Y353L', 'D351H', 'A532H', 'D357Y', 'H132Y'], dtype=object), 'PIK3CD': array(['E525K', 'N334K', 'E1021K'], dtype=object), 'ATM': array(['E1978*', 'Truncating Mutations', 'Deletion', 'N2875K', 'D2870A',
       'D1853N'], dtype=object), 'KRAS': array(['G13E', 'Q22K', 'Q61H', 'A11_G12insGA', 'G12F', 'L19F', 'G12D',
       'F156L', 'Y32S', 'V14I', 'Y64A', 'N116H', 'E31K', 'G60D', 'T74P',
       'R164Q', 'G60R', 'A146T', 'G13C', 'Wildtype', 'A146V', 'P34L',
       'K147E', 'G12R', 'Q61L', 'T58I', 'G12S', 'K5N', 'A18D', 'S65N',
       'D153V', 'G12A', 'P34R', 'K117N', 'G12V', 'Amplification', 'Y40A',
       'Q61R', 'G12C', 'D119N', 'A59G', 'F28L', 'Q22R', 'Q22E'], dtype=object), 'RET': array(['RET-CCDC6 Fusion', 'A883T', 'C609Y', 'T338I', 'C634R', 'S891A',
       'E632_L633del', 'C618R', 'C611Y', 'K603Q', 'D631G', 'E921K',
       'Fusions', 'S904F', 'L790F', 'R873Q', 'Y806C', 'S765P', 'V804L',
       'R897Q', 'D631A', 'A883F', 'C634S', 'A919V', 'C634W', 'M980T',
       'E768D', 'KIF5B-RET Fusion', 'R833C', 'V804M', 'M918V', 'C630R',
       'V648I', 'I852M', 'C620R', 'C634Y', 'V804G', 'R886W', 'F893L',
       'Y791F'], dtype=object), 'SMAD3': array(['Y298A', 'K341A', 'V356R', 'D258N', 'R373H', 'R287A', 'Deletion',
       'Y297A', 'Y238A', 'R243A', 'R373Q', 'R292A', 'E239A', 'D408Y',
       'Y384A', 'W406A', 'D408H', 'R268A', 'D408E', 'V277D', 'S425C',
       'Y237A', 'P336S'], dtype=object), 'SMO': array(['E518K', 'D384N', 'V321M', 'Q477E', 'D473G', 'D473H', 'H231R',
       'Q635E', 'N219D', 'T241M', 'L221R', 'S387N', 'I408V', 'L412F',
       'W535L', 'Amplification', 'F460L', 'V414A', 'A459V', 'K575M',
       'E518A'], dtype=object), 'POLE': array(['S459del', 'S297F', 'P286R', 'S459F', 'S279Y', 'F367S', 'V411L',
       'P286H', 'L424I', 'L424V'], dtype=object), 'ATR': array(['Truncating Mutations'], dtype=object), 'NTRK2': array(['M713I', 'Fusions', 'R715G'], dtype=object), 'ABL1': array(['BCR-ABL1 Fusion', 'L248V', 'G250E', 'M244V', 'Fusions', 'E355A',
       'L387M', 'E281K', 'E459K', 'F311L', 'Y253F', 'T315I', 'E279K',
       'E255V', 'E255K', 'H396P', 'Y253H', 'V299L', 'L384M', 'V379I',
       'M351T', 'V289F', 'Q252H', 'F317L', 'H396R', 'F359C'], dtype=object), 'BRIP1': array(['A349P', 'P47A', 'M299I', 'R173C', 'Truncating Mutations',
       'Deletion', 'K52R'], dtype=object), 'NTRK3': array(['ETV6-NTRK3 Fusion', 'BTBD1-NTRK3 Fusion', 'Fusions',
       'CPEB1-NTRK3 Fusion', 'G623R'], dtype=object), 'RAF1': array(['Fusions', 'S259P', 'ESRP1-RAF1 Fusion', 'I448V',
       'ATG7-RAF1 Fusion', 'S259F', 'Amplification', 'S427G',
       'SRGAP3-RAF1 Fusion', 'P261L', 'S257W', 'S259A'], dtype=object), 'FLT3': array(['FLT3 internal tandem duplications', 'D835N', 'N676K',
       'Y599_D600insGLYVDFREYEY', 'D835A', 'F590G', 'T167A', 'D324N',
       'Y591D', 'F594L', 'V557I', 'I867S', 'D835del', 'I836del',
       'Y599_D600insSTDNEYFYVDFREYEY', 'E598_Y599insDVDFREYE',
       'W603_E604insDREYEYDLKW', 'N676D', 'S840_N841insGS', 'A627T',
       'G831E', 'R834Q', 'N676S', 'M664I', 'F691L', 'M737I', 'V194M',
       'V592A', 'S451F', 'S451E', 'D835H', 'Y842C', 'D835E', 'Y364H',
       'L601_K602insREYEYDL', 'F594_R595insSDNEYFYVDF',
       'L611_E612insCSSDNEYFYVDFREYEYDLKWEFPRENL',
       'E612_F613insGYVDFREYEYDLKWEFRPRENLEF', 'V579A',
       'Y599_D600insPAPQIMSTSTLISENMNIA', 'N841I', 'Y572C',
       'E598_Y599insGLVQVTGSSDNEYFYVDFREYE', 'D835Y',
       'Y599_D600insEYEYEYEY', 'D839G', 'Amplification',
       'D600_L601insFREYEYD', 'Y599F'], dtype=object), 'NCOR1': array(['Truncating Mutations'], dtype=object), 'FLT1': array(['Amplification'], dtype=object), 'STAT3': array(['H410R', 'D661Y', 'Y640F', 'D661V', 'S614R', 'F384L', 'F384Y',
       'F384V'], dtype=object), 'BCL10': array(['Deletion', 'Truncating Mutations'], dtype=object), 'FGF19': array(['Amplification'], dtype=object), 'RBM10': array(['Truncating Mutations', 'Deletion', 'V354E'], dtype=object), 'FANCC': array(['Truncating Mutations'], dtype=object), 'CCND3': array(['Amplification', 'P284L', 'Q276*', 'T286A', 'T283A', 'I290A',
       'I290R'], dtype=object), 'FANCA': array(['S858R', 'S1088F', 'Truncating Mutations', 'H492R', 'Y510C',
       'Deletion', 'L274P', 'W183A', 'L210R'], dtype=object), 'HLA-B': array(['Truncating Mutations'], dtype=object), 'KDM6A': array(['Deletion', 'Truncating Mutations'], dtype=object), 'HRAS': array(['G12V', 'G13D', 'Amplification', 'A146T', 'K117R', 'Q61H', 'G13R',
       'G13V', 'A146V', 'Q61L', 'Q61K', 'K117N', 'T58I', 'F28L', 'G13C',
       'G12A', 'Q22K', 'G12S', 'Y64A', 'P34R', 'G12C', 'A59T'], dtype=object), 'MAPK1': array(['E322K', 'Amplification'], dtype=object), 'FBXW7': array(['R465H', 'S562L', 'R479L', 'S462Y', 'S227A', 'D520N', 'R479Q',
       'R479H', 'R658Q', 'R505C', 'R689Q', 'K11R', 'T205A', 'R505L',
       'Deletion', 'I347M', 'Copy Number Loss', 'G423R', 'R482Q'], dtype=object), 'TGFBR2': array(['R537P', 'D404G', 'P525L', 'Truncating Mutations', 'Deletion'], dtype=object), 'TGFBR1': array(['N45S', 'R487Q', 'C41Y', 'G52R', 'P83L', 'S241L', 'S387Y',
       'TGFBR1*6A'], dtype=object), 'NFKBIA': array(['E14*', 'S36Y', 'M37K', 'S32I'], dtype=object), 'IKZF1': array(['Deletion'], dtype=object), 'TET1': array(['Deletion', 'MLL-TET1 Fusion', 'Truncating Mutations'], dtype=object), 'SF3B1': array(['K700R', 'R625C', 'E622D', 'H662Q', 'R625D', 'K666M', 'H662R',
       'R625H', 'K666T', 'R625L', 'E622Q', 'R625G', 'K666N', 'K666R',
       'K700E', 'R957Q'], dtype=object), 'TET2': array(['D1384V', 'Deletion', 'R1262A', 'Y1295A', 'H1904R', 'R1896M',
       'W1291R', 'Y1294A', 'S1303N', 'K1299E', 'H1382Y', 'M1293A',
       'N1387A', 'S1290A'], dtype=object), 'RXRA': array(['S247F', 'S247Y'], dtype=object), 'ERCC2': array(['G665A', 'E606G', 'A717G', 'N238S', 'Amplification', 'S746fs',
       'R631C', 'D609E', 'D513Y', 'V536M', 'R487W', 'P463L', 'L461V',
       'R592H', 'Y24C', 'D423N', 'F568fs', 'V242F', 'R601Q'], dtype=object), 'MTOR': array(['P2273S', 'A1519T', 'D2512G', 'L1433S', 'L2427R', 'V2006L',
       'I2500F', 'R2430M', 'L2431P', 'A8S', 'R2505Q', 'I2500M', 'E2419K',
       'T1977K', 'F1888I', 'C1483F', 'W1456R', 'V2006I', 'A41P', 'F1888V',
       'T1977R', 'S2215T', 'L2230V', 'T1977I', 'A1459P', 'C1483W',
       'A2034V', 'F1888L', 'Q2223K', 'M2327I', 'S2215F', 'T1977S',
       'F2108L', 'Y1463S', 'P2476L', 'K1452N', 'S2215Y', 'E2014K',
       'D2512Y', 'E1799K', 'C1483Y', 'L1460P', 'R2505*', 'A41T', 'C1483R'], dtype=object), 'BCOR': array(['BCOR-RARA Fusion', 'ZC3H7B-BCOR Fusion', 'Deletion', 'Fusions',
       'BCOR-CCNB3 Fusion'], dtype=object), 'DUSP4': array(['Deletion'], dtype=object), 'ATRX': array(['Deletion', 'Truncating Mutations'], dtype=object), 'RAD51D': array(['Truncating Mutations'], dtype=object), 'RAD51C': array(['Truncating Mutations', 'G264S'], dtype=object), 'RAD51B': array(['HMGA2-RAD51B Fusion'], dtype=object), 'YAP1': array(['Fusions', 'Amplification', 'YAP1-MAMLD1 Fusion',
       'YAP1-FAM118B Fusion'], dtype=object), 'KNSTRN': array(['S24F', 'R11K', 'A40E', 'P26S'], dtype=object), 'DICER1': array(['G1809R', 'G1809K', 'D1709E', 'D1709A', 'E1705A', 'D1810A',
       'Truncating Mutations', 'E1705K', 'Deletion'], dtype=object), 'ARID5B': array(['Truncating Mutations'], dtype=object), 'SOS1': array(['M269R', 'R552S', 'R552G', 'G434R', 'E846K', 'N233Y'], dtype=object), 'VHL': array(['P86H', 'D121G', 'R64P', 'Y98H', 'S68W', 'V84L', 'L128F',
       'Truncating Mutations', 'V155A', 'G114R', 'H115R', 'L118P', 'R200W',
       'Y98N', 'Q164R', 'A149P', 'L188Q', 'R69C', 'Y112H', 'I151S',
       'L158P', 'N78S', 'F119S', 'S65W', 'Q145H', 'L158Q', 'H191D',
       'L188V', 'L118R', 'W117C', 'L63P', 'E70K', 'P25L', 'R167W', 'R82P',
       'S65L', 'L158V', 'S65A', 'Q96P', 'Deletion', 'R167Q'], dtype=object), 'ESR1': array(['Fusions', 'V534E', 'L536P', 'L469V', 'S463P', 'E380Q', 'A546D',
       'ESR1-YAP1 Fusion', 'L536Q', 'L536H', 'V422del',
       'ESR1-CCDC170 Fusion', 'L536R', 'S432L', 'Y537C', 'Y537S'], dtype=object), 'FOXO1': array(['T24A'], dtype=object), 'KEAP1': array(['D422N', 'Deletion', 'R470C', 'R272C', 'G430C', 'S243C', 'R320Q',
       'R415G', 'V155F', 'G423V', 'Truncating Mutations', 'G186R', 'G480W',
       'G333S'], dtype=object), 'SHOC2': array(['S2G'], dtype=object), 'EZH2': array(['Y646N', 'Y646F', 'Y646H', 'Y646C', 'Y646S', 'A677G',
       'Overexpression', 'Amplification'], dtype=object), 'CDK4': array(['Amplification', 'R24C', 'K22A'], dtype=object), 'KDR': array(['Amplification', 'C482R', 'T771R', 'R961W', 'D717V', 'A1065T',
       'Q472H'], dtype=object), 'CDK6': array(['A197T', 'Amplification'], dtype=object), 'PBRM1': array(['Truncating Mutations', 'Deletion', 'N1333Gfs*'], dtype=object), 'CDK8': array(['Amplification'], dtype=object), 'RHEB': array(['Y35C', 'Y35N', 'Y35H'], dtype=object), 'NTRK1': array(['TPR-NTRK1 Fusion', 'V710A', 'R342W', 'Fusions',
       'BCAN-NTRK1 Fusion', 'NFASC-NTRK1 Fusion', 'TRKAIII Splice Variant',
       'R342Q', 'Delta-NTRK1 Fusion', 'CD74-NTRK1 Fusion',
       'TPM3-NTRK1 Fusion', 'CHTOP-NTRK1 Fusion', 'Amplification',
       'TFG-NTRK1 Fusion', 'LMNA-NTRK1 Fusion', 'MPRIP-NTRK1 Fusion'], dtype=object), 'GNAS': array(['R844H', 'Q227L', 'Q227R', 'R201H', 'R201C'], dtype=object), 'CIC': array(['R1515H', 'Truncating Mutations', 'R201W', 'Fusions',
       'CIC-DUX4 Fusion'], dtype=object), 'CDKN2B': array(['Truncating Mutations', 'Deletion', 'Hypermethylation', 'E35*',
       'P40T', 'D86N', 'A23E'], dtype=object), 'ACVR1': array(['R206H', 'G328E', 'G328V'], dtype=object), 'CDKN2C': array(['Deletion'], dtype=object), 'MSH2': array(['E580*', 'R711*', 'Deletion', 'Q337*', 'L390F', 'D603N', 'Q395*',
       'W345*', 'A636P', 'A272V', 'D603G', 'G751R', 'E483*', 'R680*',
       'R383*', 'S860L', 'Truncating Mutations', 'S723F', 'Q429*', 'Q816*',
       'R389*'], dtype=object), 'ERCC3': array(['R109*', 'Truncating Mutations'], dtype=object), 'ERCC4': array(['R689S', 'Deletion', 'E875G', 'L230P', 'Truncating Mutations',
       'S786F'], dtype=object), 'MSH6': array(['R922*', 'R976H', 'E1322*', 'T1219I', 'G39E', 'Deletion', 'R1076C',
       'R1095H', 'Truncating Mutations', 'V509A', 'E946*', 'F1088Lfs*5',
       'F1088Sfs*2'], dtype=object), 'CTNNB1': array(['D32A', 'S33F', 'S37C', 'G34E', 'D32N', 'K335I', 'S33Y', 'S45A',
       'S45Y', 'S33C', 'S37F', 'S37Y', 'H36P', 'S37A', 'G34V', 'S45del',
       'T41A', 'S33A', 'S45F', 'W383R', 'N387K', 'I35S', 'P44A', 'S45P',
       'S23R'], dtype=object), 'MAP2K2': array(['S226D', 'F57V', 'F57L', 'K101M', 'L46F', 'N126D', 'C125S', 'F57C',
       'S222D', 'E207K', 'V35M'], dtype=object), 'MAP2K1': array(['T28I', 'L115R', 'C121S', 'K57T', 'I204T', 'I111A', 'R49L', 'P387S',
       'F129L', 'Q56_V60del', 'S123T', 'Q56P', 'F53C', 'K97M',
       'E102_I103del', 'V60M', 'K59del', 'D65N', 'Amplification',
       'Q58_E62del', 'Y130C', 'F53L', 'I111P', 'P124L', 'I103N', 'I111N',
       'P124Q', 'V60E', 'V211D', 'E120Q', 'F53S', 'G128V', 'P306H',
       'I111R', 'E203K', 'E144K', 'K57E', 'N382H', 'P124S', 'L37P', 'R47Q',
       'S222D', 'D67N'], dtype=object), 'CDKN2A': array(['Q50P', 'P114L', 'E69G', 'G67R', 'L97R', 'D84G', 'Q50*', 'V126D',
       'Truncating Mutations', 'D84H', 'M53I', 'T77P', 'N71I', 'G67S',
       'N71S', 'R80P', 'D84V', 'D74N', 'D74Y', 'P81L', 'L32P', 'D108H',
       'Deletion', 'G35R', 'D84Y', 'N71K', 'R24P', 'R99P', 'A148T', 'G23D',
       'R80L', 'G35A', 'H83Y', 'A36P', 'P48L', 'D108N', 'P81T', 'G101W',
       'A57V', 'R87L', 'H83N', 'D84N', 'P114S', 'I49S', 'R112G', 'S56I',
       'R79P', 'A60V', 'R87P', 'E88K', 'G93W', 'V118D'], dtype=object), 'NOTCH2': array(['1_2009trunc', '2010_2471trunc'], dtype=object), 'MAP2K4': array(['V321M', 'R154W', 'R228K', 'P326L', 'Truncating Mutations', 'N234I',
       'R134Q', 'G85R', 'R304*'], dtype=object), 'ELF3': array(['I279P', 'R331P', 'S330A', 'S308A', 'Truncating Mutations', 'K320E'], dtype=object), 'SMARCA4': array(['R978*', 'R1189*', 'T910M', 'R1093*', 'G1232D',
       'Truncating Mutations'], dtype=object), 'H3F3A': array(['G35R', 'K28M', 'G35V'], dtype=object), 'CEBPA': array(['Truncating Mutations'], dtype=object), 'XRCC2': array(['Deletion', 'Truncating Mutations'], dtype=object), 'BCL2L11': array(['Truncating Mutations', 'Deletion'], dtype=object), 'MYOD1': array(['L122R'], dtype=object), 'HLA-A': array(['596_619splice', 'Truncating Mutations'], dtype=object), 'ALK': array(['G1128A', 'G1128S', 'L1122V', 'T1151M', 'NPM-ALK Fusion', 'G1123D',
       'D1203N', 'RANBP2-ALK Fusion', 'T1087I', 'A1200V', 'E1384K',
       'L1240V', 'I1171N', 'Y1278S', 'G1202R', 'F1174I', 'G881D', 'S1206C',
       'I1171T', 'CAD-ALK Fusion', 'L1152R', 'C1156F', 'G1123S', 'S1206R',
       'F1174L', 'S1206Y', 'STRN-ALK Fusion', 'RANBP1-ALK Fusion',
       'F1245C', 'M1328I', 'Q1064R', 'I1250T', 'Amplification', 'T1343I',
       'T1151dup', 'L1198F', 'F1245V', 'G1201E', 'V1180L', 'D1091N',
       'KIF5B-ALK Fusion', 'L1196M', 'P1139S', 'L1204F', 'G1269S',
       'G1269A', 'L1196Q', 'A1099T', 'G1286R', 'R1060H', 'D1270G',
       'E1210K', 'N1178H', 'K1062M', 'Fusions', 'R1192P', 'R1231Q',
       'V597A', 'D1349H', 'R1464*', 'I1183T', 'L1198P', 'C1156Y', 'L1152P',
       'H694R', 'I1170N', 'R1275L', 'I1170S', 'A1234T'], dtype=object), 'RAD54L': array(['G325R'], dtype=object), 'NRAS': array(['Amplification', 'T58I', 'Q61K', 'G13V', 'Q61R', 'G60E', 'G13D',
       'G12V', 'Q61H', 'T50I', 'G12D', 'Q61L'], dtype=object), 'RB1': array(['L607I', 'R698W', 'R661W', 'W563L', 'C712R', 'N480del', 'S567L',
       'C706F', 'Truncating Mutations', 'Deletion'], dtype=object), 'SDHB': array(['S100F'], dtype=object), 'MED12': array(['Exon 1 mutations', 'G44S', 'Exon 2 mutations', 'L1224F', 'Deletion'], dtype=object), 'GLI1': array(['R380A'], dtype=object), 'AURKA': array(['Amplification', 'F31I', 'K162D'], dtype=object), 'AURKB': array(['Overexpression'], dtype=object), 'RYBP': array(['Truncating Mutations'], dtype=object), 'TERT': array(['Y846C', 'C228T', 'H412Y', 'H876Q', 'Promoter Mutations', 'P704S',
       'Amplification', 'C250T'], dtype=object), 'BRAF': array(['D594N', 'E501K', 'A728V', 'A598T', 'T599_V600insV', 'L485F',
       'L485_P490del', 'L485_P490delinsF', 'A598V', 'I326V', 'Q201H',
       'N581D', 'A246P', 'G596V', 'V600R', 'V471F', 'T529M', 'V600M',
       'G478C', 'E501G', 'R462E', 'T529I', 'TRIM24-BRAF Fusion',
       'P490_Q494del', 'N581S', 'K483E', 'R506_K507insVLR',
       'FAM131B-BRAF Fusion', 'CUL1-BRAF Fusion', 'L597Q', 'R671Q',
       'L485_P490delinsY', 'KIAA1549-BRAF Fusion', 'T488_P492del', 'K601N',
       'V487_P492delinsA', 'D594Y', 'p61BRAF', 'V600E', 'AGK-BRAF Fusion',
       'T599dup', 'S151A', 'G464R', 'L597R', 'PAPSS1-BRAF Fusion', 'D594G',
       'R462I', 'G469A', 'K601Q', 'G469V', 'MKRN1-BRAF Fusion', 'I463S',
       'F468C', 'T241P', 'D594A', 'T529N', 'G466A', 'K483M', 'L597S',
       'L584F', 'W531C', 'L485_Q494del', 'D594V', 'SND1-BRAF Fusion',
       'T599_V600insEAT', 'G466E', 'G466R', 'Amplification', 'G596R',
       'G469del', 'N486_P490del', 'AKAP9-BRAF Fusion', 'Fusions', 'V600K',
       'G596C', 'E586K', 'V600D', 'G469E', 'V600G', 'V600D_K601insFGLAT',
       'D287H', 'G464V', 'K601E', 'E275K', 'K499E', 'L597V', 'D594E',
       'T599I', 'N581Y', 'G466V', 'T599R', 'Y472C', 'G464E'], dtype=object), 'DNMT3B': array(['Amplification', 'DNMT3B7'], dtype=object), 'KMT2B': array(['Truncating Mutations'], dtype=object), 'KMT2A': array(['Amplification', 'Truncating Mutations', 'Fusions'], dtype=object), 'DNMT3A': array(['R882C', 'Deletion', 'Truncating Mutations'], dtype=object), 'SMARCB1': array(['Deletion', 'Truncating Mutations'], dtype=object), 'PIM1': array(['L193F', 'N82K', 'H68Y', 'E124Q', 'E142D', 'E135K', 'Y53H', 'P123M'], dtype=object), 'SPOP': array(['F133V', 'M117V', 'R121Q', 'D140G', 'Y87C', 'P94A', 'W131A',
       'W131G', 'S119N', 'S80R', 'G75R', 'K129E', 'E50K', 'F133L', 'Y87N',
       'D130A', 'N296I', 'F102C', 'M117I', 'Truncating Mutations'], dtype=object), 'CARD11': array(['C49Y', 'F123I', 'G116S', 'G123S', 'L232LI', 'L251P', 'D387V',
       'L225LI', 'E127G', 'S250P', 'D401N', 'E626K'], dtype=object), 'RAB35': array(['F161L', 'A151T'], dtype=object), 'SRC': array(['Q531*', 'Amplification'], dtype=object), 'AR': array(['L702H', 'Q58_Q59insL', 'P380R', 'Q58L', 'ARv567es', 'A159T',
       'T878A', 'H875Y', 'G216R', 'W742L', 'W742C', 'T878S',
       'Amplification', 'L57del', 'F876L', 'L272F', 'Q120E', 'F877L',
       'G248V', 'AR-V7'], dtype=object), 'TP53BP1': array(['Truncating Mutations', 'Deletion'], dtype=object), 'EPCAM': array(["3' Deletion"], dtype=object), 'RAC1': array(['C157Y', 'P29S', 'Q61R', 'C18Y', 'Amplification', 'G12V', 'P29L'], dtype=object), 'NPM1': array(['Truncating Mutations', 'Deletion'], dtype=object), 'RASA1': array(['L481F', 'Y472H', 'Truncating Mutations', 'Deletion'], dtype=object), 'LATS1': array(['Truncating Mutations'], dtype=object), 'NF1': array(['R1391G', 'Y489C', 'R1391S', 'K1436Q', 'T1324N', 'L1301R', 'V1398D',
       'R1276P', 'S1463F', 'Truncating Mutations', 'R2450*', 'R1276Q',
       'E1356G', 'R1204W', 'Deletion', 'R1204G'], dtype=object), 'PTPRT': array(['T1365M', 'V648G', 'T844M', 'A707T', 'Promoter Hypermethylation',
       'R1343L', 'A209T', 'Y280H', 'D927G', 'N510K', 'F248S', 'L708P',
       'V995M', 'Y412F', 'F74S', 'R1040L', 'R453C', 'R1209W', 'A1022E',
       'Q984K', 'T605M', 'K218T', 'N1125I'], dtype=object), 'AXIN1': array(['Truncating Mutations'], dtype=object), 'BAP1': array(['Deletion', 'A95D', 'E685V', 'C91S', 'Truncating Mutations',
       'F170I', 'I47F', 'C91A', 'F81V'], dtype=object), 'ERBB2': array(['E812K', 'L49H', 'R678Q', 'T798M', 'D808N', 'V659E', 'S760A',
       'V839G', 'L866M', 'C630Y', 'I767M', 'S1002R', 'E719G', 'K753E',
       'E719K', 'Y772_A775dup', 'G776S', 'S310F', 'S335C', 'L785F',
       'L915M', 'C334S', 'G776delinsVC', 'G660D', 'V777A', 'K753M',
       'V842I', 'V750E', 'N319D', 'Exon 20 insertions/deletions', 'V773A',
       'E717K', 'K753A', 'T733I', 'Y835F', 'S653C', 'M774_A775insAYVM',
       'D769H', 'L726I', 'H878Y', 'Exon 20 insertions', 'Amplification',
       'L755S', 'D821N', 'G309A', 'E265K', 'D769Y', 'T798I', 'L755P',
       'Y1248F', 'P780L', 'V777L', 'S783P', 'D769A', 'C311R',
       'G776delinsLC', 'D277H', 'R143Q', 'G309E', 'E321G', 'L726F',
       'V777M', 'Y803N', 'N857S', 'D845A', 'G778_P780dup', 'S310Y',
       'V794M', 'G776_V777insYVMA'], dtype=object), 'SOX9': array(['F12L', 'Truncating Mutations', 'K398A', 'L142P', 'H65Y', 'A19V',
       'P70R'], dtype=object), 'NOTCH1': array(['I1616N', 'V1578del', 'R1598P', 'L1574P', 'V1605del', 'L1596H',
       'L1600P', 'Q2416*', 'Truncating Mutations in the PEST Domain',
       'I1680N', 'L1593P', 'A1701P', 'H2428Q',
       'Truncating Mutations Upstream of Transactivation Domain', 'L1678P',
       'R1608S', 'Fusions', 'I1616T', 'V1676D', 'P2412T', 'Q2405Rfs*17',
       'P2415del', 'R1594Q', 'SEC16A1-NOTCH1 Fusion', 'G2430A', 'F1592S',
       'P2417A', 'V1576E', 'MIR143-NOTCH1 Fusion', 'V1671I', 'A2425T'], dtype=object), 'CASP8': array(['G325A', 'Truncating Mutations', 'Deletion', 'CASP8L', 'C248T',
       'Promoter Hypermethylation'], dtype=object), 'CDK12': array(['Y901C', 'K975E', 'Deletion', 'R882L', 'W719*', 'K765R', 'L996F',
       'G909R', 'Truncating Mutations', 'D887N'], dtype=object), 'PTPRD': array(['R1088C', 'V1565I', 'Truncating Mutations', 'P1311T', 'Deletion'], dtype=object), 'FGFR4': array(['N535K', 'V550E', 'Amplification'], dtype=object), 'CREBBP': array(['R1563S', 'Deletion', 'R1446H', 'R1446C', 'Truncating Mutations',
       'W1502A', 'Q1500P'], dtype=object), 'GNA11': array(['Q209L'], dtype=object), 'FOXL2': array(['C134W', 'Amplification'], dtype=object), 'SDHC': array(['R133*'], dtype=object), 'PTCH1': array(['Truncating Mutations', 'R571W'], dtype=object), 'EWSR1': array(['EWSR1-FEV Fusion', 'EWSR1-YY1 Fusion', 'EWSR1-DDIT3 Fusion',
       'EWSR1-ETV1 Fusion', 'Fusions', 'EWSR1-FLI1 Fusion',
       'EWSR1-NR4A3 Fusion', 'EWSR1-CREB1 Fusion', 'EWSR1-ETV4 Fusion',
       'EWSR1-ATF1 Fusion', 'EWSR1-WT1 Fusion'], dtype=object), 'MET': array(['X1007_splice', 'R970C', 'Y1003C', 'V1188L', 'F1200I', 'N375S',
       'V1092I', 'Y1230C', 'Y1003F', 'Amplification', 'L1195V',
       'X963_splice', 'D1010H', 'H1094Y', 'X1009_splice', 'Overexpression',
       'Y1003*', 'Y1235D', 'H1094L', 'D1010Y', '981_1028splice', 'D1010N',
       'X1008_splice', 'H1094R', 'M1250T', 'PTPRZ1-MET Fusion', 'H1106D',
       'Y1230H', 'E168D', 'V1070E', '963_D1010splice', 'N1100Y', 'T992I'], dtype=object)}
264

No. Of Unique Variations Per Gene

In [40]:
unique_gene_variation_count={}
for key, value in sorted(unique_gene_variation.items()):
         unique_gene_variation_count[key]= len(value)
print unique_gene_variation_count
print max(unique_gene_variation_count.values())
{'ERRFI1': 1, 'ASXL2': 1, 'STK11': 13, 'MEN1': 1, 'FAM58A': 1, 'AKT1': 28, 'AKT2': 11, 'AKT3': 4, 'ARAF': 7, 'RB1': 10, 'IKBKE': 1, 'NKX2-1': 6, 'MDM2': 1, 'AXL': 1, 'RARA': 1, 'RAD51D': 1, 'MDM4': 1, 'PIK3CB': 10, 'MAP3K1': 4, 'EIF1AX': 1, 'INPP4B': 1, 'NTRK2': 3, 'GATA3': 3, 'TSC1': 20, 'HNF1A': 3, 'CDH1': 9, 'PIK3R3': 1, 'MYC': 8, 'FGFR1': 13, 'BTK': 3, 'PIK3R2': 8, 'CDKN2B': 7, 'ERBB3': 11, 'IGF1R': 3, 'CDKN2A': 52, 'NF1': 16, 'ERBB4': 20, 'BARD1': 1, 'IDH2': 5, 'IDH1': 16, 'U2AF1': 3, 'NUP93': 3, 'KLF4': 1, 'VHL': 41, 'RET': 40, 'BRD4': 2, 'PMS1': 1, 'FGFR4': 3, 'SETD2': 3, 'FGFR2': 50, 'FGFR3': 39, 'ALK': 69, 'CCND1': 13, 'ARID1B': 2, 'ARID1A': 1, 'CCND2': 1, 'B2M': 6, 'TCF7L2': 1, 'KIT': 99, 'FOXA1': 8, 'PTEN': 126, 'FAT1': 4, 'RUNX1': 24, 'RRAS2': 1, 'WHSC1': 1, 'APC': 5, 'CCND3': 7, 'CTCF': 5, 'KDM5C': 5, 'IL7R': 1, 'KMT2C': 4, 'FOXP1': 2, 'SDHC': 1, 'CDKN1B': 4, 'CDKN1A': 3, 'FUBP1': 1, 'ERG': 4, 'CARM1': 3, 'MSH2': 21, 'PTPN11': 26, 'RICTOR': 1, 'XPO1': 2, 'MYCN': 2, 'PPM1D': 1, 'MET': 33, 'ASXL1': 1, 'RAF1': 12, 'CCNE1': 2, 'JUN': 1, 'ROS1': 26, 'STAG2': 1, 'JAK2': 33, 'NF2': 11, 'H3F3A': 3, 'JAK1': 18, 'MYD88': 3, 'CHEK2': 7, 'SMAD2': 19, 'TGFBR2': 5, 'CTLA4': 1, 'TCF3': 1, 'RHOA': 17, 'ARID2': 2, 'RAD50': 3, 'RNF43': 1, 'VEGFA': 1, 'HRAS': 22, 'RAD21': 2, 'FGF4': 1, 'FGF3': 1, 'PAX8': 1, 'ETV1': 3, 'EPAS1': 8, 'MGA': 2, 'TP53': 163, 'GNAQ': 1, 'ETV6': 8, 'DDR2': 9, 'MPL': 4, 'SF3B1': 16, 'CBL': 25, 'KDM5A': 1, 'TMPRSS2': 6, 'MEF2B': 6, 'YAP1': 4, 'SHQ1': 1, 'PRDM1': 5, 'NFE2L2': 22, 'NSD1': 2, 'CREBBP': 7, 'AGO2': 5, 'PDGFRB': 23, 'PDGFRA': 60, 'PMS2': 16, 'PPP2R1A': 12, 'SMAD4': 33, 'PIK3CD': 3, 'ATM': 6, 'MAP2K1': 43, 'NTRK1': 16, 'SMO': 21, 'POLE': 10, 'ATR': 1, 'WHSC1L1': 1, 'PIM1': 8, 'ABL1': 26, 'BRIP1': 7, 'NTRK3': 5, 'IKZF1': 1, 'FLT3': 49, 'NCOR1': 1, 'FLT1': 1, 'STAT3': 8, 'NPM1': 2, 'RASA1': 4, 'FGF19': 1, 'MAPK1': 2, 'FANCC': 1, 'FANCA': 9, 'HLA-B': 1, 'DUSP4': 1, 'HLA-A': 2, 'KDR': 7, 'FBXW7': 19, 'SRC': 2, 'TGFBR1': 8, 'NFKBIA': 4, 'PPP6C': 4, 'TET1': 3, 'ERCC4': 6, 'TET2': 14, 'RXRA': 2, 'PIK3R1': 24, 'FOXL2': 2, 'RIT1': 4, 'BCOR': 5, 'ATRX': 2, 'EP300': 9, 'RAD51C': 2, 'RAD51B': 1, 'HIST1H1C': 1, 'KNSTRN': 4, 'DICER1': 9, 'ARID5B': 1, 'SOS1': 6, 'MAP2K2': 11, 'ESR1': 16, 'LATS1': 1, 'KEAP1': 14, 'SHOC2': 1, 'EZH2': 8, 'CDK4': 3, 'CDK6': 2, 'PBRM1': 3, 'CDK8': 1, 'RHEB': 3, 'GNAS': 5, 'CIC': 5, 'BRCA1': 264, 'ACVR1': 3, 'CDKN2C': 1, 'ERCC2': 19, 'ERCC3': 2, 'EGFR': 141, 'MSH6': 13, 'CTNNB1': 25, 'LATS2': 1, 'FOXO1': 1, 'BRCA2': 125, 'NOTCH2': 2, 'MAP2K4': 9, 'ELF3': 6, 'SMARCA4': 6, 'CEBPA': 1, 'XRCC2': 2, 'BCL2L11': 2, 'MYOD1': 1, 'AXIN1': 1, 'CDK12': 10, 'NRAS': 12, 'MLH1': 35, 'SDHB': 1, 'MED12': 5, 'GLI1': 1, 'AURKA': 3, 'AURKB': 1, 'RYBP': 1, 'TERT': 8, 'SRSF2': 1, 'PAK1': 1, 'DNMT3B': 2, 'KMT2B': 1, 'KMT2A': 3, 'DNMT3A': 3, 'SMARCB1': 2, 'KMT2D': 2, 'SPOP': 20, 'RBM10': 3, 'CARD11': 12, 'RAB35': 2, 'SMAD3': 23, 'AR': 20, 'TP53BP1': 2, 'PIK3CA': 56, 'TSC2': 47, 'KDM6A': 2, 'EPCAM': 1, 'RAC1': 7, 'KRAS': 44, 'BCL10': 2, 'PTPRT': 23, 'ERBB2': 69, 'SOX9': 7, 'NOTCH1': 31, 'CASP8': 6, 'RAD54L': 1, 'PTPRD': 5, 'BRAF': 93, 'GNA11': 1, 'MTOR': 45, 'BAP1': 9, 'PTCH1': 2, 'EWSR1': 11, 'BCL2': 1}
264

Plot Of No. Unique Variations Per Gene

In [41]:
plt.figure()
x=np.arange(len(unique_gene_variation_count))
plt.bar(x,unique_gene_variation_count.values(),align='center')
plt.xticks(x,unique_gene_variation_count.keys(),rotation=90)
ymax=max(unique_gene_variation_count.values())+1
plt.ylim(0,ymax)

plt.title("No. of Variations per Gene")

plt.show()

Observation

Out of 264 unique genes values in training data, 104 genes occur in a class of mutation.

In [42]:
unique_variation_gene={}
for variation in unique_variation:
    data=train.loc[train.Variation==variation]
    unique_variation_gene[variation]=data.Gene.unique()
print unique_variation_gene
{'Y62C': array(['PTPN11'], dtype=object), 'D399N': array(['AKT2'], dtype=object), 'V561D': array(['PDGFRA'], dtype=object), 'Q531*': array(['SRC'], dtype=object), 'E1586G': array(['BRCA1'], dtype=object), 'SRGAP3-RAF1 Fusion': array(['RAF1'], dtype=object), 'H93Q': array(['PTEN'], dtype=object), 'H93R': array(['PTEN'], dtype=object), 'K650T': array(['FGFR3'], dtype=object), 'V564I': array(['FGFR2'], dtype=object), 'H93D': array(['PTEN'], dtype=object), 'L1195V': array(['MET'], dtype=object), 'K4E': array(['MEF2B'], dtype=object), 'R2505Q': array(['MTOR'], dtype=object), 'E203K': array(['MAP2K1'], dtype=object), 'E2856A': array(['BRCA2'], dtype=object), 'P380R': array(['AR'], dtype=object), 'R2505*': array(['MTOR'], dtype=object), 'S786F': array(['ERCC4'], dtype=object), 'Y1463S': array(['MTOR'], dtype=object), 'N581Y': array(['BRAF'], dtype=object), 'D537Y': array(['SMAD4'], dtype=object), 'S32I': array(['NFKBIA'], dtype=object), 'K650R': array(['FGFR3'], dtype=object), 'N581S': array(['BRAF'], dtype=object), 'MAGI3-AKT3 Fusion': array(['AKT3'], dtype=object), 'A502_Y503dup': array(['KIT'], dtype=object), 'H697Y': array(['KIT'], dtype=object), 'Y53H': array(['PIM1'], dtype=object), 'L2396F': array(['BRCA2'], dtype=object), 'R2450*': array(['NF1'], dtype=object), 'Y24C': array(['ERCC2'], dtype=object), 'N581D': array(['BRAF'], dtype=object), 'S270L': array(['PPP6C'], dtype=object), 'D289del': array(['CCND1'], dtype=object), 'V1188L': array(['MET'], dtype=object), 'L191H': array(['TSC1'], dtype=object), 'H118P': array(['PTEN'], dtype=object), 'L785F': array(['ERBB2'], dtype=object), 'G106_R108del': array(['PIK3CA'], dtype=object), 'EP300-MOZ Fusion': array(['EP300'], dtype=object), 'D603N': array(['MSH2'], dtype=object), 'G480W': array(['KEAP1'], dtype=object), 'D289_D292del': array(['CCND1'], dtype=object), 'F133L': array(['SPOP'], dtype=object), 'D351H': array(['SMAD4'], dtype=object), 'D603G': array(['MSH2'], dtype=object), 'F460L': array(['SMO'], dtype=object), 'G17V': array(['RHOA'], dtype=object), 'S860L': array(['MSH2'], dtype=object), 'P86H': array(['VHL'], dtype=object), 'N345I': array(['PIK3CA'], dtype=object), 'G1738R': array(['BRCA1'], dtype=object), 'I853T': array(['EGFR'], dtype=object), 'C450_K451insMIEWMI': array(['PDGFRA'], dtype=object), 'V411L': array(['POLE'], dtype=object), 'T1324N': array(['NF1'], dtype=object), 'G17A': array(['RHOA'], dtype=object), 'V555_V559del': array(['KIT'], dtype=object), 'G776S': array(['ERBB2'], dtype=object), 'GPIAP1-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'E598_Y599insGLVQVTGSSDNEYFYVDFREYE': array(['FLT3'], dtype=object), 'P124Q': array(['MAP2K1'], dtype=object), 'G165V': array(['PTEN'], dtype=object), 'P124S': array(['MAP2K1'], dtype=object), 'G829R': array(['PDGFRA'], dtype=object), 'G165R': array(['PTEN'], dtype=object), 'R496C': array(['BRCA1'], dtype=object), 'D835del': array(['FLT3'], dtype=object), 'K558_E562del': array(['KIT'], dtype=object), 'R496H': array(['SMAD4'], dtype=object), 'R383*': array(['MSH2'], dtype=object), 'G165E': array(['PTEN'], dtype=object), 'R970C': array(['MET'], dtype=object), 'P153H': array(['TP53'], dtype=object), 'D808N': array(['ERBB2'], dtype=object), 'G1743R': array(['BRCA1'], dtype=object), 'V118D': array(['CDKN2A'], dtype=object), 'TRA-NKX2-1 Fusion': array(['NKX2-1'], dtype=object), 'P106L': array(['MPL'], dtype=object), 'S562L': array(['FBXW7'], dtype=object), 'A209T': array(['PTPRT'], dtype=object), 'L387M': array(['ABL1'], dtype=object), 'M1008I': array(['BRCA1'], dtype=object), 'C630Y': array(['ERBB2'], dtype=object), 'Y801H': array(['EGFR'], dtype=object), 'E719G': array(['ERBB2'], dtype=object), 'E719K': array(['ERBB2'], dtype=object), 'R283Q': array(['TP53'], dtype=object), 'K509I': array(['KIT'], dtype=object), 'I391M': array(['PIK3CA'], dtype=object), 'Q227L': array(['GNAS'], dtype=object), 'A750_E758delinsP': array(['EGFR'], dtype=object), 'T599_V600insV': array(['BRAF'], dtype=object), 'E746G': array(['EGFR'], dtype=object), 'D837N': array(['EGFR'], dtype=object), 'P179L': array(['PPP2R1A'], dtype=object), 'D162G': array(['PTEN'], dtype=object), 'E746Q': array(['EGFR'], dtype=object), 'E746V': array(['EGFR'], dtype=object), 'P179R': array(['PPP2R1A'], dtype=object), 'RET-CCDC6 Fusion': array(['RET'], dtype=object), 'K59del': array(['MAP2K1'], dtype=object), 'R18H': array(['BRCA2'], dtype=object), 'E1935G': array(['ROS1'], dtype=object), 'G423V': array(['KEAP1'], dtype=object), 'K11R': array(['FBXW7'], dtype=object), 'L559R': array(['MLH1'], dtype=object), 'G423R': array(['FBXW7'], dtype=object), 'R246K': array(['TSC1'], dtype=object), 'T790M': array(['EGFR'], dtype=object), 'A1685S': array(['KMT2C'], dtype=object), 'V1804D': array(['BRCA1'], dtype=object), 'M224R': array(['TSC1'], dtype=object), 'C381A': array(['CBL'], dtype=object), 'C384R': array(['CBL'], dtype=object), 'E69K': array(['PTPN11'], dtype=object), 'V321M': array(['MAP2K4', 'SMO'], dtype=object), 'H68Y': array(['PIM1'], dtype=object), 'K666M': array(['SF3B1'], dtype=object), 'K666N': array(['SF3B1'], dtype=object), 'G35V': array(['H3F3A'], dtype=object), 'R108H': array(['PIK3CA'], dtype=object), 'S215C': array(['TP53'], dtype=object), 'V14I': array(['KRAS'], dtype=object), 'R108K': array(['EGFR'], dtype=object), 'S215G': array(['TP53'], dtype=object), 'F876L': array(['AR'], dtype=object), 'G35A': array(['CDKN2A'], dtype=object), 'C44Y': array(['BRCA1'], dtype=object), 'N375S': array(['MET'], dtype=object), 'E1735K': array(['BRCA1'], dtype=object), 'K666R': array(['SF3B1'], dtype=object), 'Y578C': array(['KIT'], dtype=object), 'K666T': array(['SF3B1'], dtype=object), 'M1328I': array(['ALK'], dtype=object), 'D1739G': array(['BRCA1'], dtype=object), 'D1739E': array(['BRCA1'], dtype=object), 'A2643G': array(['BRCA2'], dtype=object), 'G75R': array(['SPOP'], dtype=object), 'D1739V': array(['BRCA1'], dtype=object), 'D419del': array(['KIT'], dtype=object), 'P133T': array(['CDKN1B'], dtype=object), 'IGH-MYC Fusion': array(['MYC'], dtype=object), 'L584F': array(['BRAF'], dtype=object), 'Y155C': array(['PTEN'], dtype=object), 'D1739Y': array(['BRCA1'], dtype=object), 'R2318Q': array(['BRCA2'], dtype=object), 'L424V': array(['POLE'], dtype=object), 'T77M': array(['BRCA1'], dtype=object), 'N510K': array(['PTPRT'], dtype=object), 'E144K': array(['MAP2K1'], dtype=object), 'T77P': array(['CDKN2A'], dtype=object), 'E60L': array(['BRCA1'], dtype=object), 'L424I': array(['POLE'], dtype=object), 'L158V': array(['VHL'], dtype=object), 'L1678P': array(['NOTCH1'], dtype=object), 'P1819S': array(['BRCA2'], dtype=object), 'G271E': array(['FGFR2'], dtype=object), 'L158Q': array(['VHL'], dtype=object), 'L158P': array(['VHL'], dtype=object), 'R2602T': array(['BRCA2'], dtype=object), 'R698W': array(['RB1'], dtype=object), 'Y849S': array(['PDGFRA'], dtype=object), 'S904F': array(['RET'], dtype=object), 'C135Y': array(['TP53'], dtype=object), 'C135S': array(['TP53'], dtype=object), 'C135R': array(['TP53'], dtype=object), 'Y849C': array(['PDGFRA'], dtype=object), 'D1349H': array(['ALK'], dtype=object), 'V1713A': array(['BRCA1'], dtype=object), 'S1486C': array(['BRCA1'], dtype=object), 'T529I': array(['BRAF'], dtype=object), 'AGK-BRAF Fusion': array(['BRAF'], dtype=object), 'T529N': array(['BRAF'], dtype=object), 'T529M': array(['BRAF'], dtype=object), 'I90T': array(['BRCA1'], dtype=object), 'S1036P': array(['TSC2'], dtype=object), 'F71I': array(['PTPN11'], dtype=object), 'S371C': array(['FGFR3'], dtype=object), 'G505S': array(['DDR2'], dtype=object), 'F311L': array(['ABL1'], dtype=object), 'S752_I759del': array(['EGFR'], dtype=object), 'K700R': array(['SF3B1'], dtype=object), 'K700E': array(['SF3B1'], dtype=object), 'T17A': array(['CTLA4'], dtype=object), 'F1200I': array(['MET'], dtype=object), 'S33C': array(['CTNNB1'], dtype=object), 'S33A': array(['CTNNB1'], dtype=object), 'F119S': array(['VHL'], dtype=object), 'V2908G': array(['BRCA2'], dtype=object), 'Y1003*': array(['MET'], dtype=object), 'N870S': array(['PDGFRA'], dtype=object), 'Y1295A': array(['TET2'], dtype=object), 'L78T': array(['AKT1'], dtype=object), 'Y513A': array(['SMAD4'], dtype=object), 'S33Y': array(['CTNNB1'], dtype=object), 'S102F': array(['HIST1H1C'], dtype=object), 'L388M': array(['FOXA1'], dtype=object), 'ETV6-FLT3 Fusion': array(['ETV6'], dtype=object), 'G328E': array(['ACVR1'], dtype=object), 'D96N': array(['BRCA1'], dtype=object), 'G328V': array(['ACVR1'], dtype=object), 'G106V': array(['PIK3CA'], dtype=object), 'T160I': array(['PTEN'], dtype=object), 'Overexpression': array(['CCNE1', 'AURKB', 'KDM5C', 'EZH2', 'MET', 'RHOA'], dtype=object), 'H2074N': array(['BRCA2'], dtype=object), 'N561D': array(['PIK3R2'], dtype=object), 'G1656D': array(['BRCA1'], dtype=object), 'S427G': array(['RAF1'], dtype=object), 'N116H': array(['KRAS'], dtype=object), 'S462Y': array(['FBXW7'], dtype=object), 'Y1003C': array(['MET'], dtype=object), 'P287A': array(['CCND1'], dtype=object), 'ESRP1-RAF1 Fusion': array(['RAF1'], dtype=object), 'Y1003F': array(['MET'], dtype=object), 'N841I': array(['FLT3'], dtype=object), 'E1021K': array(['PIK3CD'], dtype=object), 'V248D': array(['FGFR2'], dtype=object), 'P287S': array(['CCND1'], dtype=object), 'T605M': array(['PTPRT'], dtype=object), 'P287T': array(['CCND1'], dtype=object), 'C628Y': array(['EGFR'], dtype=object), 'TRKAIII Splice Variant': array(['NTRK1'], dtype=object), 'Y98N': array(['VHL'], dtype=object), 'L2427R': array(['MTOR'], dtype=object), 'R882L': array(['CDK12'], dtype=object), 'L1584R': array(['TSC2'], dtype=object), 'R882C': array(['DNMT3A'], dtype=object), 'R177Pfs*126': array(['GATA3'], dtype=object), 'L861P': array(['EGFR'], dtype=object), 'L861Q': array(['EGFR'], dtype=object), 'L861R': array(['EGFR'], dtype=object), 'G697C': array(['FGFR3'], dtype=object), '422_605trunc': array(['PPM1D'], dtype=object), 'S217T': array(['CARM1'], dtype=object), 'R251Q': array(['FGFR2'], dtype=object), 'H412Y': array(['TERT'], dtype=object), 'R80L': array(['CDKN2A'], dtype=object), 'F384L': array(['FGFR3', 'STAT3'], dtype=object), 'C61G': array(['BRCA1'], dtype=object), 'I1018W': array(['JAK2'], dtype=object), 'K513R': array(['BRCA2'], dtype=object), 'F384Y': array(['STAT3'], dtype=object), 'F359C': array(['ABL1'], dtype=object), 'D121G': array(['VHL'], dtype=object), 'D520N': array(['FBXW7'], dtype=object), 'I1018F': array(['JAK2'], dtype=object), 'F384V': array(['STAT3'], dtype=object), 'G60E': array(['NRAS'], dtype=object), 'Q12Y': array(['BRCA1'], dtype=object), 'D493A': array(['SMAD4'], dtype=object), 'R373Q': array(['SMAD3'], dtype=object), 'FGFR2-AHCYL1 Fusion': array(['FGFR2'], dtype=object), 'T1151dup': array(['ALK'], dtype=object), 'Y34C': array(['RHOA'], dtype=object), 'M1_E165DEL': array(['KDM5C'], dtype=object), 'L43V': array(['SMAD4'], dtype=object), 'R373H': array(['SMAD3'], dtype=object), 'A161T': array(['TP53'], dtype=object), 'A161S': array(['TP53'], dtype=object), 'Q331R': array(['TP53'], dtype=object), 'D631G': array(['RET'], dtype=object), 'Q331H': array(['TP53'], dtype=object), 'D544H': array(['JAK2'], dtype=object), 'I1680N': array(['NOTCH1'], dtype=object), 'P387S': array(['MAP2K1'], dtype=object), 'S24F': array(['KNSTRN'], dtype=object), 'Y1230H': array(['MET'], dtype=object), 'R1589P': array(['BRCA1'], dtype=object), 'DNA binding domain missense mutations': array(['TP53'], dtype=object), 'D814V': array(['KIT'], dtype=object), 'R482Q': array(['FBXW7'], dtype=object), 'L348F': array(['TP53'], dtype=object), 'R167Q': array(['VHL'], dtype=object), 'N53A': array(['AKT1'], dtype=object), 'Q58_Q59insL': array(['AR'], dtype=object), 'P1806A': array(['BRCA1'], dtype=object), 'E1384K': array(['ALK'], dtype=object), 'R1589H': array(['BRCA1'], dtype=object), 'S243N': array(['MYD88'], dtype=object), 'KIF5B-PDGFRA Fusion': array(['PDGFRA'], dtype=object), 'Y1278S': array(['ALK'], dtype=object), 'NPM-ALK Fusion': array(['ALK'], dtype=object), 'L202F': array(['AKT1'], dtype=object), 'A889P': array(['TSC2'], dtype=object), 'G128V': array(['MAP2K1'], dtype=object), 'E475K': array(['FGFR2'], dtype=object), 'G81S': array(['NFE2L2'], dtype=object), 'A1519T': array(['MTOR'], dtype=object), 'R217C': array(['MLH1'], dtype=object), 'P2412T': array(['NOTCH1'], dtype=object), 'G1788D': array(['BRCA1'], dtype=object), 'F1088Sfs*2': array(['MSH6'], dtype=object), 'S1297del': array(['BRCA1'], dtype=object), 'E275K': array(['BRAF'], dtype=object), 'Q1785H': array(['BRCA1'], dtype=object), 'G1035S': array(['TSC1'], dtype=object), 'M784V': array(['BRCA2'], dtype=object), 'G1788V': array(['BRCA1'], dtype=object), 'N334K': array(['PIK3CD'], dtype=object), 'H396P': array(['ABL1'], dtype=object), 'R537P': array(['TGFBR2'], dtype=object), 'S1498N': array(['TSC2'], dtype=object), 'H65Y': array(['SOX9'], dtype=object), 'G17E': array(['RHOA'], dtype=object), 'BIN2-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'P47S': array(['TP53'], dtype=object), 'TMPRSS2-ETV5 Fusion': array(['TMPRSS2'], dtype=object), 'P47A': array(['BRIP1'], dtype=object), 'K1702E': array(['BRCA1'], dtype=object), 'C311R': array(['ERBB2'], dtype=object), 'PVT1-MYC Fusion': array(['MYC'], dtype=object), 'R132G': array(['IDH1'], dtype=object), 'L611V': array(['JAK2'], dtype=object), 'R132C': array(['IDH1'], dtype=object), 'S1039F': array(['JAK2'], dtype=object), 'T599R': array(['BRAF'], dtype=object), 'S153R': array(['BRCA1'], dtype=object), 'L234fs': array(['RAD50'], dtype=object), 'R132H': array(['IDH1'], dtype=object), 'FIG-ROS1 Fusion': array(['ROS1'], dtype=object), 'R678Q': array(['ERBB2'], dtype=object), 'R132Q': array(['IDH1'], dtype=object), 'G1738E': array(['BRCA1'], dtype=object), 'A39P': array(['PTEN'], dtype=object), 'N126D': array(['MAP2K2'], dtype=object), 'D101Y': array(['FGFR2'], dtype=object), 'R203C': array(['FGFR2'], dtype=object), 'E866K': array(['EGFR'], dtype=object), 'D587H': array(['EGFR'], dtype=object), 'T417I': array(['TSC1'], dtype=object), 'V677I': array(['FGFR3'], dtype=object), 'R15K': array(['PTEN'], dtype=object), 'R15S': array(['PTEN'], dtype=object), 'G370C': array(['FGFR3'], dtype=object), 'T263P': array(['EGFR'], dtype=object), 'V344G': array(['PIK3CA'], dtype=object), 'Y537C': array(['ESR1'], dtype=object), 'V344A': array(['PIK3CA'], dtype=object), 'H597Y': array(['TSC2'], dtype=object), 'P25L': array(['VHL'], dtype=object), 'Y537S': array(['ESR1'], dtype=object), 'E75G': array(['TSC2'], dtype=object), 'R121Q': array(['SPOP'], dtype=object), 'L30F': array(['NFE2L2'], dtype=object), 'A41P': array(['MTOR'], dtype=object), 'V155F': array(['KEAP1'], dtype=object), 'N71S': array(['CDKN2A'], dtype=object), 'FGFR1-TACC1 Fusion': array(['FGFR1'], dtype=object), 'E1978*': array(['ATM'], dtype=object), 'P780L': array(['ERBB2'], dtype=object), 'E1286V': array(['MAP3K1'], dtype=object), 'SLC34A2-ROS1 Fusion': array(['ROS1'], dtype=object), 'G129R': array(['PTEN'], dtype=object), 'G776delinsLC': array(['ERBB2'], dtype=object), 'R2888C': array(['BRCA2'], dtype=object), 'F893L': array(['RET'], dtype=object), 'N71K': array(['CDKN2A'], dtype=object), 'N71I': array(['CDKN2A'], dtype=object), 'E868G': array(['EGFR'], dtype=object), 'V769E': array(['TSC2'], dtype=object), 'H870R': array(['EGFR'], dtype=object), 'R987W': array(['PDGFRB'], dtype=object), 'V769M': array(['EGFR'], dtype=object), 'Q579R': array(['PDGFRA'], dtype=object), 'PCM1-JAK2 Fusion': array(['JAK2'], dtype=object), 'S768_D770dup': array(['EGFR'], dtype=object), 'V1808A': array(['BRCA1'], dtype=object), 'R711*': array(['MSH2'], dtype=object), 'P284L': array(['CCND3'], dtype=object), 'A1374V': array(['IGF1R'], dtype=object), 'I867S': array(['FLT3'], dtype=object), 'A1234T': array(['ALK'], dtype=object), 'Y42C': array(['RHOA', 'BRCA2'], dtype=object), 'K292T': array(['TP53'], dtype=object), 'Q2416*': array(['NOTCH1'], dtype=object), 'C248T': array(['CASP8'], dtype=object), 'X1009_splice': array(['MET'], dtype=object), 'V716M': array(['MLH1'], dtype=object), 'V1809F': array(['BRCA1'], dtype=object), 'D842Y': array(['PDGFRA'], dtype=object), 'V1809A': array(['BRCA1'], dtype=object), 'I122L': array(['PTEN'], dtype=object), 'D842V': array(['PDGFRA'], dtype=object), 'T878S': array(['AR'], dtype=object), 'R1097H': array(['TSC1'], dtype=object), 'D842I': array(['PDGFRA'], dtype=object), 'Y599F': array(['FLT3'], dtype=object), 'I122S': array(['PTEN'], dtype=object), 'E598_Y599insDVDFREYE': array(['FLT3'], dtype=object), 'L1951R': array(['ROS1'], dtype=object), 'L1407P': array(['BRCA1'], dtype=object), 'Q58_E62del': array(['MAP2K1'], dtype=object), 'V774A': array(['EGFR'], dtype=object), 'W1837C': array(['BRCA1'], dtype=object), 'Y640F': array(['STAT3'], dtype=object), 'P124L': array(['MAP2K1'], dtype=object), 'V774M': array(['EGFR'], dtype=object), 'F134Y': array(['TP53'], dtype=object), 'Exon 1 mutations': array(['MED12'], dtype=object), 'S121A': array(['TP53'], dtype=object), 'DNA binding domain insertions': array(['TP53'], dtype=object), 'R2304C': array(['BRCA2'], dtype=object), 'C134W': array(['FOXL2'], dtype=object), 'G469del': array(['BRAF'], dtype=object), 'P219S': array(['TP53'], dtype=object), 'S121C': array(['TP53'], dtype=object), 'R321Q': array(['SMAD2'], dtype=object), 'AKAP9-BRAF Fusion': array(['BRAF'], dtype=object), 'Y35C': array(['RHEB'], dtype=object), 'K111E': array(['PIK3CA'], dtype=object), 'Y35H': array(['RHEB'], dtype=object), 'S464L': array(['EGFR'], dtype=object), 'K111N': array(['PIK3CA'], dtype=object), 'V384D': array(['MLH1'], dtype=object), 'V506A': array(['MLH1'], dtype=object), 'R369Q': array(['ETV6'], dtype=object), 'R369W': array(['ETV6'], dtype=object), 'E258V': array(['TP53'], dtype=object), 'T599dup': array(['BRAF'], dtype=object), 'S1651P': array(['BRCA1'], dtype=object), 'G244S': array(['TP53'], dtype=object), 'A648T': array(['FGFR2'], dtype=object), 'Q252H': array(['ABL1'], dtype=object), 'S65L': array(['VHL'], dtype=object), 'F1695L': array(['BRCA1'], dtype=object), 'S65N': array(['KRAS'], dtype=object), 'S784F': array(['EGFR'], dtype=object), 'C569Y': array(['PRDM1'], dtype=object), 'CEP85L-ROS1 Fusion': array(['ROS1'], dtype=object), 'F1592S': array(['NOTCH1'], dtype=object), 'G305W': array(['TSC1'], dtype=object), 'S65A': array(['VHL'], dtype=object), 'G305R': array(['TSC1'], dtype=object), 'G250E': array(['ABL1'], dtype=object), 'R310H': array(['SMAD2'], dtype=object), 'S276L': array(['SMAD2'], dtype=object), 'A111P': array(['MLH1'], dtype=object), 'S65W': array(['VHL'], dtype=object), 'P326L': array(['MAP2K4'], dtype=object), 'Y1294A': array(['TET2'], dtype=object), 'A159T': array(['AR'], dtype=object), 'K120E': array(['TP53'], dtype=object), 'D84Y': array(['CDKN2A'], dtype=object), 'K409Q': array(['KLF4'], dtype=object), 'K120N': array(['TP53'], dtype=object), 'E622D': array(['SF3B1'], dtype=object), 'V1671I': array(['NOTCH1'], dtype=object), 'D84N': array(['CDKN2A'], dtype=object), 'D84H': array(['CDKN2A'], dtype=object), 'EWSR1-WT1 Fusion': array(['EWSR1'], dtype=object), 'E580*': array(['MSH2'], dtype=object), 'D384N': array(['SMO'], dtype=object), 'D84G': array(['CDKN2A'], dtype=object), 'E622Q': array(['SF3B1'], dtype=object), 'X475_splice': array(['PIK3R1'], dtype=object), 'N319D': array(['ERBB2'], dtype=object), 'G774V': array(['DDR2'], dtype=object), 'R276W': array(['RARA'], dtype=object), 'P142H': array(['BRCA1'], dtype=object), 'I99M': array(['IDH1'], dtype=object), 'L1240V': array(['ALK'], dtype=object), '2010_2471trunc': array(['NOTCH2'], dtype=object), 'S459del': array(['POLE'], dtype=object), 'N82K': array(['PIM1'], dtype=object), 'N319T': array(['BRCA2'], dtype=object), 'R49C': array(['IDH1'], dtype=object), 'C630R': array(['RET'], dtype=object), 'Truncating Mutations Upstream of Transactivation Domain': array(['NOTCH1'], dtype=object), 'R1446C': array(['CREBBP'], dtype=object), 'F347L': array(['PTEN'], dtype=object), 'F615S': array(['TSC2'], dtype=object), 'R1189*': array(['SMARCA4'], dtype=object), 'P40T': array(['CDKN2B'], dtype=object), 'L345Q': array(['PTEN'], dtype=object), 'E884K': array(['EGFR'], dtype=object), 'R2430M': array(['MTOR'], dtype=object), 'F568fs': array(['ERCC2'], dtype=object), 'KDR-PDGFRA Fusion': array(['PDGFRA'], dtype=object), 'A41T': array(['MTOR'], dtype=object), 'G719C': array(['EGFR'], dtype=object), 'E731K': array(['FGFR2'], dtype=object), 'P151S': array(['TP53'], dtype=object), 'C696Y': array(['TSC2'], dtype=object), 'P151H': array(['TP53'], dtype=object), 'G163D': array(['STK11'], dtype=object), 'BCAN-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'P151A': array(['TP53'], dtype=object), 'A197T': array(['CDK6'], dtype=object), 'IGH-NKX2 Fusion': array(['NKX2-1'], dtype=object), 'M117I': array(['SPOP'], dtype=object), 'E542G': array(['PIK3CA'], dtype=object), 'K128T': array(['PTEN'], dtype=object), 'L747_P753del': array(['EGFR'], dtype=object), 'K806A': array(['EGFR'], dtype=object), 'Q22E': array(['KRAS'], dtype=object), 'Q1396R': array(['BRCA2'], dtype=object), 'Q22K': array(['HRAS', 'KRAS'], dtype=object), 'D770_N771insNPG': array(['EGFR'], dtype=object), 'T1623I': array(['TSC2'], dtype=object), 'S80R': array(['SPOP'], dtype=object), 'V559_V560del': array(['KIT'], dtype=object), 'F400I': array(['FOXA1'], dtype=object), 'L321A': array(['AKT1'], dtype=object), 'T244_I245insCPT': array(['IL7R'], dtype=object), 'D277H': array(['ERBB2'], dtype=object), 'H68R': array(['TSC1'], dtype=object), 'P1812R': array(['BRCA1'], dtype=object), 'P1812S': array(['BRCA1'], dtype=object), 'BCR-ABL1 Fusion': array(['ABL1'], dtype=object), 'T37A': array(['RHOA'], dtype=object), 'N239S': array(['TP53'], dtype=object), 'E76K': array(['PTPN11'], dtype=object), 'T37R': array(['BRCA1'], dtype=object), 'E76A': array(['PTPN11'], dtype=object), 'Exon 11 mutations': array(['KIT'], dtype=object), 'R905G': array(['TSC2'], dtype=object), 'G42R': array(['RUNX1'], dtype=object), 'V1378I': array(['BRCA1'], dtype=object), 'I448V': array(['RAF1'], dtype=object), 'R905Q': array(['TSC2'], dtype=object), 'R905W': array(['TSC2'], dtype=object), 'Y253H': array(['ABL1'], dtype=object), 'Y253F': array(['ABL1'], dtype=object), 'R1076C': array(['MSH6'], dtype=object), 'V369G': array(['PTEN'], dtype=object), 'M1255I': array(['IGF1R'], dtype=object), 'Exon 19 deletion': array(['EGFR'], dtype=object), 'R166Q': array(['RUNX1'], dtype=object), 'K28M': array(['H3F3A'], dtype=object), 'G60D': array(['KRAS'], dtype=object), 'I21V': array(['BRCA1'], dtype=object), 'R552G': array(['SOS1'], dtype=object), 'S217A': array(['CARM1'], dtype=object), 'S217C': array(['CARM1'], dtype=object), 'N217I': array(['ARAF'], dtype=object), 'G60R': array(['KRAS'], dtype=object), 'I2675V': array(['BRCA2'], dtype=object), 'S170R': array(['PTEN'], dtype=object), 'ERLIN2?FGFR1 Fusion': array(['FGFR1'], dtype=object), 'E734Q': array(['EGFR'], dtype=object), 'S760A': array(['ERBB2'], dtype=object), 'G129E': array(['PTEN'], dtype=object), 'G129A': array(['PTEN'], dtype=object), 'CPEB1-NTRK3 Fusion': array(['NTRK3'], dtype=object), 'E685V': array(['BAP1'], dtype=object), 'R23A': array(['AKT1'], dtype=object), 'S170N': array(['PTEN'], dtype=object), 'Q367P': array(['CBL'], dtype=object), 'T785A': array(['EGFR'], dtype=object), 'T286A': array(['CCND3', 'CCND1'], dtype=object), 'E41A': array(['PMS2'], dtype=object), 'S1424C': array(['BRCA2'], dtype=object), 'S226D': array(['MAP2K2'], dtype=object), 'T286I': array(['CCND1'], dtype=object), 'Y598C': array(['TSC2'], dtype=object), 'SND1-BRAF Fusion': array(['BRAF'], dtype=object), 'T401I': array(['PTEN'], dtype=object), 'PAX5-JAK2 Fusion': array(['JAK2'], dtype=object), 'D32H': array(['AKT2'], dtype=object), 'R465H': array(['FBXW7'], dtype=object), 'T1025S': array(['PIK3CA'], dtype=object), 'D32N': array(['CTNNB1'], dtype=object), 'D32A': array(['CTNNB1'], dtype=object), 'R133*': array(['SDHC'], dtype=object), 'R812A': array(['AGO2'], dtype=object), 'D32Y': array(['AKT1'], dtype=object), 'H650Q': array(['PDGFRA'], dtype=object), 'YAP1-FAM118B Fusion': array(['YAP1'], dtype=object), 'C157Y': array(['RAC1'], dtype=object), 'Y553_Q556del': array(['KIT'], dtype=object), 'V289F': array(['ABL1'], dtype=object), 'R453C': array(['PTPRT'], dtype=object), 'S3660L': array(['KMT2C'], dtype=object), 'H773_V774insH': array(['EGFR'], dtype=object), 'G264S': array(['RAD51C'], dtype=object), 'D1270G': array(['ALK'], dtype=object), 'W345*': array(['MSH2'], dtype=object), 'R133H': array(['BRCA1'], dtype=object), 'D661V': array(['STAT3'], dtype=object), 'S45del': array(['CTNNB1'], dtype=object), 'G796S': array(['EGFR'], dtype=object), 'W183A': array(['FANCA'], dtype=object), 'T19N': array(['RHOA'], dtype=object), 'R583A': array(['AGO2'], dtype=object), 'I836del': array(['FLT3'], dtype=object), 'M351T': array(['ABL1'], dtype=object), 'C443Y': array(['KIT'], dtype=object), 'Y591D': array(['FLT3'], dtype=object), 'Y375C': array(['FGFR2'], dtype=object), 'E78K': array(['PIK3CA'], dtype=object), 'D92A': array(['PTEN'], dtype=object), 'D92G': array(['PTEN'], dtype=object), 'D92E': array(['PTEN'], dtype=object), 'T1720A': array(['BRCA1'], dtype=object), 'A272V': array(['MSH2'], dtype=object), 'D92H': array(['PTEN'], dtype=object), 'D92N': array(['PTEN'], dtype=object), 'D92V': array(['PTEN'], dtype=object), 'S31R': array(['CDKN1A'], dtype=object), 'R625L': array(['SF3B1'], dtype=object), 'M535I': array(['FGFR2'], dtype=object), 'R625H': array(['SF3B1'], dtype=object), 'R2520Q': array(['BRCA2'], dtype=object), 'R625D': array(['SF3B1'], dtype=object), 'R1625C': array(['SETD2'], dtype=object), 'R625G': array(['SF3B1'], dtype=object), 'E525K': array(['PIK3CD'], dtype=object), 'R625C': array(['SF3B1'], dtype=object), 'Q144R': array(['TP53'], dtype=object), 'Q58L': array(['AR'], dtype=object), 'F594L': array(['FLT3'], dtype=object), 'H115N': array(['TP53'], dtype=object), 'I653T': array(['KIT'], dtype=object), 'T123A': array(['TP53'], dtype=object), 'V1833M': array(['BRCA1'], dtype=object), 'G199R': array(['TP53'], dtype=object), 'R420H': array(['SMAD4'], dtype=object), 'V1833E': array(['BRCA1'], dtype=object), 'E1644G': array(['BRCA1'], dtype=object), 'EZR-ERBB4 Fusion': array(['ERBB4'], dtype=object), 'N1647K': array(['BRCA1'], dtype=object), 'FGFR2-TACC3 Fusion': array(['FGFR2'], dtype=object), 'D323H': array(['AKT1'], dtype=object), 'S425C': array(['SMAD3'], dtype=object), 'H643D': array(['FGFR3'], dtype=object), 'R1204W': array(['NF1'], dtype=object), 'R689S': array(['ERCC4'], dtype=object), 'R689Q': array(['FBXW7'], dtype=object), 'G311D': array(['AKT1'], dtype=object), 'R1204G': array(['NF1'], dtype=object), 'Q72L': array(['RRAS2'], dtype=object), 'M90I': array(['RIT1'], dtype=object), 'E563K': array(['ERBB4'], dtype=object), 'R337L': array(['TP53'], dtype=object), 'L384M': array(['ABL1'], dtype=object), 'A634V': array(['CDH1'], dtype=object), 'P1771R': array(['BRCA1'], dtype=object), 'D194Y': array(['STK11'], dtype=object), 'K45Q': array(['BRCA1'], dtype=object), 'S256W': array(['PPP2R1A'], dtype=object), 'K45T': array(['BRCA1'], dtype=object), 'R337C': array(['TP53'], dtype=object), 'A634D': array(['JAK1'], dtype=object), 'T654I': array(['DDR2'], dtype=object), 'K45N': array(['BRCA1'], dtype=object), 'T654M': array(['DDR2'], dtype=object), 'P1771L': array(['BRCA1'], dtype=object), 'R337P': array(['TP53'], dtype=object), 'S256F': array(['PPP2R1A'], dtype=object), 'K830R': array(['PDGFRA'], dtype=object), 'Y806C': array(['RET'], dtype=object), 'S1733F': array(['BRCA2'], dtype=object), 'E317K': array(['ERBB4'], dtype=object), 'R552S': array(['SOS1'], dtype=object), 'V1605del': array(['NOTCH1'], dtype=object), 'E746_A750del': array(['EGFR'], dtype=object), 'E69G': array(['CDKN2A'], dtype=object), 'D254N': array(['CDH1'], dtype=object), 'R320Q': array(['KEAP1'], dtype=object), 'V299L': array(['ABL1'], dtype=object), 'Q96P': array(['VHL'], dtype=object), 'Q233*': array(['PMS1'], dtype=object), 'C229R': array(['TP53'], dtype=object), 'G735S': array(['EGFR'], dtype=object), 'D1071N': array(['PDGFRA'], dtype=object), 'Y16C': array(['PTEN'], dtype=object), 'K558delinsNP': array(['KIT'], dtype=object), 'L692F': array(['EGFR'], dtype=object), 'S241L': array(['TGFBR1'], dtype=object), 'L1196M': array(['ALK'], dtype=object), 'S241F': array(['TP53'], dtype=object), 'L703P': array(['EGFR'], dtype=object), 'Y298A': array(['SMAD3'], dtype=object), 'P250L': array(['TP53'], dtype=object), 'L399V': array(['CBL'], dtype=object), 'S241Y': array(['TP53'], dtype=object), 'S46N': array(['PMS2'], dtype=object), 'L1196Q': array(['ALK'], dtype=object), 'S241T': array(['TP53'], dtype=object), 'S46I': array(['PMS2'], dtype=object), 'G251C': array(['PTEN'], dtype=object), 'Q689R': array(['MLH1'], dtype=object), 'L792R': array(['TSC2'], dtype=object), 'Y105C': array(['BRCA1'], dtype=object), 'M35R': array(['PTEN'], dtype=object), 'C41Y': array(['TGFBR1'], dtype=object), 'K603Q': array(['RET'], dtype=object), 'G35R': array(['H3F3A', 'CDKN2A'], dtype=object), 'RUNX1-RUNX1T1 Fusion': array(['RUNX1'], dtype=object), 'Y551F': array(['BTK'], dtype=object), 'V1306I': array(['BRCA2'], dtype=object), 'Q984K': array(['PTPRT'], dtype=object), 'L248V': array(['ABL1'], dtype=object), 'R1726G': array(['BRCA1'], dtype=object), 'L607I': array(['RB1'], dtype=object), 'E355A': array(['ABL1'], dtype=object), 'E17K': array(['AKT1', 'AKT2', 'AKT3'], dtype=object), 'KIAA1549-BRAF Fusion': array(['BRAF'], dtype=object), 'E322K': array(['MAPK1'], dtype=object), 'S492R': array(['EGFR'], dtype=object), 'H570R': array(['PDGFRA'], dtype=object), 'E709V': array(['EGFR'], dtype=object), 'L1198F': array(['ALK'], dtype=object), 'E709Q': array(['EGFR'], dtype=object), 'G12V': array(['HRAS', 'RAC1', 'KRAS', 'NRAS'], dtype=object), 'F346V': array(['SMAD2'], dtype=object), 'P1859R': array(['BRCA1'], dtype=object), 'G12R': array(['KRAS'], dtype=object), 'G12S': array(['HRAS', 'KRAS'], dtype=object), 'E709G': array(['EGFR'], dtype=object), 'L1198P': array(['ALK'], dtype=object), 'E709A': array(['EGFR'], dtype=object), 'G12F': array(['KRAS'], dtype=object), 'G12D': array(['KRAS', 'NRAS'], dtype=object), 'G12C': array(['HRAS', 'KRAS'], dtype=object), 'S186Y': array(['BRCA1'], dtype=object), 'G12A': array(['HRAS', 'KRAS'], dtype=object), 'L344P': array(['TP53'], dtype=object), 'L344R': array(['TP53'], dtype=object), 'C611Y': array(['RET'], dtype=object), 'V1565I': array(['PTPRD'], dtype=object), 'D130A': array(['SPOP'], dtype=object), 'E746_T751delinsVA': array(['EGFR'], dtype=object), 'N276S': array(['PTEN'], dtype=object), 'L983F': array(['JAK2'], dtype=object), 'A40E': array(['KNSTRN'], dtype=object), 'D422N': array(['KEAP1'], dtype=object), 'E31K': array(['KRAS'], dtype=object), 'R130A': array(['PTEN'], dtype=object), 'Amplification': array(['TERT', 'CCND2', 'CCND3', 'CCNE1', 'EGFR', 'NKX2-1', 'BRD4', 'PAK1',
       'ROS1', 'CDK4', 'CDK6', 'CDK8', 'ERBB2', 'ERBB3', 'ERCC2', 'PDGFRA',
       'PDGFRB', 'FOXA1', 'MDM2', 'EZH2', 'MDM4', 'MET', 'PIK3CA', 'YAP1',
       'PIK3R2', 'HRAS', 'FGF3', 'FGF4', 'KMT2A', 'AKT2', 'FGFR1', 'FGFR3',
       'FGFR4', 'ALK', 'VEGFA', 'FLT1', 'FLT3', 'RICTOR', 'AR', 'RHOA',
       'IKBKE', 'SMO', 'MEF2B', 'MAPK1', 'MAP2K1', 'MYC', 'MYCN', 'AGO2',
       'AXL', 'SRC', 'GATA3', 'CCND1', 'WHSC1L1', 'JAK2', 'AURKA', 'JUN',
       'FOXL2', 'BRAF', 'KDR', 'KIT', 'FGF19', 'RAC1', 'DNMT3B', 'KRAS',
       'RAF1', 'AKT3', 'NRAS', 'KDM5A', 'NTRK1', 'DDR2', 'RUNX1'], dtype=object), 'R130G': array(['PTEN'], dtype=object), 'T910M': array(['SMARCA4'], dtype=object), 'R130K': array(['PTEN'], dtype=object), 'G508S': array(['SMAD4'], dtype=object), 'R130L': array(['PTEN'], dtype=object), 'R802*': array(['PMS2'], dtype=object), 'X963_splice': array(['MET'], dtype=object), 'E804G': array(['EGFR'], dtype=object), 'S376F': array(['CBL'], dtype=object), 'V1688del': array(['BRCA1'], dtype=object), 'G207E': array(['PMS2'], dtype=object), 'EWSR1-ETV4 Fusion': array(['EWSR1'], dtype=object), 'R479Q': array(['FBXW7'], dtype=object), 'A727V': array(['PIK3R2'], dtype=object), 'R1835P': array(['BRCA1'], dtype=object), 'R154W': array(['MAP2K4'], dtype=object), 'D1280V': array(['BRCA2'], dtype=object), 'M269R': array(['SOS1'], dtype=object), 'Y234C': array(['TP53'], dtype=object), 'R479H': array(['FBXW7'], dtype=object), 'S453fs*': array(['FOXA1'], dtype=object), 'R479L': array(['FBXW7'], dtype=object), 'Y234H': array(['TP53'], dtype=object), 'E79K': array(['NFE2L2'], dtype=object), 'T41A': array(['CTNNB1'], dtype=object), 'V430M': array(['CBL'], dtype=object), 'G70D': array(['IDH1'], dtype=object), 'G93W': array(['CDKN2A'], dtype=object), 'Q2858R': array(['BRCA2'], dtype=object), 'N238S': array(['ERCC2'], dtype=object), 'F367S': array(['POLE'], dtype=object), 'MPRIP-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'R2842H': array(['BRCA2'], dtype=object), 'E77K': array(['MEF2B'], dtype=object), 'S729C': array(['JAK1'], dtype=object), 'P1749R': array(['BRCA1'], dtype=object), 'S858R': array(['FANCA'], dtype=object), 'I1307K': array(['APC'], dtype=object), 'C1156F': array(['ALK'], dtype=object), 'N1236K': array(['BRCA1'], dtype=object), 'R130*': array(['PTEN'], dtype=object), 'S1002R': array(['ERBB2'], dtype=object), 'C121S': array(['MAP2K1'], dtype=object), 'H1966Y': array(['BRCA2'], dtype=object), 'I834V': array(['PDGFRA'], dtype=object), 'L32P': array(['CDKN2A'], dtype=object), 'N535K': array(['FGFR4'], dtype=object), 'K320E': array(['ELF3'], dtype=object), 'G637W': array(['FGFR3'], dtype=object), 'C1156Y': array(['ALK'], dtype=object), 'RUNX1-EVI1 Fusion': array(['RUNX1'], dtype=object), 'A113_splice': array(['EIF1AX'], dtype=object), 'H214N': array(['TP53'], dtype=object), 'R640G': array(['APC'], dtype=object), 'V1576E': array(['NOTCH1'], dtype=object), 'K373E': array(['CHEK2'], dtype=object), 'P306H': array(['MAP2K1'], dtype=object), 'H214R': array(['TP53'], dtype=object), 'H214Q': array(['TP53'], dtype=object), 'Y69H': array(['MEF2B'], dtype=object), 'G245D': array(['TP53'], dtype=object), 'G245A': array(['TP53'], dtype=object), 'TFG-ROS1 Fusion': array(['ROS1'], dtype=object), 'I195T': array(['TP53'], dtype=object), 'Q477E': array(['SMO'], dtype=object), 'G245S': array(['TP53'], dtype=object), 'E1346K': array(['BRCA1'], dtype=object), 'R631C': array(['ERCC2'], dtype=object), 'L117P': array(['TSC1'], dtype=object), 'I251S': array(['TP53'], dtype=object), 'E40N': array(['RHOA'], dtype=object), 'E40L': array(['RHOA'], dtype=object), 'E40K': array(['AKT1'], dtype=object), 'T771R': array(['KDR'], dtype=object), 'E40W': array(['RHOA'], dtype=object), '963_D1010splice': array(['MET'], dtype=object), 'K291E': array(['TP53'], dtype=object), 'E40Q': array(['RHOA'], dtype=object), 'K342N': array(['PTEN'], dtype=object), 'P491S': array(['PTPN11'], dtype=object), 'V294M': array(['IDH1'], dtype=object), '1_2009trunc': array(['NOTCH2'], dtype=object), 'C136R': array(['PTEN'], dtype=object), 'P286H': array(['POLE'], dtype=object), 'I219V': array(['MLH1'], dtype=object), 'R1040L': array(['PTPRT'], dtype=object), 'R1095H': array(['MSH6'], dtype=object), 'N857S': array(['ERBB2'], dtype=object), 'I1616N': array(['NOTCH1'], dtype=object), 'I1616T': array(['NOTCH1'], dtype=object), 'T992I': array(['MET'], dtype=object), 'N653H': array(['FGFR3'], dtype=object), 'Q59E': array(['AKT1'], dtype=object), 'C228T': array(['TERT'], dtype=object), 'G1809R': array(['DICER1'], dtype=object), 'L46R': array(['NF2'], dtype=object), 'D257N': array(['CDH1'], dtype=object), 'D300H': array(['SMAD2'], dtype=object), 'R183P': array(['PPP2R1A'], dtype=object), 'K117R': array(['HRAS'], dtype=object), 'D300N': array(['SMAD2'], dtype=object), 'I26N': array(['BRCA1'], dtype=object), 'L46F': array(['MAP2K2'], dtype=object), 'E586K': array(['BRAF'], dtype=object), 'T241P': array(['BRAF'], dtype=object), 'M160V': array(['TMPRSS2'], dtype=object), 'G1809K': array(['DICER1'], dtype=object), 'E1051K': array(['PIK3CB'], dtype=object), 'R183G': array(['PPP2R1A'], dtype=object), 'S45P': array(['CTNNB1'], dtype=object), 'P253R': array(['FGFR2'], dtype=object), 'S1653F': array(['TSC2'], dtype=object), 'S45Y': array(['CTNNB1'], dtype=object), 'H662R': array(['SF3B1'], dtype=object), 'S10N': array(['PTEN'], dtype=object), 'S45A': array(['CTNNB1'], dtype=object), 'S45F': array(['CTNNB1'], dtype=object), 'L702H': array(['AR'], dtype=object), 'E606G': array(['ERCC2'], dtype=object), 'S240R': array(['TP53'], dtype=object), 'S1653P': array(['TSC2'], dtype=object), 'G1123S': array(['ALK'], dtype=object), 'E1794D': array(['BRCA1'], dtype=object), 'M774_A775insAYVM': array(['ERBB2'], dtype=object), 'L601_K602insREYEYDL': array(['FLT3'], dtype=object), 'D331G': array(['PTEN'], dtype=object), 'G1123D': array(['ALK'], dtype=object), 'E579K': array(['PIK3CA'], dtype=object), 'V794M': array(['ERBB2'], dtype=object), 'F212Y': array(['TP53'], dtype=object), 'H114Y': array(['PPP6C'], dtype=object), 'Q145H': array(['VHL'], dtype=object), 'T338I': array(['RET'], dtype=object), 'E221Q': array(['TP53'], dtype=object), 'I68K': array(['BRCA1'], dtype=object), 'T1685I': array(['BRCA1'], dtype=object), 'D86N': array(['CDKN2B'], dtype=object), 'R324L': array(['EGFR'], dtype=object), 'K1062M': array(['ALK'], dtype=object), 'V726M': array(['EGFR'], dtype=object), 'G469V': array(['BRAF'], dtype=object), 'I326V': array(['BRAF'], dtype=object), 'Y297A': array(['SMAD3'], dtype=object), '256_286trunc': array(['CCND1'], dtype=object), 'G13R': array(['HRAS'], dtype=object), 'Q1503P': array(['TSC2'], dtype=object), 'G13V': array(['HRAS', 'NRAS'], dtype=object), 'A717G': array(['ERCC2'], dtype=object), 'R265C': array(['MLH1'], dtype=object), 'S2215T': array(['MTOR'], dtype=object), 'G469A': array(['BRAF'], dtype=object), 'S2215Y': array(['MTOR'], dtype=object), 'T710A': array(['EGFR'], dtype=object), 'F341C': array(['TP53'], dtype=object), 'G13E': array(['KRAS'], dtype=object), 'G13D': array(['HRAS', 'NRAS'], dtype=object), 'E330K': array(['EGFR', 'SMAD4'], dtype=object), 'K558N': array(['KIT'], dtype=object), 'I1807S': array(['BRCA1'], dtype=object), 'E172K': array(['MLH1'], dtype=object), 'LIMA1-ROS1 Fusion': array(['ROS1'], dtype=object), 'P42T': array(['AKT1'], dtype=object), 'D820G': array(['KIT'], dtype=object), 'K507Q': array(['SMAD4'], dtype=object), 'G161V': array(['AKT2'], dtype=object), 'M133T': array(['TP53'], dtype=object), 'Y375_K455del': array(['PDGFRA'], dtype=object), 'E452K': array(['ERBB4'], dtype=object), 'H1047L': array(['PIK3CA'], dtype=object), 'D423N': array(['ERCC2'], dtype=object), 'K507A': array(['SMAD4'], dtype=object), 'F1061W': array(['JAK2'], dtype=object), 'A211D': array(['NF2'], dtype=object), 'R1276P': array(['NF1'], dtype=object), 'R1276Q': array(['NF1'], dtype=object), 'C634W': array(['RET'], dtype=object), 'C634S': array(['RET'], dtype=object), 'C634R': array(['RET'], dtype=object), 'A532H': array(['SMAD4'], dtype=object), 'F1174L': array(['ALK'], dtype=object), 'F1174I': array(['ALK'], dtype=object), 'E483*': array(['MSH2'], dtype=object), 'W802*': array(['CBL'], dtype=object), 'H1421Y': array(['BRCA1'], dtype=object), 'V773A': array(['ERBB2'], dtype=object), 'G881D': array(['ALK'], dtype=object), 'I767M': array(['ERBB2'], dtype=object), 'A389T': array(['FGFR2'], dtype=object), 'S1088F': array(['FANCA'], dtype=object), 'S257W': array(['RAF1'], dtype=object), 'R378A': array(['SMAD4'], dtype=object), 'Y412F': array(['PTPRT'], dtype=object), 'F1524V': array(['BRCA2'], dtype=object), 'E14*': array(['NFKBIA'], dtype=object), 'M1628V': array(['BRCA1'], dtype=object), 'S1172L': array(['BRCA2'], dtype=object), 'E632_L633del': array(['RET'], dtype=object), 'V1673F': array(['TSC2'], dtype=object), 'D153V': array(['KRAS'], dtype=object), 'A72V': array(['PTPN11'], dtype=object), 'R337H': array(['TP53'], dtype=object), 'D806H': array(['BRCA2'], dtype=object), 'G810S': array(['EGFR'], dtype=object), 'X434_splice': array(['PIK3R1'], dtype=object), 'R38H': array(['PIK3CA'], dtype=object), 'A1789T': array(['BRCA1'], dtype=object), 'L493V': array(['TSC2'], dtype=object), 'A34D': array(['PTEN'], dtype=object), 'L493P': array(['TSC2'], dtype=object), 'R1896M': array(['TET2'], dtype=object), 'A750P': array(['EGFR'], dtype=object), 'G116S': array(['CARD11'], dtype=object), 'C242F': array(['TP53'], dtype=object), 'L230P': array(['ERCC4'], dtype=object), 'W1502A': array(['CREBBP'], dtype=object), 'K935I': array(['ERBB4'], dtype=object), 'V555M': array(['FGFR3'], dtype=object), 'D108H': array(['CDKN2A'], dtype=object), 'D108N': array(['CDKN2A'], dtype=object), 'ACPP-PIK3CB Fusion': array(['PIK3CB'], dtype=object), 'C39S': array(['BRCA1'], dtype=object), 'D842_H845del': array(['PDGFRA'], dtype=object), 'W117C': array(['VHL'], dtype=object), 'C39Y': array(['BRCA1'], dtype=object), 'V1653M': array(['BRCA1'], dtype=object), 'D835H': array(['FLT3'], dtype=object), 'G39E': array(['MSH6'], dtype=object), 'R481G': array(['PDGFRA'], dtype=object), 'G2430A': array(['NOTCH1'], dtype=object), 'Fusions': array(['NKX2-1', 'TMPRSS2', 'ROS1', 'EP300', 'ERBB4', 'ABL1', 'ERG',
       'PDGFRB', 'ESR1', 'ETV1', 'ETV6', 'EWSR1', 'PIK3CB', 'YAP1',
       'KMT2A', 'AKT2', 'FGFR1', 'FGFR3', 'FGFR2', 'ALK', 'MYC', 'BCOR',
       'CIC', 'JAK2', 'BRAF', 'NOTCH1', 'RAF1', 'AKT3', 'NTRK1', 'NTRK2',
       'NTRK3', 'NSD1', 'RET', 'RUNX1'], dtype=object), 'D140G': array(['SPOP'], dtype=object), 'ROS1-CD74 Fusion': array(['ROS1'], dtype=object), 'T670I': array(['KIT'], dtype=object), 'S2670L': array(['BRCA2'], dtype=object), 'P33S': array(['IDH1'], dtype=object), 'A1131T': array(['ERBB3'], dtype=object), 'A1020V': array(['PIK3CA'], dtype=object), 'G382D': array(['FGFR3'], dtype=object), 'A1752P': array(['BRCA1'], dtype=object), 'EGFRvII': array(['EGFR'], dtype=object), 'E265K': array(['ERBB2'], dtype=object), 'ETV6-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'T1087I': array(['ALK'], dtype=object), 'D1352Y': array(['BRCA2'], dtype=object), 'P1139S': array(['ALK'], dtype=object), 'EGFRvIV': array(['EGFR'], dtype=object), 'K292I': array(['TP53'], dtype=object), 'ZNF198-FGFR1 Fusion': array(['FGFR1'], dtype=object), 'H410R': array(['STAT3'], dtype=object), 'R158H': array(['TP53'], dtype=object), 'D845A': array(['ERBB2'], dtype=object), 'V777L': array(['ERBB2'], dtype=object), 'V777M': array(['ERBB2'], dtype=object), 'V777A': array(['ERBB2'], dtype=object), 'M1663K': array(['BRCA1'], dtype=object), 'G1202R': array(['ALK'], dtype=object), 'G623R': array(['NTRK3'], dtype=object), 'R267P': array(['TP53'], dtype=object), 'W557_V560del': array(['KIT'], dtype=object), 'F2108L': array(['MTOR'], dtype=object), 'W557_K558del': array(['KIT'], dtype=object), 'L57del': array(['AR'], dtype=object), 'S297F': array(['POLE'], dtype=object), 'G430C': array(['KEAP1'], dtype=object), 'V391I': array(['CBL'], dtype=object), 'K648N': array(['JAK1'], dtype=object), 'F1088Lfs*5': array(['MSH6'], dtype=object), 'D600_L601insFREYEYD': array(['FLT3'], dtype=object), 'RANBP2-ALK Fusion': array(['ALK'], dtype=object), 'W1038C': array(['JAK2'], dtype=object), 'D162H': array(['PTEN'], dtype=object), 'I89T': array(['BRCA1'], dtype=object), 'G266R': array(['TP53'], dtype=object), 'G284R': array(['ERBB3'], dtype=object), 'A2770T': array(['BRCA2'], dtype=object), 'A1099T': array(['ALK'], dtype=object), 'Y599_D600insPAPQIMSTSTLISENMNIA': array(['FLT3'], dtype=object), 'M237K': array(['TP53'], dtype=object), 'I89N': array(['BRCA1'], dtype=object), 'G266E': array(['TP53'], dtype=object), 'K117N': array(['HRAS', 'KRAS'], dtype=object), 'D67N': array(['MAP2K1'], dtype=object), 'M1K': array(['B2M'], dtype=object), 'L37P': array(['MAP2K1'], dtype=object), 'G701S': array(['FGFR2'], dtype=object), 'W80A': array(['AKT1'], dtype=object), 'CAD-ALK Fusion': array(['ALK'], dtype=object), 'I642V': array(['FGFR2'], dtype=object), 'M1R': array(['B2M', 'BRCA1'], dtype=object), 'M1T': array(['B2M'], dtype=object), 'D67Y': array(['BRCA1'], dtype=object), 'M1V': array(['B2M'], dtype=object), 'S783P': array(['ERBB2'], dtype=object), 'G253C': array(['DDR2'], dtype=object), 'GOLGA4-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'I843del': array(['PDGFRA'], dtype=object), 'I47F': array(['BAP1'], dtype=object), 'R249M': array(['TP53'], dtype=object), 'T485K': array(['PMS2'], dtype=object), 'R732Q': array(['CDH1'], dtype=object), 'K62R': array(['PTEN'], dtype=object), 'E627D': array(['FGFR3'], dtype=object), 'F1734S': array(['BRCA1'], dtype=object), 'Truncating Mutations in the PEST Domain': array(['NOTCH1'], dtype=object), 'F31I': array(['AURKA'], dtype=object), 'M1783T': array(['BRCA1'], dtype=object), 'E501G': array(['BRAF'], dtype=object), 'Q120E': array(['AR'], dtype=object), 'YAP1-MAMLD1 Fusion': array(['YAP1'], dtype=object), 'T875N': array(['JAK2'], dtype=object), 'M1652T': array(['BRCA1'], dtype=object), 'E501K': array(['BRAF'], dtype=object), 'A134D': array(['IDH1'], dtype=object), 'P305L': array(['SMAD2'], dtype=object), 'M1652K': array(['BRCA1'], dtype=object), 'M1783L': array(['BRCA1'], dtype=object), 'T58I': array(['HRAS', 'KRAS', 'NRAS'], dtype=object), 'M1783I': array(['BRCA1'], dtype=object), 'A1843T': array(['BRCA1'], dtype=object), 'P449T': array(['PIK3CA'], dtype=object), 'G244D': array(['TP53'], dtype=object), 'N542_E543del': array(['JAK2'], dtype=object), 'DNA binding domain deletions': array(['TP53'], dtype=object), 'F102C': array(['SPOP'], dtype=object), 'G244R': array(['TP53'], dtype=object), 'N810Y': array(['BRCA1'], dtype=object), 'R725C': array(['MLH1'], dtype=object), 'E565G': array(['FGFR2'], dtype=object), 'D84V': array(['CDKN2A'], dtype=object), 'H1620R': array(['TSC2'], dtype=object), 'A290T': array(['APC'], dtype=object), 'I538V': array(['FGFR3'], dtype=object), 'A347T': array(['TP53'], dtype=object), 'A636P': array(['MSH2'], dtype=object), 'H845Y': array(['PDGFRA'], dtype=object), 'E921K': array(['RET'], dtype=object), 'G724S': array(['EGFR'], dtype=object), 'S70fsX93': array(['RUNX1'], dtype=object), 'L112R': array(['PTEN'], dtype=object), 'K526E': array(['FGFR2'], dtype=object), 'G1201E': array(['ALK'], dtype=object), 'T1203K': array(['TSC2'], dtype=object), 'L830R': array(['TSC2'], dtype=object), 'R331P': array(['ELF3'], dtype=object), 'K101M': array(['MAP2K2'], dtype=object), 'S476I': array(['KIT'], dtype=object), 'A8S': array(['MTOR'], dtype=object), 'L747V': array(['EGFR'], dtype=object), 'L747P': array(['EGFR'], dtype=object), 'L747S': array(['EGFR'], dtype=object), 'S250P': array(['CARD11'], dtype=object), 'L747F': array(['EGFR'], dtype=object), 'R506_K507insVLR': array(['BRAF'], dtype=object), 'H538Q': array(['JAK2'], dtype=object), 'P214L': array(['ETV6'], dtype=object), 'A1708E': array(['BRCA1'], dtype=object), 'S59R': array(['BRCA1'], dtype=object), 'G1567D': array(['TSC2'], dtype=object), 'A500T': array(['FGFR3'], dtype=object), 'A205T': array(['STK11'], dtype=object), 'G2748D': array(['BRCA2'], dtype=object), 'Q1756fs': array(['BRCA1'], dtype=object), 'L536P': array(['ESR1'], dtype=object), 'R182W': array(['PPP2R1A'], dtype=object), 'L749P': array(['MLH1'], dtype=object), 'L1204F': array(['ALK'], dtype=object), 'H115R': array(['VHL'], dtype=object), 'L67P': array(['STK11'], dtype=object), 'D1692N': array(['BRCA1'], dtype=object), 'N1044K': array(['PIK3CA'], dtype=object), 'EWSR1-FLI1 Fusion': array(['EWSR1'], dtype=object), 'T24A': array(['FOXO1'], dtype=object), 'SSBP2-JAK2 Fusion': array(['JAK2'], dtype=object), 'E554_V559del': array(['KIT'], dtype=object), 'S2483N': array(['BRCA2'], dtype=object), 'K601N': array(['BRAF'], dtype=object), 'M887I': array(['ERBB4'], dtype=object), 'SPAG9-JAK2 Fusion': array(['JAK2'], dtype=object), 'K601E': array(['BRAF'], dtype=object), 'M2327I': array(['MTOR'], dtype=object), 'Y553N': array(['KIT'], dtype=object), 'EGFRvV': array(['EGFR'], dtype=object), 'T389K': array(['ERBB3'], dtype=object), 'K601Q': array(['BRAF'], dtype=object), 'N372H': array(['BRCA2'], dtype=object), 'Y3092C': array(['BRCA2'], dtype=object), 'A767_V769del': array(['EGFR'], dtype=object), 'M37K': array(['NFKBIA'], dtype=object), 'P114L': array(['CDKN2A'], dtype=object), 'P114S': array(['CDKN2A'], dtype=object), 'V379I': array(['ABL1'], dtype=object), 'L996F': array(['CDK12'], dtype=object), 'BCR-JAK2 Fusion': array(['JAK2'], dtype=object), 'S869L': array(['BRCA2'], dtype=object), 'P848L': array(['EGFR'], dtype=object), 'K499E': array(['BRAF'], dtype=object), 'L1460P': array(['MTOR'], dtype=object), 'H193P': array(['TP53'], dtype=object), 'Y220C': array(['TP53'], dtype=object), 'S1670A': array(['BRCA2'], dtype=object), 'H193N': array(['TP53'], dtype=object), 'E255K': array(['ABL1'], dtype=object), 'R213Q': array(['TP53'], dtype=object), 'K38N': array(['BRCA1'], dtype=object), 'E2663V': array(['BRCA2'], dtype=object), 'V414A': array(['SMO'], dtype=object), 'E279K': array(['ABL1'], dtype=object), 'STRN-ALK Fusion': array(['ALK'], dtype=object), 'E1099K': array(['WHSC1'], dtype=object), 'H1402Y': array(['BRCA1'], dtype=object), 'G1079D': array(['JAK1'], dtype=object), 'E453A': array(['PIK3CA'], dtype=object), 'D821N': array(['ERBB2'], dtype=object), 'C176F': array(['TP53'], dtype=object), 'N1125I': array(['PTPRT'], dtype=object), 'R879C': array(['JAK1'], dtype=object), 'R1464*': array(['ALK'], dtype=object), 'H297N': array(['TP53'], dtype=object), 'T488_P492del': array(['BRAF'], dtype=object), 'P577_D579del': array(['KIT'], dtype=object), 'E88K': array(['CDKN2A'], dtype=object), 'M713I': array(['NTRK2'], dtype=object), 'CD74-ROS1 Fusion': array(['ROS1'], dtype=object), 'N2113S': array(['BRCA2'], dtype=object), 'R11K': array(['KNSTRN'], dtype=object), 'EWSR1-DDIT3 Fusion': array(['EWSR1'], dtype=object), 'SEC16A1-NOTCH1 Fusion': array(['NOTCH1'], dtype=object), 'Y236S': array(['TP53'], dtype=object), 'H36P': array(['CTNNB1'], dtype=object), 'EWSR1-ETV1 Fusion': array(['ETV1', 'EWSR1'], dtype=object), 'L550P': array(['MLH1'], dtype=object), 'D646Y': array(['FGFR3'], dtype=object), 'Y236D': array(['TP53'], dtype=object), 'Y236C': array(['TP53'], dtype=object), 'S1512I': array(['BRCA1'], dtype=object), 'P94A': array(['SPOP'], dtype=object), 'FAM131B-BRAF Fusion': array(['BRAF'], dtype=object), 'F1245C': array(['ALK'], dtype=object), 'R1751P': array(['BRCA1'], dtype=object), 'R1751Q': array(['BRCA1'], dtype=object), 'R273G': array(['TP53'], dtype=object), 'I31M': array(['PIK3CA', 'BRCA1'], dtype=object), 'L790F': array(['RET'], dtype=object), 'H1904R': array(['TET2'], dtype=object), 'L1267S': array(['BRCA1'], dtype=object), 'W509R': array(['SMAD4'], dtype=object), 'E717K': array(['ERBB2'], dtype=object), 'F1245V': array(['ALK'], dtype=object), 'DNMT3B7': array(['DNMT3B'], dtype=object), 'C1483F': array(['MTOR'], dtype=object), 'F691L': array(['FLT3'], dtype=object), 'C18Y': array(['RAC1'], dtype=object), 'L861F': array(['EGFR'], dtype=object), 'C1483R': array(['MTOR'], dtype=object), 'C1483W': array(['MTOR'], dtype=object), 'R1231Q': array(['ALK'], dtype=object), 'C1483Y': array(['MTOR'], dtype=object), 'E996K': array(['PDGFRA'], dtype=object), 'C582F': array(['FGFR3'], dtype=object), 'D770_N771insVDSVDNP': array(['EGFR'], dtype=object), 'C1385': array(['EP300'], dtype=object), 'P480L': array(['KDM5C'], dtype=object), 'L410R': array(['TSC2'], dtype=object), 'N564K': array(['PIK3R1'], dtype=object), 'I157T': array(['CHEK2'], dtype=object), 'Y238A': array(['SMAD3'], dtype=object), 'V242F': array(['ERCC2'], dtype=object), 'G1232D': array(['SMARCA4'], dtype=object), 'N564D': array(['PIK3R1'], dtype=object), 'V654A': array(['KIT'], dtype=object), 'Y1853*': array(['BRCA1'], dtype=object), 'Exon 20 insertion': array(['EGFR'], dtype=object), 'A2425T': array(['NOTCH1'], dtype=object), 'D24Y': array(['PTEN'], dtype=object), 'R724H': array(['JAK1'], dtype=object), 'NSD1-NUP98 Fusion': array(['NSD1'], dtype=object), 'V220F': array(['TSC1'], dtype=object), 'R156C': array(['TP53'], dtype=object), 'Y1853C': array(['BRCA1'], dtype=object), 'T1365M': array(['PTPRT'], dtype=object), 'H773Y': array(['EGFR'], dtype=object), 'M552_W557del': array(['KIT'], dtype=object), 'I463S': array(['BRAF'], dtype=object), 'L469V': array(['ESR1'], dtype=object), 'H773L': array(['EGFR'], dtype=object), 'S335C': array(['ERBB2'], dtype=object), 'C105F': array(['PTEN'], dtype=object), 'V3079I': array(['BRCA2'], dtype=object), 'L108P': array(['PTEN'], dtype=object), 'FGFR2-KIAA1967 Fusion': array(['FGFR2'], dtype=object), 'D1399Y': array(['EP300'], dtype=object), 'S1463F': array(['NF1'], dtype=object), 'D761Y': array(['EGFR'], dtype=object), 'V1075F': array(['JAK2'], dtype=object), 'D846Y': array(['PDGFRA'], dtype=object), 'V211D': array(['MAP2K1'], dtype=object), 'E633K': array(['PIK3CB'], dtype=object), 'I744_K745delinsKIPVAI': array(['EGFR'], dtype=object), 'L747_T751delinsP': array(['EGFR'], dtype=object), 'G87R': array(['FOXA1'], dtype=object), 'E116K': array(['PIK3CA'], dtype=object), 'F158C': array(['TSC1'], dtype=object), 'D357Y': array(['SMAD4'], dtype=object), 'L64P': array(['NF2'], dtype=object), 'ATF7IP-JAK2 Fusion': array(['JAK2'], dtype=object), 'M587V': array(['MLH1'], dtype=object), 'D1420Y': array(['BRCA2'], dtype=object), 'L915M': array(['ERBB2'], dtype=object), 'S72R': array(['BRCA1'], dtype=object), '533_534del': array(['EPAS1'], dtype=object), 'A146T': array(['HRAS', 'KRAS'], dtype=object), 'A146V': array(['HRAS', 'KRAS'], dtype=object), 'R181C': array(['TP53'], dtype=object), 'P1502L': array(['EP300'], dtype=object), 'N2436I': array(['BRCA2'], dtype=object), 'E664K': array(['FGFR1'], dtype=object), 'R181L': array(['TP53'], dtype=object), 'H123Q': array(['PTEN'], dtype=object), 'R181H': array(['TP53'], dtype=object), 'R1192P': array(['ALK'], dtype=object), 'R611Q': array(['TSC2'], dtype=object), 'R611W': array(['TSC2'], dtype=object), 'H123D': array(['PTEN'], dtype=object), 'L1780P': array(['BRCA1'], dtype=object), 'D1067Y': array(['PIK3CB'], dtype=object), 'D1067V': array(['PIK3CB'], dtype=object), 'A148T': array(['CDKN2A'], dtype=object), 'IGH-BCL2 Fusion': array(['BCL2'], dtype=object), 'L188Q': array(['VHL'], dtype=object), 'V536M': array(['ERCC2'], dtype=object), 'Y842C': array(['FLT3'], dtype=object), 'A126D': array(['PTEN'], dtype=object), 'V536E': array(['PDGFRA'], dtype=object), 'D1067A': array(['PIK3CB'], dtype=object), 'L617F': array(['FGFR2'], dtype=object), 'H2428Q': array(['NOTCH1'], dtype=object), 'FGFR2-FAM76A Fusion': array(['FGFR2'], dtype=object), 'Y652H': array(['JAK1'], dtype=object), 'CEP85L-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'L142P': array(['SOX9'], dtype=object), 'S459F': array(['POLE'], dtype=object), 'Y1035C': array(['JAK1'], dtype=object), 'W731L': array(['EGFR'], dtype=object), 'E626K': array(['CARD11'], dtype=object), 'V550E': array(['FGFR4'], dtype=object), 'T341P': array(['FGFR2'], dtype=object), 'L726F': array(['ERBB2'], dtype=object), 'L726I': array(['ERBB2'], dtype=object), 'L2431P': array(['MTOR'], dtype=object), 'E255V': array(['ABL1'], dtype=object), 'R243A': array(['SMAD3'], dtype=object), 'E1705A': array(['DICER1'], dtype=object), 'S891A': array(['RET'], dtype=object), 'L507P': array(['PDGFRA'], dtype=object), 'E1705K': array(['DICER1'], dtype=object), 'C77F': array(['AKT1'], dtype=object), 'K39N': array(['AKT1'], dtype=object), 'A4419S': array(['FAT1'], dtype=object), 'R1343L': array(['PTPRT'], dtype=object), 'L844R': array(['TSC2'], dtype=object), 'TEL-RUNX1 Fusion': array(['RUNX1'], dtype=object), 'V714M': array(['ERBB3'], dtype=object), 'H1106D': array(['MET'], dtype=object), 'R389*': array(['MSH2'], dtype=object), 'L833V': array(['EGFR'], dtype=object), 'M2676T': array(['BRCA2'], dtype=object), 'A728V': array(['BRAF'], dtype=object), 'A750_E758del': array(['EGFR'], dtype=object), 'A530T': array(['EPAS1'], dtype=object), 'A530V': array(['EPAS1'], dtype=object), 'A246P': array(['BRAF'], dtype=object), 'G776_V777insYVMA': array(['ERBB2'], dtype=object), "3' Deletion": array(['EPCAM'], dtype=object), 'E768D': array(['RET'], dtype=object), 'V579A': array(['FLT3'], dtype=object), 'K590R': array(['PTPN11'], dtype=object), 'P395A': array(['CBL'], dtype=object), 'L2721H': array(['BRCA2'], dtype=object), 'S310F': array(['ERBB2'], dtype=object), 'I408V': array(['SMO'], dtype=object), 'Q56_V60del': array(['MAP2K1'], dtype=object), 'I1349T': array(['BRCA2'], dtype=object), 'BRD4-NUT Fusion': array(['BRD4'], dtype=object), 'S247Y': array(['RXRA'], dtype=object), 'R115L': array(['PIK3CA'], dtype=object), 'R487*': array(['MLH1'], dtype=object), 'D119N': array(['KRAS'], dtype=object), 'Y139D': array(['IDH1'], dtype=object), 'P95L': array(['PTEN'], dtype=object), 'V343E': array(['PTEN'], dtype=object), 'F12L': array(['SOX9'], dtype=object), 'N45S': array(['TGFBR1'], dtype=object), 'V343L': array(['PTEN'], dtype=object), 'W383R': array(['CTNNB1'], dtype=object), 'P838L': array(['KIT'], dtype=object), 'V851A': array(['EGFR'], dtype=object), 'A36P': array(['CDKN2A'], dtype=object), 'K162D': array(['AURKA'], dtype=object), 'E1210K': array(['ALK'], dtype=object), 'I2500M': array(['MTOR'], dtype=object), 'Y599_D600insGLYVDFREYEY': array(['FLT3'], dtype=object), 'S846I': array(['ERBB3'], dtype=object), 'N219D': array(['SMO'], dtype=object), 'V299G': array(['TSC2'], dtype=object), 'I2500F': array(['MTOR'], dtype=object), 'L165P': array(['BRCA1'], dtype=object), 'G114R': array(['VHL'], dtype=object), 'Y472C': array(['BRAF'], dtype=object), 'Exon 20 insertions/deletions': array(['ERBB2'], dtype=object), 'Y472H': array(['RASA1'], dtype=object), 'G478C': array(['BRAF'], dtype=object), 'R978*': array(['SMARCA4'], dtype=object), 'W531C': array(['BRAF'], dtype=object), 'T417_D419delinsRG': array(['KIT'], dtype=object), 'V557I': array(['FLT3'], dtype=object), 'D1203N': array(['ALK'], dtype=object), 'E321K': array(['CHEK2'], dtype=object), 'S1290A': array(['TET2'], dtype=object), 'E321G': array(['ERBB2'], dtype=object), 'R487Q': array(['TGFBR1'], dtype=object), 'R487W': array(['ERCC2'], dtype=object), 'L274P': array(['FANCA'], dtype=object), 'V274F': array(['TP53'], dtype=object), 'K753E': array(['ERBB2'], dtype=object), '534_536del': array(['EPAS1'], dtype=object), 'K753A': array(['ERBB2'], dtype=object), 'Y371H': array(['CBL'], dtype=object), 'K753M': array(['ERBB2'], dtype=object), 'F1662S': array(['BRCA1'], dtype=object), 'P525L': array(['TGFBR2'], dtype=object), 'Y371S': array(['CBL'], dtype=object), 'V804L': array(['RET'], dtype=object), 'A1022E': array(['PTPRT'], dtype=object), 'AR-V7': array(['AR'], dtype=object), 'T798M': array(['ERBB2'], dtype=object), 'R2336H': array(['BRCA2'], dtype=object), 'T798I': array(['ERBB2'], dtype=object), 'G380R': array(['FGFR3'], dtype=object), 'K341A': array(['SMAD3'], dtype=object), 'A1669S': array(['BRCA1'], dtype=object), 'Y98H': array(['VHL'], dtype=object), 'R505C': array(['FBXW7'], dtype=object), 'H1047Y': array(['PIK3CA'], dtype=object), 'V755I': array(['FGFR2'], dtype=object), 'W557_V559delinsC': array(['KIT'], dtype=object), 'P95S': array(['PTEN'], dtype=object), 'W349C': array(['PDGFRA'], dtype=object), 'KIF5B-RET Fusion': array(['RET'], dtype=object), 'Y647C': array(['FGFR3'], dtype=object), 'V1398D': array(['NF1'], dtype=object), 'Y1248F': array(['ERBB2'], dtype=object), 'Q429*': array(['MSH2'], dtype=object), 'P551_V555del': array(['KIT'], dtype=object), 'I491M': array(['EGFR'], dtype=object), 'V559del': array(['KIT'], dtype=object), 'R177Q': array(['RUNX1'], dtype=object), 'S1473P': array(['BRCA1'], dtype=object), 'D2512Y': array(['MTOR'], dtype=object), 'G936R': array(['ERBB4'], dtype=object), 'K659N': array(['FGFR2'], dtype=object), 'V705E': array(['TSC2'], dtype=object), 'D2512G': array(['MTOR'], dtype=object), 'L221R': array(['SMO'], dtype=object), 'Y237A': array(['SMAD3'], dtype=object), 'K659E': array(['FGFR2'], dtype=object), 'Y772_A775dup': array(['ERBB2'], dtype=object), 'F57C': array(['MAP2K2'], dtype=object), 'RANBP1-ALK Fusion': array(['ALK'], dtype=object), 'G464A': array(['PTPN11'], dtype=object), 'A149P': array(['VHL'], dtype=object), 'E137K': array(['PIK3R1'], dtype=object), 'K442Nfs*': array(['NUP93'], dtype=object), 'F57L': array(['MAP2K2'], dtype=object), 'R669G': array(['FGFR3'], dtype=object), 'F57V': array(['MAP2K2'], dtype=object), 'E168D': array(['MET'], dtype=object), 'E554_K558del': array(['KIT'], dtype=object), 'R669C': array(['ERBB3'], dtype=object), 'D65N': array(['MAP2K1'], dtype=object), 'L145R': array(['TP53'], dtype=object), 'C238S': array(['TP53'], dtype=object), 'I1170S': array(['ALK'], dtype=object), 'I1170N': array(['ALK'], dtype=object), 'V1578del': array(['NOTCH1'], dtype=object), 'T340A': array(['CDH1'], dtype=object), 'H233N': array(['TP53'], dtype=object), 'C238F': array(['TP53'], dtype=object), 'L115R': array(['MAP2K1'], dtype=object), 'F590G': array(['FLT3'], dtype=object), 'T1691K': array(['BRCA1'], dtype=object), 'K57E': array(['MAP2K1'], dtype=object), 'R315*': array(['PMS2'], dtype=object), 'Delta-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'E157G': array(['PTEN'], dtype=object), 'R838Q': array(['ERBB4'], dtype=object), 'R368C': array(['AKT2'], dtype=object), 'H694R': array(['ALK'], dtype=object), 'T1151M': array(['ALK'], dtype=object), 'F79S': array(['BRCA1'], dtype=object), 'D887N': array(['CDK12'], dtype=object), 'R156H': array(['TP53'], dtype=object), 'Exon 19 insertion': array(['EGFR'], dtype=object), 'L118R': array(['VHL'], dtype=object), 'L118P': array(['VHL'], dtype=object), 'Y353L': array(['SMAD4'], dtype=object), 'R1200W': array(['TSC2'], dtype=object), 'V597A': array(['ALK'], dtype=object), 'D2870A': array(['ATM'], dtype=object), 'Q472H': array(['KDR'], dtype=object), 'K181M': array(['AKT2'], dtype=object), 'A767_V769dup': array(['EGFR'], dtype=object), 'N588D': array(['BRCA2'], dtype=object), 'IGH-FGFR3 Fusion': array(['FGFR3'], dtype=object), 'S478P': array(['PDGFRA'], dtype=object), 'V544_L545insAVLVLLVIVIISLI': array(['PDGFRA'], dtype=object), 'R1275L': array(['ALK'], dtype=object), 'W1610G': array(['TSC2'], dtype=object), 'H94Y': array(['CBL'], dtype=object), 'I1250T': array(['ALK'], dtype=object), 'L1433S': array(['MTOR'], dtype=object), 'M504V': array(['PTPN11'], dtype=object), 'S4F': array(['BRCA1'], dtype=object), 'T574_R588delinsL': array(['KIT'], dtype=object), 'H83Y': array(['CDKN2A'], dtype=object), 'E70K': array(['VHL'], dtype=object), 'E746_A750delinsQ': array(['EGFR'], dtype=object), 'H83N': array(['CDKN2A'], dtype=object), 'S4P': array(['BRCA1'], dtype=object), 'K539L': array(['JAK2'], dtype=object), 'C1767S': array(['BRCA1'], dtype=object), 'S252W': array(['FGFR2'], dtype=object), 'S645C': array(['EGFR'], dtype=object), 'E2419K': array(['MTOR'], dtype=object), 'D1818G': array(['BRCA1'], dtype=object), 'V659E': array(['ERBB2'], dtype=object), 'S653C': array(['ERBB2'], dtype=object), 'N1730S': array(['BRCA1'], dtype=object), 'V1714G': array(['BRCA1'], dtype=object), 'L49H': array(['ERBB2'], dtype=object), 'K428A': array(['SMAD4'], dtype=object), 'S1715N': array(['BRCA1'], dtype=object), 'T244R': array(['ERBB4'], dtype=object), 'T3349A': array(['BRCA2'], dtype=object), 'E1799K': array(['MTOR'], dtype=object), 'Q337*': array(['MSH2'], dtype=object), 'D390Y': array(['CBL'], dtype=object), 'D1853N': array(['ATM'], dtype=object), 'K558_V559del': array(['KIT'], dtype=object), 'N540S': array(['FGFR3'], dtype=object), 'PRKG2-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'R897Q': array(['RET'], dtype=object), 'M1400V': array(['BRCA1'], dtype=object), 'H1746Q': array(['BRCA1'], dtype=object), 'G67W': array(['MLH1'], dtype=object), 'V1676D': array(['NOTCH1'], dtype=object), 'R206H': array(['ACVR1'], dtype=object), 'G67R': array(['CDKN2A', 'MLH1'], dtype=object), 'H396R': array(['ABL1'], dtype=object), 'M1250T': array(['MET'], dtype=object), 'F351L': array(['ARAF'], dtype=object), 'D927G': array(['PTPRT'], dtype=object), 'Y532C': array(['EPAS1'], dtype=object), 'P449_L455del': array(['PIK3CA'], dtype=object), 'L348S': array(['TP53'], dtype=object), 'H1746N': array(['BRCA1'], dtype=object), 'A19V': array(['SOX9'], dtype=object), 'P34R': array(['HRAS', 'KRAS'], dtype=object), 'R3052W': array(['BRCA2'], dtype=object), 'E326L': array(['TP53'], dtype=object), 'A883T': array(['RET'], dtype=object), 'W1837R': array(['BRCA1'], dtype=object), 'W1291R': array(['TET2'], dtype=object), 'A57V': array(['CDKN2A'], dtype=object), 'A677G': array(['EZH2'], dtype=object), 'EWSR1-FEV Fusion': array(['EWSR1'], dtype=object), 'V191D': array(['BRCA1'], dtype=object), 'L1301R': array(['NF1'], dtype=object), 'S614R': array(['STAT3'], dtype=object), 'D325A': array(['AKT1'], dtype=object), 'C706F': array(['RB1'], dtype=object), 'V191I': array(['BRCA1'], dtype=object), 'R441P': array(['SMAD4'], dtype=object), 'R268A': array(['SMAD3'], dtype=object), 'G466V': array(['BRAF'], dtype=object), 'G466R': array(['BRAF'], dtype=object), 'L485_Q494del': array(['BRAF'], dtype=object), 'G778_P780dup': array(['ERBB2'], dtype=object), 'R342W': array(['NTRK1'], dtype=object), 'R342P': array(['TP53'], dtype=object), 'R342Q': array(['NTRK1'], dtype=object), 'G466A': array(['BRAF'], dtype=object), 'R922*': array(['MSH6'], dtype=object), 'H191D': array(['VHL'], dtype=object), 'A232V': array(['ERBB3'], dtype=object), 'D560Y': array(['PIK3R1'], dtype=object), 'A75P': array(['BRCA2'], dtype=object), 'D770_N771insD': array(['EGFR'], dtype=object), 'L910P': array(['JAK1'], dtype=object), 'Q1500P': array(['CREBBP'], dtype=object), 'S703I': array(['JAK1'], dtype=object), 'Q2223K': array(['MTOR'], dtype=object), 'K467T': array(['EGFR'], dtype=object), 'M53I': array(['CDKN2A'], dtype=object), 'G2032R': array(['ROS1'], dtype=object), 'E836K': array(['ERBB4'], dtype=object), 'E102_I103del': array(['MAP2K1'], dtype=object), 'C396R': array(['CBL'], dtype=object), 'R420Q': array(['CBL'], dtype=object), 'R833C': array(['RET'], dtype=object), 'L1152R': array(['ALK'], dtype=object), 'FGFR1OP1-FGFR1 Fusion': array(['FGFR1'], dtype=object), 'L1152P': array(['ALK'], dtype=object), 'V742A': array(['EGFR'], dtype=object), 'G52R': array(['TGFBR1'], dtype=object), 'G857E': array(['EGFR'], dtype=object), 'G857A': array(['PMS2'], dtype=object), 'N48K': array(['PTEN'], dtype=object), 'K656E': array(['FGFR1'], dtype=object), 'P539R': array(['PIK3CA'], dtype=object), 'R776H': array(['EGFR'], dtype=object), 'G602R': array(['BRCA2'], dtype=object), 'F81V': array(['BAP1'], dtype=object), 'S326R': array(['BRCA2'], dtype=object), 'R1753T': array(['BRCA1'], dtype=object), 'S243C': array(['KEAP1'], dtype=object), 'V1736A': array(['BRCA1'], dtype=object), 'R776C': array(['EGFR'], dtype=object), 'K125R': array(['PTEN'], dtype=object), 'V1736G': array(['BRCA1'], dtype=object), 'R174*': array(['RUNX1'], dtype=object), 'R82P': array(['VHL'], dtype=object), 'V555_L576del': array(['KIT'], dtype=object), 'E466K': array(['FGFR3'], dtype=object), 'P123M': array(['PIM1'], dtype=object), 'H398Y': array(['CBL'], dtype=object), 'D609E': array(['ERCC2'], dtype=object), 'D1709E': array(['DICER1'], dtype=object), 'D107Y': array(['PTEN'], dtype=object), 'D1709A': array(['DICER1'], dtype=object), 'A1830T': array(['BRCA1'], dtype=object), 'R64P': array(['VHL'], dtype=object), 'L370fs': array(['PIK3R1'], dtype=object), 'T2681R': array(['BRCA2'], dtype=object), 'NIN-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'P428L': array(['CBL'], dtype=object), 'I289M': array(['AKT2'], dtype=object), 'X582_splice': array(['PIK3R1'], dtype=object), 'V1838E': array(['BRCA1'], dtype=object), 'C242S': array(['TP53'], dtype=object), 'V126D': array(['CDKN2A'], dtype=object), 'L1904V': array(['BRCA2'], dtype=object), 'R659P': array(['MLH1'], dtype=object), 'C528S': array(['XPO1'], dtype=object), 'N848K': array(['PDGFRA'], dtype=object), 'E79Q': array(['NFE2L2'], dtype=object), 'R158L': array(['TP53'], dtype=object), 'Exon 20 insertions': array(['ERBB2'], dtype=object), 'T582P': array(['BRCA2'], dtype=object), 'D631A': array(['RET'], dtype=object), 'V277D': array(['SMAD3'], dtype=object), 'Q1064R': array(['ALK'], dtype=object), 'T283A': array(['CCND3'], dtype=object), 'R158C': array(['TP53'], dtype=object), 'M232T': array(['MYD88'], dtype=object), 'H123Y': array(['PTEN'], dtype=object), 'FGFR3 - BAIAP2L1 Fusion': array(['FGFR3'], dtype=object), 'R659*': array(['MLH1'], dtype=object), 'I255F': array(['TP53'], dtype=object), 'L1657P': array(['BRCA1'], dtype=object), 'M1775E': array(['BRCA1'], dtype=object), 'M1775K': array(['BRCA1'], dtype=object), 'Y646S': array(['EZH2'], dtype=object), 'V354E': array(['RBM10'], dtype=object), 'S142I': array(['BRCA2'], dtype=object), 'L536H': array(['ESR1'], dtype=object), 'Y646H': array(['EZH2'], dtype=object), 'R2842C': array(['BRCA2'], dtype=object), 'L536R': array(['ESR1'], dtype=object), 'Y646N': array(['EZH2'], dtype=object), 'M1775V': array(['BRCA1'], dtype=object), 'S34Y': array(['U2AF1'], dtype=object), 'Y646C': array(['EZH2'], dtype=object), 'V217D': array(['PTEN'], dtype=object), 'Y646F': array(['EZH2'], dtype=object), 'T674I': array(['PDGFRA'], dtype=object), 'G85R': array(['MAP2K4'], dtype=object), 'Q546K': array(['PIK3CA'], dtype=object), 'D1810A': array(['DICER1'], dtype=object), 'L1600P': array(['NOTCH1'], dtype=object), 'H662Q': array(['SF3B1'], dtype=object), 'Q79R': array(['PTPN11'], dtype=object), 'V658F': array(['JAK1'], dtype=object), 'Q546E': array(['PIK3CA'], dtype=object), 'V658A': array(['PDGFRA'], dtype=object), 'Q79K': array(['AKT1'], dtype=object), 'Q276*': array(['CCND3'], dtype=object), 'C24R': array(['BRCA1'], dtype=object), 'Q546P': array(['PIK3CA'], dtype=object), 'Q546R': array(['PIK3CA'], dtype=object), 'Q79E': array(['AKT1'], dtype=object), 'K1026E': array(['JAK1'], dtype=object), 'E746_S752delinsI': array(['EGFR'], dtype=object), 'E746_S752delinsA': array(['EGFR'], dtype=object), 'L838P': array(['EGFR'], dtype=object), 'R167W': array(['VHL'], dtype=object), 'T205A': array(['FBXW7'], dtype=object), 'F21A': array(['PTEN'], dtype=object), 'F123I': array(['CARD11'], dtype=object), 'C277R': array(['TP53'], dtype=object), 'N1178H': array(['ALK'], dtype=object), 'C277Q': array(['TP53'], dtype=object), 'R1190W': array(['BRCA2'], dtype=object), 'Y1230C': array(['MET'], dtype=object), 'P2415del': array(['NOTCH1'], dtype=object), 'D594A': array(['BRAF'], dtype=object), 'S387N': array(['SMO'], dtype=object), 'P691S': array(['EGFR'], dtype=object), 'N382H': array(['MAP2K1'], dtype=object), 'E875G': array(['ERCC4'], dtype=object), 'A126G': array(['PTEN'], dtype=object), 'D594N': array(['BRAF'], dtype=object), 'T319del': array(['PTEN'], dtype=object), 'S387Y': array(['TGFBR1'], dtype=object), 'D594V': array(['BRAF'], dtype=object), 'A126S': array(['PTEN'], dtype=object), 'L251P': array(['CARD11'], dtype=object), 'D594Y': array(['BRAF'], dtype=object), 'K1299E': array(['TET2'], dtype=object), 'R1515H': array(['CIC'], dtype=object), 'A126V': array(['PTEN'], dtype=object), 'R544W': array(['ERBB4'], dtype=object), 'V534E': array(['ESR1'], dtype=object), 'Q164R': array(['VHL'], dtype=object), 'A617T': array(['CDH1'], dtype=object), 'Exon 19 deletion/insertion': array(['EGFR'], dtype=object), 'G356A': array(['TP53'], dtype=object), 'I638F': array(['DDR2'], dtype=object), 'F129L': array(['MAP2K1'], dtype=object), 'L783F': array(['JAK1'], dtype=object), 'I1171N': array(['ALK'], dtype=object), 'R173P': array(['PTEN'], dtype=object), 'V804M': array(['RET'], dtype=object), 'C456_R481del': array(['PDGFRA'], dtype=object), 'C609Y': array(['RET'], dtype=object), 'L1596H': array(['NOTCH1'], dtype=object), 'V804G': array(['RET'], dtype=object), 'H1047R': array(['PIK3CA'], dtype=object), 'L390F': array(['MSH2'], dtype=object), 'D594G': array(['BRAF'], dtype=object), 'E218*': array(['PIK3R1'], dtype=object), 'P551_E554del': array(['KIT'], dtype=object), 'V2006L': array(['MTOR'], dtype=object), 'L826P': array(['TSC2'], dtype=object), 'ZC3H7B-BCOR Fusion': array(['BCOR'], dtype=object), 'V2006I': array(['MTOR'], dtype=object), 'G465E': array(['EGFR'], dtype=object), 'S222D': array(['MAP2K1', 'MAP2K2'], dtype=object), 'D1546N': array(['BRCA1'], dtype=object), 'K57T': array(['MAP2K1'], dtype=object), 'T576del': array(['PIK3R1'], dtype=object), 'R258H': array(['PPP2R1A'], dtype=object), 'L622H': array(['MLH1'], dtype=object), 'L866M': array(['ERBB2'], dtype=object), 'S2G': array(['SHOC2'], dtype=object), 'P1856S': array(['BRCA1'], dtype=object), 'P1856T': array(['BRCA1'], dtype=object), 'R258C': array(['PPP2R1A'], dtype=object), 'N81Y': array(['MEF2B'], dtype=object), 'CASP8L': array(['CASP8'], dtype=object), 'D594E': array(['BRAF'], dtype=object), 'R88Q': array(['PIK3CA'], dtype=object), 'N550H': array(['BRCA1'], dtype=object), 'Y280H': array(['PTPRT'], dtype=object), 'W406A': array(['SMAD3'], dtype=object), 'C44F': array(['BRCA1'], dtype=object), 'K525E': array(['FGFR2'], dtype=object), 'R976H': array(['MSH6'], dtype=object), 'TPM3-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'KANK1-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'Q2384K': array(['BRCA2'], dtype=object), 'Hypermethylation': array(['CDKN2B'], dtype=object), 'P798L': array(['BRCA1'], dtype=object), 'W131A': array(['SPOP'], dtype=object), 'W131G': array(['SPOP'], dtype=object), 'L188V': array(['VHL'], dtype=object), 'S56I': array(['CDKN2A'], dtype=object), 'I18V': array(['PMS2'], dtype=object), 'EBF1-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'S1613C': array(['BRCA1'], dtype=object), 'R1627': array(['EP300'], dtype=object), 'R498L': array(['PTPN11'], dtype=object), 'S1613G': array(['BRCA1'], dtype=object), 'P48R': array(['PRDM1'], dtype=object), 'P48T': array(['PRDM1'], dtype=object), 'G20E': array(['PTEN'], dtype=object), 'R117G': array(['CHEK2'], dtype=object), 'P48L': array(['CDKN2A'], dtype=object), 'E50K': array(['SPOP'], dtype=object), 'L747_E749del': array(['EGFR'], dtype=object), 'STRN-PDGFRA Fusion': array(['PDGFRA'], dtype=object), 'C1787S': array(['BRCA1'], dtype=object), 'D401N': array(['CARD11'], dtype=object), 'FGFR2-CCDC6 Fusion': array(['FGFR2'], dtype=object), 'T1481fs': array(['MAP3K1'], dtype=object), 'E554_I571del': array(['KIT'], dtype=object), 'L82P': array(['BRCA1'], dtype=object), 'CIC-DUX4 Fusion': array(['CIC'], dtype=object), 'D106A': array(['PTPN11'], dtype=object), 'R100*': array(['MLH1'], dtype=object), 'G466E': array(['BRAF'], dtype=object), 'G186R': array(['KEAP1'], dtype=object), 'E29V': array(['BRCA1'], dtype=object), 'TMPRSS2-ERG Fusion': array(['ERG'], dtype=object), 'C360R': array(['BRCA1'], dtype=object), 'TGFBR1*6A': array(['TGFBR1'], dtype=object), 'S119N': array(['SPOP'], dtype=object), 'H1918Y': array(['BRCA2'], dtype=object), 'R100T': array(['SMAD4'], dtype=object), 'G1194D': array(['BRCA2'], dtype=object), 'V600D_K601insFGLAT': array(['BRAF'], dtype=object), 'V559G': array(['KIT'], dtype=object), 'V559D': array(['KIT'], dtype=object), 'L2230V': array(['MTOR'], dtype=object), 'V559C': array(['KIT'], dtype=object), 'D839G': array(['FLT3'], dtype=object), 'V559A': array(['KIT'], dtype=object), 'L412F': array(['SMO'], dtype=object), 'K550_K558del': array(['KIT'], dtype=object), 'G665A': array(['ERCC2'], dtype=object), 'P262H': array(['ERBB3'], dtype=object), 'G123R': array(['IDH1'], dtype=object), 'G123S': array(['CARD11'], dtype=object), 'L358R': array(['BRCA1'], dtype=object), 'E490K': array(['KIT'], dtype=object), 'P261L': array(['RAF1'], dtype=object), 'S33F': array(['CTNNB1'], dtype=object), 'H206D': array(['TSC1'], dtype=object), 'E311_K312del': array(['PDGFRA'], dtype=object), 'C382R': array(['FGFR2'], dtype=object), 'L448P': array(['TSC2'], dtype=object), 'MSN-ROS1 Fusion': array(['ROS1'], dtype=object), 'A121E': array(['PTEN'], dtype=object), 'Y373C': array(['FGFR3'], dtype=object), 'X1007_splice': array(['MET'], dtype=object), 'D171G': array(['RUNX1'], dtype=object), 'L747_A750delinsP': array(['EGFR'], dtype=object), 'P654L': array(['MLH1'], dtype=object), 'D171N': array(['RUNX1'], dtype=object), 'P463L': array(['ERCC2'], dtype=object), 'K642E': array(['KIT'], dtype=object), 'L747_A750del': array(['EGFR'], dtype=object), 'R421*': array(['PMS2'], dtype=object), 'R592H': array(['ERCC2'], dtype=object), 'TPR-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'F328V': array(['TP53'], dtype=object), 'Y510C': array(['FANCA'], dtype=object), 'V1070E': array(['MET'], dtype=object), 'C124R': array(['PTEN'], dtype=object), 'L1273F': array(['RAD50'], dtype=object), 'BCR-PDGFRA Fusion': array(['PDGFRA'], dtype=object), 'V45L': array(['PTPN11'], dtype=object), 'G44S': array(['MED12'], dtype=object), 'R418G': array(['ETV6'], dtype=object), 'Y3098H': array(['BRCA2'], dtype=object), 'N486_P490del': array(['BRAF'], dtype=object), 'K289E': array(['PTEN'], dtype=object), 'R175L': array(['TP53'], dtype=object), 'Q227R': array(['GNAS'], dtype=object), 'A864T': array(['EGFR'], dtype=object), 'R175H': array(['TP53'], dtype=object), 'G909R': array(['CDK12'], dtype=object), 'R175C': array(['TP53'], dtype=object), 'E872K': array(['ERBB4'], dtype=object), 'E135K': array(['PIM1'], dtype=object), 'E23fs': array(['BRCA1'], dtype=object), 'E142D': array(['PIM1'], dtype=object), 'S384F': array(['BRCA2'], dtype=object), 'P316L': array(['TP53'], dtype=object), 'R634W': array(['KIT'], dtype=object), 'R100A': array(['IDH1'], dtype=object), 'D595V': array(['ERBB4'], dtype=object), 'P336S': array(['SMAD3'], dtype=object), 'K1690N': array(['BRCA2'], dtype=object), 'M1689R': array(['BRCA1'], dtype=object), 'S279Y': array(['POLE'], dtype=object), 'T779fs': array(['MAP3K1'], dtype=object), 'P278L': array(['TP53'], dtype=object), 'N463S': array(['KIT'], dtype=object), 'E1356G': array(['NF1'], dtype=object), 'T417_D419delinsI': array(['KIT'], dtype=object), 'A614D': array(['TSC2'], dtype=object), 'W279*': array(['RUNX1'], dtype=object), 'S330A': array(['ELF3'], dtype=object), 'Q1756C': array(['BRCA1'], dtype=object), 'F317L': array(['ABL1'], dtype=object), 'K376N': array(['PIK3R2'], dtype=object), 'F248S': array(['PTPRT'], dtype=object), 'E518K': array(['SMO'], dtype=object), 'G248V': array(['AR'], dtype=object), 'E545Q': array(['PIK3CA'], dtype=object), 'LMNA-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'EGFR-PURB Fusion': array(['EGFR'], dtype=object), 'S584L': array(['PDGFRA'], dtype=object), 'E518A': array(['SMO'], dtype=object), 'N132K': array(['BRCA1'], dtype=object), 'E545G': array(['PIK3CA'], dtype=object), 'K97M': array(['MAP2K1'], dtype=object), 'E545A': array(['PIK3CA'], dtype=object), 'R3052Q': array(['BRCA2'], dtype=object), 'L747_T751del': array(['EGFR'], dtype=object), 'E545K': array(['PIK3CA'], dtype=object), 'G322S': array(['ARAF'], dtype=object), 'W368H': array(['SMAD2'], dtype=object), 'V648G': array(['PTPRT'], dtype=object), 'T733I': array(['ERBB2'], dtype=object), 'P278R': array(['TP53'], dtype=object), 'T1852S': array(['BRCA1'], dtype=object), 'V648I': array(['RET'], dtype=object), 'E758G': array(['EGFR'], dtype=object), 'H492R': array(['FANCA'], dtype=object), 'D74Y': array(['CDKN2A'], dtype=object), 'W257C': array(['PPP2R1A'], dtype=object), 'L52R': array(['AKT1'], dtype=object), 'W257G': array(['PPP2R1A'], dtype=object), 'I1171T': array(['ALK'], dtype=object), 'Y553_K558del': array(['KIT'], dtype=object), 'M18K': array(['BRCA1'], dtype=object), 'D258N': array(['SMAD3'], dtype=object), 'A723D': array(['JAK1'], dtype=object), 'K291Q': array(['TP53'], dtype=object), 'Q50P': array(['CDKN2A'], dtype=object), 'P34L': array(['KRAS'], dtype=object), 'Q395*': array(['MSH2'], dtype=object), 'R601Q': array(['ERCC2'], dtype=object), 'CUX1-FGFR1 Fusion': array(['FGFR1'], dtype=object), 'A1200V': array(['ALK'], dtype=object), 'M664I': array(['FLT3'], dtype=object), 'M18T': array(['BRCA1'], dtype=object), 'L147F': array(['BRCA1'], dtype=object), 'Q538P': array(['NF2'], dtype=object), 'D29H': array(['NFE2L2'], dtype=object), 'Q50*': array(['CDKN2A'], dtype=object), 'C2060G': array(['ROS1'], dtype=object), 'H168R': array(['TP53'], dtype=object), 'N2875K': array(['ATM'], dtype=object), 'D29Y': array(['NFE2L2'], dtype=object), 'H231R': array(['SMO'], dtype=object), 'H168N': array(['TP53'], dtype=object), 'L63P': array(['VHL'], dtype=object), 'L63V': array(['DDR2'], dtype=object), 'D252G': array(['PTEN'], dtype=object), 'I290A': array(['CCND3'], dtype=object), 'E552K': array(['PIK3CB'], dtype=object), 'N546K': array(['FGFR1'], dtype=object), 'S1986Y': array(['ROS1'], dtype=object), 'FLT3 internal tandem duplications': array(['FLT3'], dtype=object), 'I347M': array(['FBXW7'], dtype=object), 'E40T': array(['RHOA'], dtype=object), 'S1986F': array(['ROS1'], dtype=object), 'V769_D770insGVV': array(['EGFR'], dtype=object), 'S267_D273dup': array(['FGFR2'], dtype=object), 'V84L': array(['VHL'], dtype=object), 'D1778Y': array(['BRCA1'], dtype=object), 'R93W': array(['PIK3CA'], dtype=object), 'D3095E': array(['BRCA2'], dtype=object), 'R93Q': array(['PIK3CA'], dtype=object), 'G67S': array(['CDKN2A'], dtype=object), 'A77T': array(['RIT1'], dtype=object), 'P96Q': array(['PTEN'], dtype=object), 'A77P': array(['RIT1'], dtype=object), 'F28L': array(['HRAS', 'KRAS'], dtype=object), 'D1778H': array(['BRCA1'], dtype=object), 'M1043V': array(['PIK3CA'], dtype=object), 'N198_F199delinsI': array(['TSC1'], dtype=object), 'R200W': array(['VHL'], dtype=object), 'D1778N': array(['BRCA1'], dtype=object), 'R248W': array(['TP53'], dtype=object), 'R248Q': array(['TP53'], dtype=object), 'EZR-ROS1 Fusion': array(['ROS1'], dtype=object), 'P81T': array(['CDKN2A'], dtype=object), 'T507K': array(['PTPN11'], dtype=object), 'H773dup': array(['EGFR'], dtype=object), 'K575M': array(['SMO'], dtype=object), 'R248C': array(['FGFR3'], dtype=object), 'R159G': array(['PTEN'], dtype=object), 'R248L': array(['TP53'], dtype=object), 'R248K': array(['TP53'], dtype=object), 'A77S': array(['RIT1'], dtype=object), 'L858Q': array(['EGFR'], dtype=object), 'L858R': array(['EGFR'], dtype=object), 'T50I': array(['NRAS'], dtype=object), 'D557H': array(['PIK3R2'], dtype=object), 'I32del': array(['PTEN'], dtype=object), 'G751R': array(['MSH2'], dtype=object), 'K2472T': array(['BRCA2'], dtype=object), 'S463P': array(['ESR1'], dtype=object), 'L858M': array(['EGFR'], dtype=object), 'V561A': array(['PDGFRA'], dtype=object), 'G13C': array(['HRAS', 'KRAS'], dtype=object), 'A60V': array(['CDKN2A'], dtype=object), 'G464E': array(['BRAF'], dtype=object), 'E1682V': array(['BRCA1'], dtype=object), 'K50E': array(['BRCA1'], dtype=object), 'L23F': array(['PTEN'], dtype=object), 'E380Q': array(['ESR1'], dtype=object), 'K50L': array(['BRCA1'], dtype=object), 'T1720I': array(['BRCA1'], dtype=object), 'G464V': array(['BRAF'], dtype=object), 'G464R': array(['BRAF'], dtype=object), 'C136Y': array(['PTEN'], dtype=object), 'M136R': array(['STK11'], dtype=object), 'D537E': array(['SMAD4'], dtype=object), 'E1682K': array(['BRCA1'], dtype=object), 'I111P': array(['MAP2K1'], dtype=object), 'I111R': array(['MAP2K1'], dtype=object), 'R1563S': array(['CREBBP'], dtype=object), 'N1068fs*4': array(['PIK3CA'], dtype=object), 'R161G': array(['PTEN'], dtype=object), 'Q556_K558del': array(['KIT'], dtype=object), 'T1343I': array(['ALK'], dtype=object), 'I111A': array(['MAP2K1'], dtype=object), 'D3170G': array(['BRCA2'], dtype=object), 'I28T': array(['NFE2L2'], dtype=object), 'S566_E571delinsR': array(['PDGFRA'], dtype=object), 'P38L': array(['PTEN'], dtype=object), 'BCR-FGFR1 Fusion': array(['FGFR1'], dtype=object), 'I111N': array(['MAP2K1'], dtype=object), 'K288Q': array(['PIK3R1'], dtype=object), 'G106D': array(['PIK3CA'], dtype=object), 'R48W': array(['PIK3CB'], dtype=object), 'R172S': array(['IDH2'], dtype=object), 'K765R': array(['CDK12'], dtype=object), 'R24C': array(['CDK4'], dtype=object), 'TRIM24-BRAF Fusion': array(['BRAF'], dtype=object), 'R172K': array(['IDH2'], dtype=object), 'R172M': array(['IDH2'], dtype=object), 'A2717S': array(['BRCA2'], dtype=object), 'L481F': array(['RASA1'], dtype=object), 'W237_Y242del': array(['PIK3R1'], dtype=object), 'FGFR2?PPHLN1 Fusion': array(['FGFR2'], dtype=object), 'R24P': array(['CDKN2A'], dtype=object), 'H773insLGNP': array(['EGFR'], dtype=object), 'R172G': array(['IDH2'], dtype=object), 'R1060H': array(['ALK'], dtype=object), 'R110P': array(['TP53'], dtype=object), 'C620Y': array(['EGFR'], dtype=object), 'A859_L883delinsV': array(['EGFR'], dtype=object), 'V60E': array(['MAP2K1'], dtype=object), 'T1354M': array(['BRCA2'], dtype=object), 'R80C': array(['RUNX1'], dtype=object), 'V60M': array(['MAP2K1'], dtype=object), 'E459K': array(['ABL1'], dtype=object), 'G23D': array(['CDKN2A'], dtype=object), 'N296I': array(['SPOP'], dtype=object), 'T117M': array(['MLH1'], dtype=object), '596_619splice': array(['HLA-A'], dtype=object), 'R80P': array(['CDKN2A'], dtype=object), 'R5Q': array(['RHOA'], dtype=object), 'V319D': array(['SMAD2'], dtype=object), 'R110L': array(['TP53'], dtype=object), 'D287H': array(['BRAF'], dtype=object), 'R544S': array(['BTK'], dtype=object), 'S1841A': array(['BRCA1'], dtype=object), 'E439del': array(['PIK3R1'], dtype=object), 'GIT2-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'G375P': array(['CBL'], dtype=object), 'V852I': array(['KIT'], dtype=object), 'R957Q': array(['SF3B1'], dtype=object), 'S1841N': array(['BRCA1'], dtype=object), 'I35S': array(['CTNNB1'], dtype=object), 'D402Y': array(['KDM5C'], dtype=object), 'S1841R': array(['BRCA1'], dtype=object), 'CHTOP-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'G1706A': array(['BRCA1'], dtype=object), 'P70R': array(['SOX9'], dtype=object), 'L576P': array(['KIT'], dtype=object), 'G1706E': array(['BRCA1'], dtype=object), 'V11A': array(['BRCA1'], dtype=object), 'Y406H': array(['SMAD2'], dtype=object), 'C334S': array(['ERBB2'], dtype=object), 'F1761S': array(['BRCA1'], dtype=object), 'N655K': array(['KIT'], dtype=object), 'F161L': array(['RAB35'], dtype=object), 'D1010H': array(['MET'], dtype=object), 'F1761I': array(['BRCA1'], dtype=object), 'K975E': array(['CDK12'], dtype=object), 'H179Q': array(['TP53'], dtype=object), 'F468C': array(['BRAF'], dtype=object), 'L128F': array(['VHL'], dtype=object), 'T878A': array(['AR'], dtype=object), 'E542Q': array(['PIK3CA'], dtype=object), 'G719S': array(['EGFR'], dtype=object), 'K375A': array(['SMAD2'], dtype=object), 'E542V': array(['PIK3CA'], dtype=object), 'E612_F613insGYVDFREYEYDLKWEFRPRENLEF': array(['FLT3'], dtype=object), 'H878Y': array(['ERBB2'], dtype=object), 'R961W': array(['KDR'], dtype=object), 'G719A': array(['EGFR'], dtype=object), 'R183Q': array(['PPP2R1A'], dtype=object), 'N826Y': array(['EGFR'], dtype=object), 'G719D': array(['EGFR'], dtype=object), 'V592A': array(['FLT3'], dtype=object), 'T241M': array(['SMO'], dtype=object), 'N826S': array(['EGFR'], dtype=object), 'E542K': array(['ERBB4', 'PIK3CA'], dtype=object), 'G376R': array(['PIK3R1'], dtype=object), 'N1387A': array(['TET2'], dtype=object), 'H179Y': array(['TP53'], dtype=object), 'V271L': array(['BRCA1'], dtype=object), 'V271A': array(['AKT1'], dtype=object), 'V600R': array(['BRAF'], dtype=object), 'T47D': array(['BRCA1'], dtype=object), 'V600K': array(['BRAF'], dtype=object), 'V600M': array(['BRAF'], dtype=object), 'V2098I': array(['ROS1'], dtype=object), 'V600E': array(['BRAF'], dtype=object), 'V600D': array(['BRAF'], dtype=object), 'V600G': array(['BRAF'], dtype=object), 'S921R': array(['EGFR'], dtype=object), 'R143Q': array(['ERBB2'], dtype=object), 'L536Q': array(['ESR1'], dtype=object), 'V750E': array(['ERBB2'], dtype=object), 'K641N': array(['FGFR2'], dtype=object), 'K641R': array(['FGFR2'], dtype=object), 'R415G': array(['KEAP1'], dtype=object), 'M552_K558del': array(['KIT'], dtype=object), 'I162M': array(['TP53'], dtype=object), 'R873Q': array(['RET'], dtype=object), 'G216R': array(['AR'], dtype=object), 'R79P': array(['CDKN2A'], dtype=object), 'L283_D294del': array(['CCND1'], dtype=object), 'S310Y': array(['ERBB2'], dtype=object), 'E285K': array(['TP53'], dtype=object), 'F1704S': array(['BRCA1'], dtype=object), 'L146R': array(['TSC2'], dtype=object), 'Q249E': array(['CBL'], dtype=object), 'K420A': array(['SMAD2'], dtype=object), 'A151T': array(['RAB35'], dtype=object), 'E285V': array(['TP53'], dtype=object), 'Y68H': array(['PTEN'], dtype=object), 'C275S': array(['TP53'], dtype=object), 'S36Y': array(['NFKBIA'], dtype=object), 'T352M': array(['NF2'], dtype=object), 'R680*': array(['MSH2'], dtype=object), 'S1303N': array(['TET2'], dtype=object), 'F154L': array(['PTEN'], dtype=object), 'R844H': array(['GNAS'], dtype=object), 'Y68D': array(['PTEN'], dtype=object), 'EGFR-RAD51 Fusion': array(['EGFR'], dtype=object), 'G334R': array(['TP53'], dtype=object), 'Q201H': array(['BRAF'], dtype=object), 'D641N': array(['FGFR3'], dtype=object), 'S646F': array(['JAK1'], dtype=object), 'F958S': array(['JAK1'], dtype=object), 'G591V': array(['NUP93'], dtype=object), 'F958V': array(['JAK1'], dtype=object), 'Y599_D600insSTDNEYFYVDFREYEY': array(['FLT3'], dtype=object), 'A120S': array(['MLH1'], dtype=object), 'P417A': array(['CBL'], dtype=object), 'E143K': array(['BRCA1'], dtype=object), 'Q809R': array(['ERBB3'], dtype=object), 'W719*': array(['CDK12'], dtype=object), 'EWSR1-YY1 Fusion': array(['EWSR1'], dtype=object), 'D60N': array(['KIT'], dtype=object), 'EGFR-KDD': array(['EGFR'], dtype=object), 'Y1414C': array(['EP300'], dtype=object), 'X1008_splice': array(['MET'], dtype=object), 'R671Q': array(['BRAF'], dtype=object), 'K550_V555delinsI': array(['KIT'], dtype=object), 'K128Q': array(['PTEN'], dtype=object), 'I42V': array(['BRCA1'], dtype=object), 'K128N': array(['PTEN'], dtype=object), 'K1452N': array(['MTOR'], dtype=object), 'S123T': array(['MAP2K1'], dtype=object), 'Q816*': array(['MSH2'], dtype=object), 'T196A': array(['HNF1A'], dtype=object), 'G34V': array(['CTNNB1'], dtype=object), 'BCOR-RARA Fusion': array(['BCOR'], dtype=object), 'C456_N468del': array(['PDGFRA'], dtype=object), 'R290G': array(['TP53'], dtype=object), 'FGFR2-MGEA5 Fusion': array(['FGFR2'], dtype=object), 'D186A': array(['TP53'], dtype=object), 'V560G': array(['KIT'], dtype=object), 'V560D': array(['KIT'], dtype=object), 'V560E': array(['KIT'], dtype=object), 'A1708V': array(['BRCA1'], dtype=object), 'R267Q': array(['TP53'], dtype=object), 'L1224F': array(['MED12'], dtype=object), 'R561C': array(['PDGFRB'], dtype=object), 'V356R': array(['SMAD3'], dtype=object), 'L755S': array(['ERBB2'], dtype=object), 'L755P': array(['ERBB2'], dtype=object), 'E207K': array(['MAP2K2'], dtype=object), 'P1675L': array(['TSC2'], dtype=object), 'Y384A': array(['SMAD3'], dtype=object), 'K483E': array(['BRAF'], dtype=object), 'V842I': array(['ERBB2'], dtype=object), 'D2312V': array(['BRCA2'], dtype=object), 'K483M': array(['BRAF'], dtype=object), 'W563L': array(['RB1'], dtype=object), 'K65M': array(['BRCA1'], dtype=object), 'R134Q': array(['MAP2K4'], dtype=object), 'T730S': array(['FGFR2'], dtype=object), 'L704N': array(['EGFR'], dtype=object), 'R361C': array(['SMAD4'], dtype=object), 'H78Q': array(['RUNX1'], dtype=object), 'H1805P': array(['BRCA1'], dtype=object), 'Q347_A348del': array(['ARAF'], dtype=object), 'P577_W582delinsPYD': array(['KIT'], dtype=object), 'F877L': array(['AR'], dtype=object), 'H105R': array(['TSC1'], dtype=object), 'K1434I': array(['BRCA2'], dtype=object), 'C378R': array(['PIK3CA'], dtype=object), 'M117V': array(['SPOP'], dtype=object), 'A391E': array(['FGFR3'], dtype=object), 'BCOR-CCNB3 Fusion': array(['BCOR'], dtype=object), 'V422del': array(['ESR1'], dtype=object), 'R1699L': array(['BRCA1'], dtype=object), 'P1776S': array(['BRCA1'], dtype=object), 'G34E': array(['CTNNB1'], dtype=object), 'A633T': array(['PDGFRA'], dtype=object), 'W742L': array(['AR'], dtype=object), 'A633V': array(['PDGFRA'], dtype=object), 'D641G': array(['FGFR3'], dtype=object), 'I843_D846del': array(['PDGFRA'], dtype=object), 'W742C': array(['AR'], dtype=object), 'P2417A': array(['NOTCH1'], dtype=object), 'N1026S': array(['APC'], dtype=object), 'V487_P492delinsA': array(['BRAF'], dtype=object), 'S768I': array(['EGFR'], dtype=object), 'S206C': array(['BRCA2'], dtype=object), 'ATF7IP-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'E365K': array(['PIK3CA'], dtype=object), 'Y418_D419del': array(['KIT'], dtype=object), 'K550_W557del': array(['KIT'], dtype=object), 'M1293A': array(['TET2'], dtype=object), 'E1214K': array(['BRCA1'], dtype=object), 'G1269S': array(['ALK'], dtype=object), 'H845_N848delinsP': array(['PDGFRA'], dtype=object), 'R836C': array(['EGFR'], dtype=object), 'P81L': array(['CDKN2A'], dtype=object), 'A263V': array(['FGFR1'], dtype=object), 'L617M': array(['FGFR2'], dtype=object), 'G1269A': array(['ALK'], dtype=object), 'MIR143-NOTCH1 Fusion': array(['NOTCH1'], dtype=object), 'Promoter Mutations': array(['TERT'], dtype=object), 'S151A': array(['BRAF'], dtype=object), 'R290H': array(['TP53'], dtype=object), 'R201H': array(['GNAS'], dtype=object), 'C618R': array(['JAK2', 'RET'], dtype=object), 'R287A': array(['SMAD3'], dtype=object), 'P95H': array(['SRSF2'], dtype=object), 'Y1703H': array(['BRCA1'], dtype=object), 'R201C': array(['GNAS'], dtype=object), 'R505L': array(['FBXW7'], dtype=object), 'SDC4-ROS1 Fusion': array(['ROS1'], dtype=object), 'R201Q': array(['RUNX1'], dtype=object), 'R2784W': array(['BRCA2'], dtype=object), 'W535L': array(['SMO'], dtype=object), 'E267G': array(['AKT1'], dtype=object), 'V705M': array(['TSC2'], dtype=object), 'R249W': array(['TP53'], dtype=object), 'V35M': array(['MAP2K2'], dtype=object), 'R249S': array(['TP53'], dtype=object), 'L597V': array(['BRAF'], dtype=object), 'H1382Y': array(['TET2'], dtype=object), 'F354L': array(['STK11'], dtype=object), 'L597R': array(['BRAF'], dtype=object), 'L597S': array(['BRAF'], dtype=object), 'L597Q': array(['BRAF'], dtype=object), 'L1705P': array(['BRCA1'], dtype=object), 'ARv567es': array(['AR'], dtype=object), 'V194M': array(['FLT3'], dtype=object), 'D572A': array(['KIT'], dtype=object), 'MYC-nick': array(['MYC'], dtype=object), 'C420R': array(['PIK3CA'], dtype=object), 'D1733G': array(['BRCA1'], dtype=object), 'I103N': array(['MAP2K1'], dtype=object), 'Y791F': array(['RET'], dtype=object), 'CUL1-BRAF Fusion': array(['BRAF'], dtype=object), 'R796G': array(['KIT'], dtype=object), 'V270A': array(['AKT1'], dtype=object), 'Y901C': array(['CDK12'], dtype=object), 'E49K': array(['AKT1'], dtype=object), 'N233Y': array(['SOS1'], dtype=object), 'G785S': array(['ERBB4'], dtype=object), 'V32G': array(['NFE2L2'], dtype=object), 'G863D': array(['EGFR'], dtype=object), 'T80R': array(['NFE2L2'], dtype=object), 'E812K': array(['ERBB2'], dtype=object), 'W398V': array(['SMAD4'], dtype=object), 'M244V': array(['ABL1'], dtype=object), 'M391R': array(['FGFR2'], dtype=object), 'T1977K': array(['MTOR'], dtype=object), 'T80A': array(['NFE2L2'], dtype=object), 'E81K': array(['PIK3CA'], dtype=object), 'G863S': array(['EGFR'], dtype=object), 'K310R': array(['FGFR2'], dtype=object), 'T80K': array(['NFE2L2'], dtype=object), 'S1497A': array(['BRCA1'], dtype=object), 'Y1235D': array(['MET'], dtype=object), 'S1025C': array(['JAK2'], dtype=object), 'Q2405Rfs*17': array(['NOTCH1'], dtype=object), 'M253K': array(['FOXA1'], dtype=object), 'N676S': array(['FLT3'], dtype=object), 'G31A': array(['NFE2L2'], dtype=object), 'C24Y': array(['BRCA1'], dtype=object), 'N676K': array(['FLT3'], dtype=object), 'G914R': array(['PIK3CA'], dtype=object), '550_592del': array(['KIT'], dtype=object), 'N78S': array(['VHL'], dtype=object), 'G31V': array(['NFE2L2'], dtype=object), 'N676D': array(['FLT3'], dtype=object), 'G31R': array(['NFE2L2'], dtype=object), 'A23E': array(['CDKN2B'], dtype=object), 'FGFR2-BICC1 Fusion': array(['FGFR2'], dtype=object), 'L455M': array(['FOXA1'], dtype=object), 'A598V': array(['BRAF'], dtype=object), 'R834Q': array(['FLT3'], dtype=object), 'A598T': array(['BRAF'], dtype=object), 'T783A': array(['EGFR'], dtype=object), 'FUS-ERG Fusion': array(['ERG'], dtype=object), 'C712R': array(['RB1'], dtype=object), 'K218T': array(['PTPRT'], dtype=object), 'R173H': array(['PTEN'], dtype=object), 'E946*': array(['MSH6'], dtype=object), 'R49L': array(['MAP2K1'], dtype=object), 'R173C': array(['BRIP1', 'PTEN'], dtype=object), 'L28P': array(['BRCA1'], dtype=object), 'C324Y': array(['SMAD4'], dtype=object), 'R659L': array(['MLH1'], dtype=object), 'A2034V': array(['MTOR'], dtype=object), 'D61Y': array(['PTPN11'], dtype=object), 'A121P': array(['PTEN'], dtype=object), 'A339V': array(['NKX2-1'], dtype=object), 'A121V': array(['PTEN'], dtype=object), 'F53S': array(['MAP2K1'], dtype=object), 'p61BRAF': array(['BRAF'], dtype=object), 'N987I': array(['BRCA2'], dtype=object), 'V1092I': array(['MET'], dtype=object), 'F53C': array(['MAP2K1'], dtype=object), 'KIF5B-ALK Fusion': array(['ALK'], dtype=object), 'D61N': array(['PTPN11'], dtype=object), 'V569_L576del': array(['KIT'], dtype=object), 'F53L': array(['MAP2K1'], dtype=object), 'Q157P': array(['U2AF1'], dtype=object), 'I204T': array(['MAP2K1'], dtype=object), 'S432L': array(['ESR1'], dtype=object), 'C250T': array(['TERT'], dtype=object), 'L63F': array(['BRCA1'], dtype=object), 'F522C': array(['KIT'], dtype=object), 'D717V': array(['KDR'], dtype=object), 'E161del': array(['CHEK2'], dtype=object), 'K129E': array(['SPOP'], dtype=object), 'A919V': array(['RET'], dtype=object), 'P2273S': array(['MTOR'], dtype=object), 'R47Q': array(['MAP2K1'], dtype=object), 'P551_W557delinsL': array(['KIT'], dtype=object), 'M299I': array(['BRIP1'], dtype=object), 'E1660G': array(['BRCA1'], dtype=object), 'S37F': array(['CTNNB1'], dtype=object), 'V143A': array(['TP53'], dtype=object), 'I562M': array(['PDGFRA'], dtype=object), 'P85L': array(['CHEK2'], dtype=object), 'N1100Y': array(['MET'], dtype=object), 'T1685A': array(['BRCA1'], dtype=object), 'G375C': array(['FGFR3'], dtype=object), 'D617G': array(['FGFR3'], dtype=object), 'L485_P490delinsY': array(['BRAF'], dtype=object), 'S308A': array(['ELF3', 'BRCA1'], dtype=object), 'R470C': array(['KEAP1'], dtype=object), 'L485_P490delinsF': array(['BRAF'], dtype=object), 'P596L': array(['EGFR'], dtype=object), 'T73I': array(['MYC', 'PTPN11'], dtype=object), 'H179R': array(['TP53'], dtype=object), 'Y27S': array(['PTEN'], dtype=object), 'V1180L': array(['ALK'], dtype=object), 'P4309A': array(['FAT1'], dtype=object), 'N454D': array(['CBL'], dtype=object), 'K413E': array(['NF2'], dtype=object), 'I852M': array(['RET'], dtype=object), 'N1878K': array(['BRCA2'], dtype=object), 'M1689T': array(['BRCA1'], dtype=object), 'S505N': array(['MPL'], dtype=object), 'M1411T': array(['BRCA1'], dtype=object), 'Y823D': array(['KIT'], dtype=object), 'H179N': array(['TP53'], dtype=object), 'L770V': array(['FGFR2'], dtype=object), 'Q1554H': array(['TSC2'], dtype=object), 'Y285C': array(['ERBB4'], dtype=object), 'W24S': array(['NFE2L2'], dtype=object), 'W24R': array(['NFE2L2'], dtype=object), 'H284P': array(['CTCF'], dtype=object), 'A18D': array(['KRAS'], dtype=object), 'T630I': array(['BRCA2'], dtype=object), 'E709K': array(['EGFR'], dtype=object), 'S2215F': array(['MTOR'], dtype=object), 'T315I': array(['ABL1'], dtype=object), 'A707T': array(['PTPRT'], dtype=object), 'W24C': array(['NFE2L2'], dtype=object), 'N553S': array(['PIK3CB'], dtype=object), 'Q56P': array(['MAP2K1'], dtype=object), 'H284N': array(['CTCF'], dtype=object), 'R280K': array(['TP53'], dtype=object), 'D1778G': array(['BRCA1'], dtype=object), 'A298T': array(['CDH1'], dtype=object), 'T1977R': array(['MTOR'], dtype=object), 'T1977S': array(['MTOR'], dtype=object), 'Copy Number Loss': array(['FBXW7'], dtype=object), 'A459V': array(['SMO'], dtype=object), 'R304W': array(['STK11'], dtype=object), 'H875Y': array(['AR'], dtype=object), 'G434R': array(['SOS1'], dtype=object), 'E124Q': array(['PIM1'], dtype=object), 'Q545A': array(['AGO2'], dtype=object), 'P1637L': array(['BRCA1'], dtype=object), 'L929S': array(['BRCA2'], dtype=object), 'CEP110-FGFR1 Fusion': array(['FGFR1'], dtype=object), 'F341V': array(['PTEN'], dtype=object), 'Exon 9 mutations': array(['KIT'], dtype=object), 'T1977I': array(['MTOR'], dtype=object), 'TP53BP1-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'G1529R': array(['BRCA2'], dtype=object), 'V104M': array(['ERBB3'], dtype=object), 'S37Y': array(['CTNNB1'], dtype=object), 'E139D': array(['PTPN11'], dtype=object), 'G469E': array(['BRAF'], dtype=object), 'R99P': array(['CDKN2A'], dtype=object), 'E636K': array(['FGFR2'], dtype=object), 'R1262A': array(['TET2'], dtype=object), 'L181P': array(['PTEN'], dtype=object), 'S37C': array(['CTNNB1'], dtype=object), 'S840_N841insGS': array(['FLT3'], dtype=object), 'S37A': array(['CTNNB1'], dtype=object), 'Y426A': array(['SMAD2'], dtype=object), 'R280A': array(['TP53'], dtype=object), 'R202S': array(['TP53'], dtype=object), 'L611_E612insCSSDNEYFYVDFREYEYDLKWEFPRENL': array(['FLT3'], dtype=object), 'D473H': array(['SMO'], dtype=object), 'D473G': array(['SMO'], dtype=object), 'S567L': array(['RB1'], dtype=object), 'R886W': array(['RET'], dtype=object), 'V465M': array(['SMAD4'], dtype=object), 'R280T': array(['TP53'], dtype=object), 'T195I': array(['AKT1'], dtype=object), 'P655R': array(['BRCA2'], dtype=object), 'A504_Y505ins': array(['KIT'], dtype=object), 'E1060A': array(['BRCA1'], dtype=object), 'K517R': array(['FGFR2'], dtype=object), 'CD74-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'N480del': array(['RB1'], dtype=object), 'T599_V600insEAT': array(['BRAF'], dtype=object), 'H355M': array(['AKT2'], dtype=object), 'M737I': array(['FLT3'], dtype=object), 'C49Y': array(['CARD11'], dtype=object), 'G419V': array(['SMAD4'], dtype=object), 'Y87N': array(['SPOP'], dtype=object), 'Y65C': array(['PTEN'], dtype=object), 'L1593P': array(['NOTCH1'], dtype=object), 'E2014K': array(['MTOR'], dtype=object), 'L330R': array(['TP53'], dtype=object), 'R264C': array(['PPP6C'], dtype=object), 'S1206Y': array(['ALK'], dtype=object), 'Y87C': array(['SPOP'], dtype=object), 'K745_A750del': array(['EGFR'], dtype=object), 'A95D': array(['BAP1'], dtype=object), 'P648L': array(['MLH1'], dtype=object), 'K82T': array(['TSC1'], dtype=object), 'S247F': array(['RXRA'], dtype=object), 'C1365Y': array(['BRCA2'], dtype=object), 'Y835F': array(['ERBB2'], dtype=object), 'R1391S': array(['NF1'], dtype=object), 'I122V': array(['PTEN'], dtype=object), 'H179L': array(['TP53'], dtype=object), 'R1594Q': array(['NOTCH1'], dtype=object), 'S227A': array(['FBXW7'], dtype=object), 'P753S': array(['EGFR'], dtype=object), 'G1596V': array(['TSC2'], dtype=object), 'R1391G': array(['NF1'], dtype=object), 'A763_Y764insFQEA': array(['EGFR'], dtype=object), 'KIAA1509-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'V843I': array(['EGFR'], dtype=object), 'K52R': array(['BRIP1'], dtype=object), 'D820A': array(['KIT'], dtype=object), 'P169H': array(['PTEN'], dtype=object), 'D769A': array(['ERBB2'], dtype=object), 'C1265S': array(['BRCA2'], dtype=object), 'C47G': array(['BRCA1'], dtype=object), 'E1282V': array(['BRCA1'], dtype=object), 'K382E': array(['CBL'], dtype=object), 'D769H': array(['ERBB2'], dtype=object), 'R2336P': array(['BRCA2'], dtype=object), 'E330G': array(['SMAD4'], dtype=object), 'D820E': array(['KIT'], dtype=object), 'T167A': array(['FLT3', 'PTEN'], dtype=object), 'C47S': array(['CCND1'], dtype=object), 'G776delinsVC': array(['ERBB2'], dtype=object), 'K2729N': array(['BRCA2'], dtype=object), 'N505I': array(['KIT'], dtype=object), 'V894I': array(['BRCA2'], dtype=object), 'K335I': array(['CTNNB1'], dtype=object), 'P531L': array(['EPAS1'], dtype=object), 'G853D': array(['PDGFRA'], dtype=object), 'R174C': array(['BRCA2'], dtype=object), 'S501_A502dup': array(['KIT'], dtype=object), 'P130S': array(['SMAD4', 'PDGFRA'], dtype=object), 'Y163C': array(['TP53'], dtype=object), 'L535P': array(['NF2'], dtype=object), 'PAX8-PPAR? Fusion': array(['PAX8'], dtype=object), 'R273C': array(['TP53'], dtype=object), 'N387K': array(['CTNNB1'], dtype=object), 'I1766S': array(['BRCA1'], dtype=object), 'R658Q': array(['FBXW7'], dtype=object), 'P83L': array(['TGFBR1'], dtype=object), 'C482R': array(['KDR'], dtype=object), 'R273H': array(['TP53'], dtype=object), 'R273L': array(['TP53'], dtype=object), 'E362H': array(['BRCA1'], dtype=object), 'EWSR1-ATF1 Fusion': array(['EWSR1'], dtype=object), 'ETV6-NTRK3 Fusion': array(['ETV6', 'NTRK3'], dtype=object), 'LRIG3-ROS1 Fusion': array(['ROS1'], dtype=object), 'G14V': array(['RHOA'], dtype=object), 'N387P': array(['SMAD2'], dtype=object), 'V1741G': array(['BRCA1'], dtype=object), 'K56N': array(['BRCA1'], dtype=object), 'H61R': array(['PTEN'], dtype=object), 'C71Y': array(['PTEN'], dtype=object), 'L246V': array(['BRCA1'], dtype=object), 'H61D': array(['PTEN'], dtype=object), 'A1170V': array(['BRCA2'], dtype=object), 'M980T': array(['RET'], dtype=object), 'G333S': array(['KEAP1'], dtype=object), 'P2476L': array(['MTOR'], dtype=object), 'S1164I': array(['BRCA1'], dtype=object), 'E462G': array(['BRCA2'], dtype=object), 'A1065T': array(['KDR'], dtype=object), 'S1206R': array(['ALK'], dtype=object), 'ESR1-CCDC170 Fusion': array(['ESR1'], dtype=object), 'F74S': array(['PTPRT'], dtype=object), 'G2101A': array(['ROS1'], dtype=object), 'M374V': array(['CBL'], dtype=object), 'A1843P': array(['BRCA1'], dtype=object), 'W1782C': array(['BRCA1'], dtype=object), 'R574fs': array(['PIK3R1'], dtype=object), 'T413N': array(['SMAD2'], dtype=object), 'N659R': array(['PDGFRA'], dtype=object), 'I2285V': array(['BRCA2'], dtype=object), 'D1344H': array(['BRCA1'], dtype=object), 'R866C': array(['BRCA1'], dtype=object), 'KDELR2-ROS1 Fusion': array(['ROS1'], dtype=object), 'C634Y': array(['RET'], dtype=object), 'K650M': array(['FGFR3'], dtype=object), 'R177*': array(['RUNX1'], dtype=object), 'K650N': array(['FGFR3'], dtype=object), 'P648S': array(['MLH1'], dtype=object), 'F170I': array(['BAP1'], dtype=object), 'C141Y': array(['TP53'], dtype=object), 'K650Q': array(['FGFR3'], dtype=object), 'E1836K': array(['BRCA1'], dtype=object), 'L225LI': array(['CARD11'], dtype=object), 'EWSR1-ERG Fusion': array(['ERG'], dtype=object), 'L461V': array(['ERCC2'], dtype=object), 'P375S': array(['BRCA2'], dtype=object), 'A829P': array(['KIT'], dtype=object), 'H701P': array(['PIK3CA'], dtype=object), 'C277W': array(['TP53'], dtype=object), 'R170W': array(['AKT2', 'BRCA1'], dtype=object), 'Truncating Mutations': array(['FAM58A', 'CBL', 'DICER1', 'RYBP', 'TGFBR2', 'MSH6', 'KMT2D',
       'LATS1', 'PBRM1', 'ARID2', 'ELF3', 'ASXL2', 'TP53', 'TP53BP1',
       'SMAD4', 'CDKN1A', 'CDKN1B', 'CDKN2A', 'CDKN2B', 'BRIP1', 'ERCC3',
       'ERCC4', 'CEBPA', 'HLA-A', 'HLA-B', 'RBM10', 'TSC1', 'TSC2',
       'ARID1A', 'MAP3K1', 'AXIN1', 'BAP1', 'FANCA', 'FANCC', 'KMT2C',
       'FAT1', 'ARID1B', 'ARID5B', 'MLH1', 'KMT2A', 'SPOP', 'KDM6A', 'VHL',
       'MAP2K4', 'PMS2', 'PIK3R3', 'MSH2', 'XRCC2', 'CREBBP', 'NCOR1',
       'SETD2', 'CTCF', 'TET1', 'SMARCA4', 'SMARCB1', 'FOXP1', 'ATM',
       'STAG2', 'SOX9', 'KMT2B', 'ATR', 'ATRX', 'CDK12', 'B2M', 'GATA3',
       'CCND1', 'KEAP1', 'PTCH1', 'PTEN', 'RNF43', 'CIC', 'INPP4B', 'JAK1',
       'STK11', 'NF1', 'PTPRD', 'BRCA1', 'NF2', 'FUBP1', 'BCL10', 'MGA',
       'DNMT3A', 'RAD21', 'RAD51C', 'RAD51D', 'NPM1', 'HNF1A', 'TCF7L2',
       'RASA1', 'BCL2L11', 'RB1', 'CASP8', 'RUNX1'], dtype=object), 'R170Q': array(['BRCA1'], dtype=object), 'C481S': array(['BTK'], dtype=object), '560_561insER': array(['PDGFRA'], dtype=object), 'V173L': array(['TP53'], dtype=object), 'E746_T751delinsA': array(['EGFR'], dtype=object), 'E5K': array(['PMS2'], dtype=object), 'S1206C': array(['ALK'], dtype=object), 'V173E': array(['TP53'], dtype=object), 'W1837G': array(['BRCA1'], dtype=object), 'D324N': array(['FLT3'], dtype=object), 'L52F': array(['BRCA1'], dtype=object), 'R1608S': array(['NOTCH1'], dtype=object), 'P291Qfs*51': array(['HNF1A'], dtype=object), 'A1701P': array(['NOTCH1'], dtype=object), 'T3211K': array(['BRCA2'], dtype=object), 'K379E': array(['PIK3R1'], dtype=object), 'Deletion': array(['CBL', 'DICER1', 'SHQ1', 'RAD50', 'TGFBR2', 'MSH6', 'KMT2D',
       'PBRM1', 'LATS2', 'ARID2', 'ERRFI1', 'CDH1', 'EP300', 'TP53BP1',
       'SMAD2', 'SMAD3', 'SMAD4', 'FBXW7', 'CDKN1A', 'CDKN1B', 'CDKN2A',
       'CDKN2B', 'CDKN2C', 'ASXL1', 'BRIP1', 'ERCC4', 'RBM10', 'IKZF1',
       'TSC2', 'FOXA1', 'BAP1', 'MEN1', 'FANCA', 'KMT2C', 'FAT1', 'ARID1B',
       'PIK3R1', 'KDM6A', 'VHL', 'APC', 'MSH2', 'XRCC2', 'CREBBP', 'SETD2',
       'CTCF', 'TET1', 'SMARCB1', 'ATM', 'TET2', 'ATRX', 'CDK12', 'B2M',
       'BARD1', 'KEAP1', 'PTEN', 'BCOR', 'PRDM1', 'STK11', 'PTPN11', 'NF1',
       'PTPRD', 'BRCA1', 'BRCA2', 'BCL10', 'MGA', 'MED12', 'DNMT3A',
       'RAD21', 'NPM1', 'RASA1', 'BCL2L11', 'RB1', 'DUSP4', 'CASP8'], dtype=object), 'D74N': array(['CDKN2A'], dtype=object), 'G81D': array(['NFE2L2'], dtype=object), 'K78I': array(['STK11'], dtype=object), 'L193F': array(['PIM1'], dtype=object), 'K78A': array(['STK11'], dtype=object), 'T725M': array(['EGFR'], dtype=object), 'F156L': array(['KRAS'], dtype=object), 'S1301R': array(['BRCA1'], dtype=object), 'R1209W': array(['PTPRT'], dtype=object), 'G81R': array(['NFE2L2'], dtype=object), 'A122*': array(['RUNX1'], dtype=object), 'T34_A289del': array(['EGFR'], dtype=object), 'L180P': array(['TSC1'], dtype=object), 'L1574P': array(['NOTCH1'], dtype=object), 'S34F': array(['U2AF1'], dtype=object), 'P1709L': array(['TSC2'], dtype=object), 'R1093*': array(['SMARCA4'], dtype=object), 'L747_P753delinsS': array(['EGFR'], dtype=object), 'A1823T': array(['BRCA1'], dtype=object), 'T1691I': array(['BRCA1'], dtype=object), 'L19F': array(['KRAS'], dtype=object), 'V2969M': array(['BRCA2'], dtype=object), 'R647A': array(['AGO2'], dtype=object), 'R755S': array(['MLH1'], dtype=object), 'V155A': array(['VHL'], dtype=object), 'E571K': array(['XPO1'], dtype=object), 'K83N': array(['RUNX1'], dtype=object), 'V665A': array(['PDGFRB'], dtype=object), 'Q110R': array(['PTEN'], dtype=object), 'K83E': array(['RUNX1'], dtype=object), 'R715G': array(['NTRK2'], dtype=object), 'S227F': array(['PTEN'], dtype=object), 'N551K': array(['TCF3'], dtype=object), 'G1803A': array(['BRCA1'], dtype=object), 'V109G': array(['CDKN1B'], dtype=object), 'R265S': array(['MLH1'], dtype=object), 'N549K': array(['FGFR2'], dtype=object), 'Y64A': array(['HRAS', 'KRAS'], dtype=object), 'R370C': array(['AKT1'], dtype=object), 'N549S': array(['FGFR2'], dtype=object), 'EWSR1-CREB1 Fusion': array(['EWSR1'], dtype=object), 'S746fs': array(['ERCC2'], dtype=object), 'N549T': array(['FGFR2'], dtype=object), 'S1655F': array(['BRCA1'], dtype=object), 'Q1826H': array(['BRCA1'], dtype=object), 'L576del': array(['KIT'], dtype=object), 'S23R': array(['CTNNB1'], dtype=object), 'P168T': array(['BRCA2'], dtype=object), 'D2723G': array(['BRCA2'], dtype=object), 'H174R': array(['STK11'], dtype=object), 'V471F': array(['BRAF'], dtype=object), 'D2723H': array(['BRCA2'], dtype=object), 'M541L': array(['KIT'], dtype=object), 'Y174N': array(['PTEN'], dtype=object), 'Y35N': array(['RHEB'], dtype=object), 'IGL-MYC Fusion': array(['MYC'], dtype=object), 'Y570H': array(['KIT'], dtype=object), 'S196N': array(['BRCA2'], dtype=object), 'D661Y': array(['STAT3'], dtype=object), 'M918V': array(['RET'], dtype=object), 'L708P': array(['PTPRT'], dtype=object), 'P577S': array(['PDGFRA'], dtype=object), 'G325R': array(['RAD54L'], dtype=object), 'D769Y': array(['ERBB2'], dtype=object), 'V1804A': array(['BRCA1'], dtype=object), 'T2250A': array(['BRCA2'], dtype=object), 'S451E': array(['FLT3'], dtype=object), 'S451F': array(['FLT3'], dtype=object), 'S1651F': array(['BRCA1'], dtype=object), 'G325A': array(['CASP8'], dtype=object), 'L57V': array(['RHOA'], dtype=object), 'A11_G12insGA': array(['KRAS'], dtype=object), 'TMPRSS2-ETV1 Fusion': array(['TMPRSS2', 'ETV1'], dtype=object), 'H132Y': array(['SMAD4'], dtype=object), 'A171V': array(['PIK3R2'], dtype=object), 'R304*': array(['MAP2K4'], dtype=object), 'M134L': array(['PTEN'], dtype=object), 'ESR1-YAP1 Fusion': array(['ESR1'], dtype=object), '981_1028splice': array(['MET'], dtype=object), 'K1436Q': array(['NF1'], dtype=object), 'HMGA2-RAD51B Fusion': array(['RAD51B'], dtype=object), 'D842_M844del': array(['PDGFRA'], dtype=object), 'G1763V': array(['BRCA1'], dtype=object), 'R87P': array(['CDKN2A'], dtype=object), 'G127E': array(['PTEN'], dtype=object), 'G127N': array(['PTEN'], dtype=object), 'L209F': array(['BRCA2'], dtype=object), 'R272C': array(['KEAP1'], dtype=object), 'R514C': array(['FOXP1'], dtype=object), 'EP300-MLL Fusion': array(['EP300'], dtype=object), 'V370D': array(['SMAD4'], dtype=object), 'D935N': array(['BRCA2'], dtype=object), 'T468M': array(['PTPN11'], dtype=object), 'R87L': array(['CDKN2A'], dtype=object), 'P44L': array(['MYCN'], dtype=object), 'ATG7-RAF1 Fusion': array(['RAF1'], dtype=object), 'T599I': array(['BRAF'], dtype=object), 'V1665M': array(['BRCA1'], dtype=object), 'P44A': array(['CTNNB1'], dtype=object), 'D450E': array(['SMAD2'], dtype=object), 'I1183T': array(['ALK'], dtype=object), 'D450H': array(['SMAD2'], dtype=object), 'A1066V': array(['PIK3CA'], dtype=object), 'E839K': array(['KIT'], dtype=object), 'R201W': array(['CIC'], dtype=object), 'D737N': array(['KIT'], dtype=object), 'P186S': array(['PPP6C'], dtype=object), 'L272F': array(['AR'], dtype=object), 'S1715C': array(['BRCA1'], dtype=object), 'R1088C': array(['PTPRD'], dtype=object), 'V839G': array(['ERBB2'], dtype=object), 'W557G': array(['KIT'], dtype=object), 'S259A': array(['RAF1'], dtype=object), 'S259F': array(['RAF1'], dtype=object), 'Exon 2 mutations': array(['MED12'], dtype=object), 'Y803N': array(['ERBB2'], dtype=object), 'V1673D': array(['TSC2'], dtype=object), 'S1715R': array(['BRCA1'], dtype=object), 'C64G': array(['BRCA1'], dtype=object), 'S259P': array(['RAF1'], dtype=object), 'A72S': array(['PTPN11'], dtype=object), 'W557R': array(['KIT'], dtype=object), 'D835N': array(['FLT3'], dtype=object), 'K22A': array(['CDK4'], dtype=object), 'Y113*': array(['RUNX1'], dtype=object), 'NFASC-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'D835E': array(['FLT3'], dtype=object), 'E746_T751insIP': array(['EGFR'], dtype=object), 'W1456R': array(['MTOR'], dtype=object), 'D835A': array(['FLT3'], dtype=object), 'L265P': array(['MYD88'], dtype=object), 'MLL-TET1 Fusion': array(['TET1'], dtype=object), 'L325F': array(['PTEN'], dtype=object), 'D835Y': array(['FLT3'], dtype=object), 'T1219I': array(['MSH6'], dtype=object), 'TMPRSS2-ETV4 Fusion': array(['TMPRSS2'], dtype=object), 'R1758G': array(['BRCA1'], dtype=object), 'P26S': array(['KNSTRN'], dtype=object), 'N181S': array(['ERBB4'], dtype=object), 'C27A': array(['BRCA1'], dtype=object), 'N235D': array(['TP53'], dtype=object), 'R841K': array(['EGFR', 'PDGFRA'], dtype=object), 'Y179C': array(['BRCA1'], dtype=object), 'N771_H773dup': array(['EGFR'], dtype=object), 'R841Q': array(['BRCA1'], dtype=object), 'L1019V': array(['BRCA2'], dtype=object), 'V272L': array(['TP53'], dtype=object), 'T131A': array(['PTEN'], dtype=object), 'C124N': array(['PTEN'], dtype=object), 'Y220S': array(['TP53'], dtype=object), 'T131I': array(['PTEN'], dtype=object), 'HIP1-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'R140Q': array(['IDH2'], dtype=object), 'T131L': array(['PTEN'], dtype=object), 'E709_T710delinsD': array(['EGFR'], dtype=object), 'T131S': array(['PTEN'], dtype=object), 'C809G': array(['KIT'], dtype=object), 'G1125A': array(['IGF1R'], dtype=object), 'C124S': array(['PTEN'], dtype=object), 'N1228D': array(['BRCA2'], dtype=object), 'T28I': array(['MAP2K1'], dtype=object), 'R162*': array(['PIK3R1'], dtype=object), 'T82A': array(['MLH1'], dtype=object), 'L1026F': array(['JAK2'], dtype=object), 'Q635E': array(['SMO'], dtype=object), 'W515L': array(['MPL'], dtype=object), 'W515K': array(['MPL'], dtype=object), 'I563_L576del': array(['KIT'], dtype=object), 'N56T': array(['BRCA2'], dtype=object), 'G1128S': array(['ALK'], dtype=object), 'I33del': array(['PTEN'], dtype=object), 'G1770V': array(['BRCA1'], dtype=object), 'R139G': array(['RUNX1'], dtype=object), 'IGK-MYC Fusion': array(['MYC'], dtype=object), 'D2665G': array(['BRCA2'], dtype=object), 'V128del': array(['TSC1'], dtype=object), 'R292A': array(['SMAD3'], dtype=object), 'L97R': array(['CDKN2A'], dtype=object), 'G101S': array(['MLH1'], dtype=object), 'L582F': array(['MLH1'], dtype=object), 'G101W': array(['CDKN2A'], dtype=object), 'Y846C': array(['TERT'], dtype=object), 'R683T': array(['JAK2'], dtype=object), 'F1888L': array(['MTOR'], dtype=object), 'L668F': array(['BRCA1'], dtype=object), 'F1888I': array(['MTOR'], dtype=object), 'F241S': array(['PTEN'], dtype=object), 'R683S': array(['JAK2'], dtype=object), 'PAPSS1-BRAF Fusion': array(['BRAF'], dtype=object), 'A1789S': array(['BRCA1'], dtype=object), 'R683G': array(['JAK2'], dtype=object), 'M199del': array(['PTEN'], dtype=object), 'F1888V': array(['MTOR'], dtype=object), 'L112P': array(['PTEN'], dtype=object), 'R683K': array(['JAK2'], dtype=object), 'H1094R': array(['MET'], dtype=object), 'S214C': array(['ARAF'], dtype=object), 'H1094Y': array(['MET'], dtype=object), 'Q61R': array(['RAC1', 'KRAS', 'NRAS'], dtype=object), 'K147E': array(['KRAS'], dtype=object), 'Q61L': array(['HRAS', 'KRAS', 'NRAS'], dtype=object), 'TEL-JAK2 Fusion': array(['JAK2'], dtype=object), 'Q61K': array(['HRAS', 'NRAS'], dtype=object), 'Q61H': array(['HRAS', 'KRAS', 'NRAS'], dtype=object), 'F594_R595insSDNEYFYVDF': array(['FLT3'], dtype=object), 'E106G': array(['NF2'], dtype=object), 'D83V': array(['MEF2B'], dtype=object), 'H1094L': array(['MET'], dtype=object), 'D1010Y': array(['MET'], dtype=object), 'Y364H': array(['FLT3'], dtype=object), 'K2411T': array(['BRCA2'], dtype=object), 'I130M': array(['IDH1'], dtype=object), 'D579del': array(['KIT'], dtype=object), 'D2033N': array(['ROS1'], dtype=object), 'Y572C': array(['FLT3'], dtype=object), 'N1102Y': array(['BRCA2'], dtype=object), 'G503V': array(['PTPN11'], dtype=object), 'T844M': array(['PTPRT'], dtype=object), 'E190K': array(['IDH1'], dtype=object), 'G373R': array(['PIK3R2'], dtype=object), 'G2274V': array(['BRCA2'], dtype=object), 'C91A': array(['BAP1'], dtype=object), 'L1854P': array(['BRCA1'], dtype=object), 'L2865V': array(['BRCA2'], dtype=object), 'R69C': array(['VHL'], dtype=object), 'L2106P': array(['BRCA2'], dtype=object), 'E541K': array(['PMS2'], dtype=object), 'C91S': array(['BAP1'], dtype=object), 'T75M': array(['TMPRSS2'], dtype=object), 'K120M': array(['TP53'], dtype=object), 'Y1703S': array(['BRCA1'], dtype=object), 'L1844R': array(['BRCA1'], dtype=object), 'W2626C': array(['BRCA2'], dtype=object), 'G1128A': array(['ALK'], dtype=object), 'I49S': array(['CDKN2A'], dtype=object), '385_418del': array(['ETV6'], dtype=object), 'D245V': array(['BRCA1'], dtype=object), 'K125L': array(['PTEN'], dtype=object), 'K125M': array(['PTEN'], dtype=object), 'I668V': array(['PMS2'], dtype=object), 'E160*': array(['PIK3R1'], dtype=object), 'S35Q': array(['TSC1'], dtype=object), 'S273C': array(['PIK3R2'], dtype=object), 'F808L': array(['PDGFRA'], dtype=object), 'WDR48-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'K125E': array(['PTEN'], dtype=object), 'S100F': array(['SDHB'], dtype=object), 'E281K': array(['ABL1'], dtype=object), 'CCND1-IGH Fusion': array(['CCND1'], dtype=object), 'G1286R': array(['ALK'], dtype=object), 'P490_Q494del': array(['BRAF'], dtype=object), 'V1075Yfs*2': array(['KDM5C'], dtype=object), 'G831E': array(['FLT3'], dtype=object), 'W308C': array(['STK11'], dtype=object), 'L485_P490del': array(['BRAF'], dtype=object), 'L838V': array(['EGFR'], dtype=object), 'R339W': array(['CTCF'], dtype=object), 'ETV6-PDGFRA Fusion': array(['PDGFRA'], dtype=object), 'E1071W': array(['PDGFRB'], dtype=object), 'D350G': array(['PIK3CA'], dtype=object), 'D387V': array(['CARD11'], dtype=object), 'I290R': array(['CCND3'], dtype=object), 'P704S': array(['TERT'], dtype=object), 'G118D': array(['PIK3CA'], dtype=object), 'N564_Y578del': array(['KIT'], dtype=object), 'Q510E': array(['PTPN11'], dtype=object), 'D770_P772dup': array(['EGFR'], dtype=object), 'A1459P': array(['MTOR'], dtype=object), 'I559_D560insDKRMNS': array(['PIK3R1'], dtype=object), 'V1643A': array(['BRCA2'], dtype=object), 'Promoter Hypermethylation': array(['PTPRT', 'CASP8'], dtype=object), 'E286K': array(['TP53'], dtype=object), 'G596R': array(['BRAF'], dtype=object), 'S502T': array(['PTPN11'], dtype=object), 'G596V': array(['BRAF'], dtype=object), 'H1862L': array(['BRCA1'], dtype=object), 'K189N': array(['AKT1'], dtype=object), 'R2659T': array(['BRCA2'], dtype=object), 'R661W': array(['RB1'], dtype=object), 'S68W': array(['VHL'], dtype=object), 'K79E': array(['NF2'], dtype=object), 'G596C': array(['BRAF'], dtype=object), 'C39R': array(['BRCA1'], dtype=object), 'R2659K': array(['BRCA2'], dtype=object), 'C554W': array(['BRCA2'], dtype=object), 'G1971E': array(['ROS1'], dtype=object), 'Q209L': array(['GNA11', 'GNAQ'], dtype=object), 'C278F': array(['FGFR2'], dtype=object), 'P278A': array(['TP53'], dtype=object), 'R228K': array(['MAP2K4'], dtype=object), 'E120Q': array(['MAP2K1'], dtype=object), 'T1700A': array(['BRCA1'], dtype=object), 'S1722F': array(['BRCA1'], dtype=object), 'R112G': array(['CDKN2A'], dtype=object), 'E1552del': array(['TSC2'], dtype=object), 'S1140G': array(['BRCA1'], dtype=object), 'P278S': array(['TP53'], dtype=object), 'R282W': array(['TP53'], dtype=object), 'S768_V769insVAS': array(['EGFR'], dtype=object), 'K508M': array(['FGFR3'], dtype=object), 'R282Q': array(['TP53'], dtype=object), 'N1819S': array(['BRCA1'], dtype=object), 'N1819Y': array(['BRCA1'], dtype=object), 'Y599_D600insEYEYEYEY': array(['FLT3'], dtype=object), 'W714*': array(['MLH1'], dtype=object), 'W1718L': array(['BRCA1'], dtype=object), 'Q324L': array(['NF2'], dtype=object), 'M1043I': array(['PIK3CA'], dtype=object), 'R571W': array(['PTCH1'], dtype=object), 'V561_I562insER': array(['PDGFRA'], dtype=object), 'PTPRZ1-MET Fusion': array(['MET'], dtype=object), 'E3002K': array(['BRCA2'], dtype=object), 'Exon 13 deletion': array(['NUP93'], dtype=object), 'C620R': array(['RET'], dtype=object), 'A627T': array(['FLT3'], dtype=object), 'R262T': array(['PIK3R1'], dtype=object), 'Y63C': array(['PTPN11'], dtype=object), 'PDE4DIP-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'H133Q': array(['IDH1'], dtype=object), 'V509A': array(['MSH6'], dtype=object), 'I2627F': array(['BRCA2'], dtype=object), 'Y32S': array(['KRAS'], dtype=object), 'E239A': array(['SMAD3'], dtype=object), 'P1311T': array(['PTPRD'], dtype=object), 'R2502H': array(['BRCA2'], dtype=object), 'E1250K': array(['BRCA1'], dtype=object), 'A349P': array(['BRIP1'], dtype=object), 'D513Y': array(['ERCC2'], dtype=object), 'R2502C': array(['BRCA2'], dtype=object), 'W1718C': array(['BRCA1'], dtype=object), 'K5N': array(['KRAS'], dtype=object), 'L239R': array(['DDR2'], dtype=object), 'R380A': array(['GLI1'], dtype=object), 'D816V': array(['KIT'], dtype=object), 'K2950N': array(['BRCA2'], dtype=object), 'G660D': array(['ERBB2'], dtype=object), 'EGFRvIII': array(['EGFR'], dtype=object), 'S214A': array(['ARAF'], dtype=object), 'R1699W': array(['BRCA1'], dtype=object), 'R1446H': array(['CREBBP'], dtype=object), 'R1699Q': array(['BRCA1'], dtype=object), 'K459_S460delinsN': array(['PIK3R1'], dtype=object), 'Y555C': array(['PDGFRA'], dtype=object), 'D816Y': array(['KIT'], dtype=object), 'D816F': array(['KIT'], dtype=object), 'D816G': array(['KIT'], dtype=object), 'D816E': array(['KIT'], dtype=object), 'MKRN1-BRAF Fusion': array(['BRAF'], dtype=object), 'H1686R': array(['BRCA1'], dtype=object), 'H1686Q': array(['BRCA1'], dtype=object), 'D816A': array(['KIT'], dtype=object), 'D816N': array(['KIT'], dtype=object), 'S214T': array(['ARAF'], dtype=object), 'D816H': array(['KIT'], dtype=object), 'L1122V': array(['ALK'], dtype=object), 'R20Q': array(['PMS2'], dtype=object), 'Y149D': array(['PRDM1'], dtype=object), 'TFG-NTRK1 Fusion': array(['NTRK1'], dtype=object), 'SPTBN1-PDGFRB Fusion': array(['PDGFRB'], dtype=object), 'Y1045W': array(['JAK2'], dtype=object), 'L585I': array(['PMS2'], dtype=object), 'Y130C': array(['MAP2K1'], dtype=object), 'S765P': array(['RET'], dtype=object), 'E1322*': array(['MSH6'], dtype=object), 'K618T': array(['MLH1'], dtype=object), 'Y112H': array(['VHL'], dtype=object), 'A59T': array(['HRAS'], dtype=object), 'BTBD1-NTRK3 Fusion': array(['NTRK3'], dtype=object), 'V197E': array(['TP53'], dtype=object), 'W603_E604insDREYEYDLKW': array(['FLT3'], dtype=object), 'V197L': array(['TP53'], dtype=object), 'P573_D579del': array(['KIT'], dtype=object), 'R1598P': array(['NOTCH1'], dtype=object), 'Y1045*': array(['JAK2'], dtype=object), 'A59G': array(['KRAS'], dtype=object), 'R497H': array(['SMAD4'], dtype=object), 'R515G': array(['SMAD4'], dtype=object), 'Y489C': array(['NF1'], dtype=object), 'V995M': array(['PTPRT'], dtype=object), 'Y931C': array(['JAK2'], dtype=object), 'K398A': array(['SOX9'], dtype=object), 'R2973C': array(['BRCA2'], dtype=object), 'R306S': array(['ERBB4'], dtype=object), 'P173S': array(['RUNX1'], dtype=object), 'K607T': array(['BRCA2'], dtype=object), 'P29S': array(['RAC1'], dtype=object), 'R109*': array(['ERCC3'], dtype=object), 'V348L': array(['ERBB4'], dtype=object), 'I15T': array(['BRCA1'], dtype=object), 'S723F': array(['MSH2'], dtype=object), 'P29L': array(['RAC1'], dtype=object), 'P152L': array(['TP53'], dtype=object), 'V710A': array(['NTRK1'], dtype=object), 'F537_K539delinsL': array(['JAK2'], dtype=object), 'N659K': array(['PDGFRA'], dtype=object), 'A1752V': array(['BRCA1'], dtype=object), 'A2351G': array(['BRCA2'], dtype=object), 'E82D': array(['NFE2L2'], dtype=object), 'E82G': array(['NFE2L2'], dtype=object), 'L210R': array(['FANCA'], dtype=object), 'P531A': array(['EPAS1'], dtype=object), 'L232LI': array(['CARD11'], dtype=object), 'W290_I291delinsC': array(['FGFR2'], dtype=object), 'R748G': array(['PDGFRA'], dtype=object), 'E82V': array(['NFE2L2'], dtype=object), 'N659S': array(['PDGFRA'], dtype=object), 'P531S': array(['EPAS1'], dtype=object), 'A546D': array(['ESR1'], dtype=object), 'R71G': array(['BRCA1'], dtype=object), 'L1947R': array(['ROS1'], dtype=object), 'Q1811R': array(['BRCA1'], dtype=object), 'N822Y': array(['KIT'], dtype=object), 'T74P': array(['KRAS'], dtype=object), 'V765A': array(['EGFR'], dtype=object), 'P1614S': array(['BRCA1'], dtype=object), 'T574insTQLPYD': array(['KIT'], dtype=object), 'N822I': array(['KIT'], dtype=object), 'N822H': array(['KIT'], dtype=object), 'N234I': array(['MAP2K4'], dtype=object), 'L485F': array(['BRAF'], dtype=object), 'CCDC6-ROS1 Fusion': array(['ROS1'], dtype=object), 'H41R': array(['BRCA1'], dtype=object), 'R348*': array(['PIK3R1'], dtype=object), 'D408Y': array(['SMAD3'], dtype=object), 'Epigenetic Silencing': array(['CDH1'], dtype=object), 'C1697R': array(['BRCA1'], dtype=object), 'K745M': array(['EGFR'], dtype=object), 'D408E': array(['SMAD3'], dtype=object), 'TRB-NKX2-1 Fusion': array(['NKX2-1'], dtype=object), 'D1091N': array(['ALK'], dtype=object), 'R2418G': array(['BRCA2'], dtype=object), 'D408H': array(['SMAD3'], dtype=object), 'Q579_L581del': array(['PIK3R1'], dtype=object), 'I151S': array(['VHL'], dtype=object), 'C125S': array(['MAP2K2'], dtype=object), 'R428A': array(['SMAD2'], dtype=object), 'E35*': array(['CDKN2B'], dtype=object), 'V1810G': array(['BRCA1'], dtype=object), 'A883F': array(['RET'], dtype=object), 'E846K': array(['SOS1'], dtype=object), 'T2722R': array(['BRCA2'], dtype=object), 'L362R': array(['AKT1'], dtype=object), 'S362L': array(['PTEN'], dtype=object), 'K179M': array(['AKT1'], dtype=object), 'H876Q': array(['TERT'], dtype=object), 'E127G': array(['CARD11'], dtype=object), 'S428F': array(['CHEK2'], dtype=object), 'D326N': array(['PTEN'], dtype=object), 'D1010N': array(['MET'], dtype=object), 'Q22R': array(['KRAS'], dtype=object), 'Y40A': array(['KRAS'], dtype=object), 'Wildtype': array(['KRAS'], dtype=object), 'D404G': array(['TGFBR2'], dtype=object), 'P286R': array(['POLE'], dtype=object), 'V1534M': array(['BRCA1'], dtype=object), 'N1333Gfs*': array(['PBRM1'], dtype=object), 'D1029Y': array(['PIK3CA'], dtype=object), 'K650E': array(['FGFR3'], dtype=object), 'I279P': array(['ELF3'], dtype=object), 'S1101N': array(['BRCA1'], dtype=object), 'N517S': array(['BRCA2'], dtype=object), 'G309A': array(['ERBB2'], dtype=object), 'V157D': array(['TP53'], dtype=object), 'F133V': array(['SPOP'], dtype=object), 'V157F': array(['TP53'], dtype=object), 'E219K': array(['FGFR2'], dtype=object), 'T150I': array(['TP53'], dtype=object), 'R462C': array(['TSC2'], dtype=object), 'I168F': array(['PTEN'], dtype=object), 'R462E': array(['BRAF'], dtype=object), 'L122R': array(['MYOD1'], dtype=object), 'R462I': array(['BRAF'], dtype=object), 'L1764P': array(['BRCA1'], dtype=object), 'D1384V': array(['TET2'], dtype=object), 'Q60K': array(['PIK3CA'], dtype=object), 'R164Q': array(['KRAS'], dtype=object), 'S249C': array(['FGFR3'], dtype=object), 'G309E': array(['ERBB2'], dtype=object), 'EWSR1-NR4A3 Fusion': array(['EWSR1'], dtype=object)}

No. of Unique Genes Per Variations

In [43]:
unique_variation_gene_count={}
for key, value in sorted(unique_variation_gene.items()):
         unique_variation_gene_count[key]= len(value)
print unique_variation_gene_count
{'Y62C': 1, 'D399N': 1, 'V561D': 1, 'Q531*': 1, 'E1586G': 1, 'SRGAP3-RAF1 Fusion': 1, 'H93Q': 1, 'H93R': 1, 'V561A': 1, 'V564I': 1, 'H93D': 1, 'L1195V': 1, 'K4E': 1, 'Q227R': 1, 'E203K': 1, 'E2856A': 1, 'P380R': 1, 'R2505*': 1, 'S786F': 1, 'Y1463S': 1, 'N581Y': 1, 'D537Y': 1, 'S32I': 1, 'N581S': 1, 'MAGI3-AKT3 Fusion': 1, 'A502_Y503dup': 1, 'H697Y': 1, 'Y53H': 1, 'L2396F': 1, 'R2450*': 1, 'D3170G': 1, 'N581D': 1, 'G464A': 1, 'D289del': 1, 'V1188L': 1, 'W557_K558del': 1, 'L191H': 1, 'H118P': 1, 'EP300-MOZ Fusion': 1, 'D603N': 1, 'G480W': 1, 'D289_D292del': 1, 'L1122V': 1, 'D351H': 1, 'D603G': 1, 'F460L': 1, 'G17V': 1, 'S860L': 1, 'P86H': 1, 'N345I': 1, 'G1738R': 1, 'C450_K451insMIEWMI': 1, 'V411L': 1, 'T1324N': 1, 'E2663V': 1, 'V555_V559del': 1, 'G1738E': 1, 'GPIAP1-PDGFRB Fusion': 1, 'E598_Y599insGLVQVTGSSDNEYFYVDFREYE': 1, 'P124Q': 1, 'G165V': 1, 'P124S': 1, 'G829R': 1, 'G165R': 1, 'R496C': 1, 'D835del': 1, 'P95S': 1, 'K558_E562del': 1, 'R496H': 1, 'R383*': 1, 'G165E': 1, 'R156H': 1, 'Y1230C': 1, 'P124L': 1, 'P153H': 1, 'D808N': 1, 'G1743R': 1, 'V118D': 1, 'TRA-NKX2-1 Fusion': 1, 'P106L': 1, 'S562L': 1, 'M504V': 1, 'L387M': 1, 'M1008I': 1, 'C630Y': 1, 'Y801H': 1, 'E719G': 1, 'E719K': 1, 'C630R': 1, 'K509I': 1, 'I391M': 1, 'Q227L': 1, 'A750_E758delinsP': 1, 'T599_V600insV': 1, 'E746G': 1, 'D837N': 1, 'P179L': 1, 'RANBP2-ALK Fusion': 1, 'E746Q': 1, 'E746V': 1, 'R487Q': 1, 'P179R': 1, 'RET-CCDC6 Fusion': 1, 'K59del': 1, 'Q145H': 1, 'R18H': 1, 'E1935G': 1, 'G423V': 1, 'K11R': 1, 'L559R': 1, 'G423R': 1, 'T790M': 1, 'A1685S': 1, 'V1804D': 1, 'M224R': 1, '560_561insER': 1, 'C384R': 1, 'E69K': 1, 'V321M': 2, 'G35R': 2, 'K666M': 1, 'K666N': 1, 'G35V': 1, 'R108H': 1, 'S215C': 1, 'V14I': 1, 'C44F': 1, 'S215G': 1, 'F876L': 1, 'G35A': 1, 'C44Y': 1, 'N375S': 1, 'E1735K': 1, 'Y599_D600insGLYVDFREYEY': 1, 'K666R': 1, 'Y578C': 1, 'K666T': 1, 'M1328I': 1, 'D1739G': 1, 'D1739E': 1, 'A2643G': 1, 'W1837C': 1, 'G75R': 1, 'D1739V': 1, 'D419del': 1, 'P133T': 1, 'I99M': 1, 'L584F': 1, 'Y155C': 1, 'D1739Y': 1, 'R2318Q': 1, 'L424V': 1, 'P278R': 1, 'T77M': 1, 'S387N': 1, 'N510K': 1, 'E144K': 1, 'T77P': 1, 'E60L': 1, 'L424I': 1, 'L158V': 1, 'L1678P': 1, 'G271E': 1, 'L158Q': 1, 'L158P': 1, 'R2602T': 1, 'R698W': 1, 'E632_L633del': 1, 'S904F': 1, 'C135Y': 1, 'C135S': 1, 'C135R': 1, 'Y849C': 1, 'D1349H': 1, 'V1713A': 1, 'S1486C': 1, 'T529I': 1, 'AGK-BRAF Fusion': 1, 'T529N': 1, 'T529M': 1, 'I90T': 1, 'S1036P': 1, 'F71I': 1, 'S371C': 1, 'G505S': 1, 'F311L': 1, 'S752_I759del': 1, 'K700R': 1, 'K700E': 1, 'T17A': 1, 'F1200I': 1, 'F28L': 2, 'S33A': 1, 'F119S': 1, 'R658Q': 1, 'V2908G': 1, 'Y1003*': 1, 'N870S': 1, 'Y1295A': 1, 'L78T': 1, 'Y513A': 1, 'S33Y': 1, 'S102F': 1, 'L388M': 1, 'ETV6-FLT3 Fusion': 1, 'G328E': 1, 'G328V': 1, 'A2717S': 1, 'T160I': 1, 'Overexpression': 6, 'H2074N': 1, 'N561D': 1, 'G1656D': 1, 'S427G': 1, 'N116H': 1, 'W80A': 1, 'Y1003C': 1, 'P287A': 1, 'ESRP1-RAF1 Fusion': 1, 'Y1003F': 1, 'N841I': 1, 'I642V': 1, 'V248D': 1, 'P287S': 1, 'T605M': 1, 'P287T': 1, 'TRKAIII Splice Variant': 1, 'Y98N': 1, 'L2427R': 1, 'R882L': 1, 'L1584R': 1, 'R882C': 1, 'Y371H': 1, 'R177Pfs*126': 1, 'L861P': 1, 'L861Q': 1, 'L861R': 1, 'G697C': 1, '422_605trunc': 1, 'D84V': 1, 'R251Q': 1, 'H412Y': 1, 'T1354M': 1, 'F384L': 2, 'C61G': 1, 'I1018W': 1, 'K513R': 1, 'F384Y': 1, 'F359C': 1, 'D121G': 1, 'D520N': 1, 'I1018F': 1, 'F384V': 1, 'G60E': 1, 'Q12Y': 1, 'D493A': 1, 'R373Q': 1, 'R470C': 1, 'V774M': 1, 'L910P': 1, 'M1_E165DEL': 1, 'L43V': 1, 'R373H': 1, 'A161T': 1, 'A161S': 1, 'Q331R': 1, 'D631G': 1, 'S217C': 1, 'Q331H': 1, 'D631A': 1, 'I1680N': 1, 'P387S': 1, 'S24F': 1, 'M160V': 1, 'Y1230H': 1, 'R1589P': 1, 'DNA binding domain missense mutations': 1, 'D814V': 1, 'PRKG2-PDGFRB Fusion': 1, 'M1400V': 1, 'R167Q': 1, 'K1299E': 1, 'L783F': 1, 'P1806A': 1, 'E1384K': 1, 'R1589H': 1, 'KIF5B-PDGFRA Fusion': 1, 'Y1278S': 1, 'NPM-ALK Fusion': 1, 'L202F': 1, 'A889P': 1, 'G128V': 1, 'E475K': 1, 'G81S': 1, 'A1519T': 1, 'R217C': 1, 'P2412T': 1, 'Y426A': 1, 'G1788D': 1, 'G67S': 1, 'E275K': 1, 'Q1785H': 1, 'G1035S': 1, 'M784V': 1, 'G1788V': 1, 'N334K': 1, 'M1250T': 1, 'R537P': 1, 'V852I': 1, 'H65Y': 1, 'G17E': 1, 'BIN2-PDGFRB Fusion': 1, 'P47S': 1, 'TMPRSS2-ETV5 Fusion': 1, 'D323H': 1, 'P47A': 1, 'V344A': 1, 'K1702E': 1, 'PVT1-MYC Fusion': 1, 'R132G': 1, 'R132C': 1, 'S1039F': 1, 'T599R': 1, 'S153R': 1, 'L234fs': 1, 'R132H': 1, 'FIG-ROS1 Fusion': 1, 'R678Q': 1, 'R132Q': 1, 'G776S': 1, 'S1841N': 1, 'N126D': 1, 'D101Y': 1, 'E866K': 1, 'D587H': 1, 'T417I': 1, 'V677I': 1, 'R15K': 1, 'R15S': 1, 'G370C': 1, 'V344G': 1, 'Y537C': 1, 'Y253F': 1, 'P34R': 2, 'P25L': 1, 'Y537S': 1, 'E75G': 1, 'R121Q': 1, 'R108K': 1, 'L30F': 1, 'V155F': 1, 'N71S': 1, 'E1978*': 1, 'P780L': 1, 'SLC34A2-ROS1 Fusion': 1, 'Y551F': 1, 'T785A': 1, 'G776delinsLC': 1, 'R2888C': 1, 'F893L': 1, 'N71K': 1, 'MYC-nick': 1, 'E868G': 1, 'T1365M': 1, 'V769E': 1, 'H870R': 1, 'R987W': 1, 'V769M': 1, 'Q579R': 1, 'PCM1-JAK2 Fusion': 1, 'S768_D770dup': 1, 'V1808A': 1, 'R711*': 1, 'P284L': 1, 'A1374V': 1, 'I867S': 1, 'A1234T': 1, 'Y42C': 2, 'S217T': 1, 'Y412F': 1, 'Q2416*': 1, 'C248T': 1, 'X1009_splice': 1, 'V716M': 1, 'V1809F': 1, 'D842Y': 1, 'V1809A': 1, 'I122L': 1, 'D842V': 1, 'T878S': 1, 'I122V': 1, 'D842I': 1, 'Y599F': 1, 'I122S': 1, 'E598_Y599insDVDFREYE': 1, 'L1951R': 1, 'L1407P': 1, 'Q58_E62del': 1, 'V774A': 1, 'C706F': 1, 'Y640F': 1, 'R970C': 1, 'M374V': 1, 'F134Y': 1, 'Exon 1 mutations': 1, 'S121A': 1, 'P34L': 1, 'R2304C': 1, 'C134W': 1, 'G469del': 1, 'P219S': 1, 'S121C': 1, 'R321Q': 1, 'AKAP9-BRAF Fusion': 1, 'Y35C': 1, 'K111E': 1, 'Y35H': 1, 'S464L': 1, 'K111N': 1, 'V384D': 1, 'V506A': 1, 'R369Q': 1, 'R369W': 1, 'E258V': 1, 'T599dup': 1, 'S1651P': 1, 'N810Y': 1, 'A648T': 1, 'V1688del': 1, 'Q252H': 1, 'S65L': 1, 'F1695L': 1, 'S65N': 1, 'S784F': 1, 'C569Y': 1, 'G1125A': 1, 'F1592S': 1, 'G305W': 1, 'S65A': 1, 'G305R': 1, 'G250E': 1, 'R310H': 1, 'S276L': 1, 'A111P': 1, 'S65W': 1, 'P326L': 1, 'Y1294A': 1, 'T1025S': 1, 'A159T': 1, 'K120E': 1, 'D84Y': 1, 'K409Q': 1, 'K120N': 1, 'E622D': 1, 'S33F': 1, 'D84N': 1, 'D84H': 1, 'EWSR1-WT1 Fusion': 1, 'E580*': 1, 'D384N': 1, 'D84G': 1, 'E622Q': 1, 'X475_splice': 1, 'N319D': 1, 'V414A': 1, 'G774V': 1, 'R276W': 1, 'P142H': 1, 'L1240V': 1, '2010_2471trunc': 1, 'R2418G': 1, 'N82K': 1, 'N319T': 1, 'R49C': 1, 'R1446H': 1, 'Truncating Mutations Upstream of Transactivation Domain': 1, 'R1446C': 1, 'S249C': 1, 'F347L': 1, 'D32Y': 1, 'R1189*': 1, 'P40T': 1, 'L345Q': 1, 'E884K': 1, 'A41P': 1, 'F568fs': 1, 'KDR-PDGFRA Fusion': 1, 'A41T': 1, 'N826Y': 1, 'E731K': 1, 'P151S': 1, 'C696Y': 1, 'P151H': 1, 'G163D': 1, 'BCAN-NTRK1 Fusion': 1, 'P151A': 1, 'A197T': 1, 'IGH-NKX2 Fusion': 1, 'A728V': 1, 'K128T': 1, 'YAP1-FAM118B Fusion': 1, 'L747_P753del': 1, 'K806A': 1, 'Q22E': 1, 'Q1396R': 1, 'Q22K': 2, 'D770_N771insNPG': 1, 'MSN-ROS1 Fusion': 1, 'S80R': 1, 'V559_V560del': 1, 'F400I': 1, 'L321A': 1, 'T244_I245insCPT': 1, 'T37A': 1, 'D277H': 1, 'H68R': 1, 'V289F': 1, 'P1812R': 1, 'P1812S': 1, 'BCR-ABL1 Fusion': 1, 'V1671I': 1, 'H68Y': 1, 'N239S': 1, 'E76K': 1, 'N1068fs*4': 1, 'T37R': 1, 'E76A': 1, 'Exon 11 mutations': 1, 'R905G': 1, 'G42R': 1, 'V1378I': 1, 'I448V': 1, 'R905Q': 1, 'R905W': 1, 'Y253H': 1, 'S1101N': 1, 'R1076C': 1, 'V369G': 1, 'M1255I': 1, 'Exon 19 deletion': 1, 'R166Q': 1, 'K28M': 1, 'G60D': 1, 'I21V': 1, 'N659S': 1, 'R552G': 1, 'S217A': 1, 'L747V': 1, 'N217I': 1, 'V242F': 1, 'G60R': 1, 'I2675V': 1, 'S170R': 1, 'ERLIN2?FGFR1 Fusion': 1, 'S703I': 1, 'S760A': 1, 'G129E': 1, 'G129A': 1, 'CPEB1-NTRK3 Fusion': 1, 'E685V': 1, 'R23A': 1, 'S170N': 1, 'Q367P': 1, 'G129R': 1, 'T286A': 2, 'E41A': 1, 'S1424C': 1, 'S226D': 1, 'T286I': 1, 'Y598C': 1, 'SND1-BRAF Fusion': 1, 'T401I': 1, 'V536M': 1, 'V191I': 1, 'PAX5-JAK2 Fusion': 1, 'D32H': 1, 'R465H': 1, 'R134Q': 1, 'D32N': 1, 'D32A': 1, 'R133*': 1, 'R812A': 1, 'F615S': 1, 'H650Q': 1, 'R420H': 1, 'C157Y': 1, 'Y553_Q556del': 1, 'P42T': 1, 'R315*': 1, 'R453C': 1, 'S3660L': 1, 'H773_V774insH': 1, 'D1270G': 1, 'W345*': 1, 'R133H': 1, 'M918V': 1, 'S45del': 1, 'G796S': 1, 'W183A': 1, 'G264S': 1, 'R583A': 1, 'W24C': 1, 'I836del': 1, 'M351T': 1, 'C443Y': 1, 'Y375C': 1, 'E78K': 1, 'D92A': 1, 'D92G': 1, 'D92E': 1, 'T1720A': 1, 'A272V': 1, 'D92H': 1, 'D92N': 1, 'D92V': 1, 'S31R': 1, 'R625L': 1, 'M535I': 1, 'R625H': 1, 'R2520Q': 1, 'R625D': 1, 'R1625C': 1, 'R625G': 1, 'E525K': 1, 'R625C': 1, 'Q144R': 1, 'G93W': 1, 'F594L': 1, 'Q1756fs': 1, 'I653T': 1, 'T123A': 1, 'V1833M': 1, 'G199R': 1, 'V1833E': 1, 'E1644G': 1, 'N1647K': 1, 'FGFR2-TACC3 Fusion': 1, 'FGFR1-TACC1 Fusion': 1, 'S425C': 1, 'H643D': 1, 'TPM3-NTRK1 Fusion': 1, 'R1204W': 1, 'BCOR-CCNB3 Fusion': 1, 'R689Q': 1, 'G311D': 1, 'R1204G': 1, 'Q72L': 1, 'A1708V': 1, 'E563K': 1, 'R337L': 1, 'L384M': 1, 'A634V': 1, 'P1771R': 1, 'D194Y': 1, 'K45Q': 1, 'S256W': 1, 'K45T': 1, 'R337C': 1, 'A634D': 1, 'V197E': 1, 'T654I': 1, 'K45N': 1, 'T654M': 1, 'H410R': 1, 'R337P': 1, 'D641G': 1, 'K830R': 1, 'R174C': 1, 'S1733F': 1, 'E317K': 1, 'R552S': 1, 'V1605del': 1, 'E69G': 1, 'D254N': 1, 'L193F': 1, 'V299L': 1, 'G333S': 1, 'C229R': 1, 'G735S': 1, 'D1071N': 1, 'Y16C': 1, 'R201C': 1, 'K558delinsNP': 1, 'L692F': 1, 'S241L': 1, 'G778_P780dup': 1, 'L1196M': 1, 'S241F': 1, 'Y384A': 1, 'H297N': 1, 'Y298A': 1, 'P250L': 1, 'D646Y': 1, 'S241Y': 1, 'S46N': 1, 'L1196Q': 1, 'S241T': 1, 'R1231Q': 1, 'G251C': 1, 'Q689R': 1, 'S326R': 1, 'Y105C': 1, 'M35R': 1, 'C41Y': 1, 'K603Q': 1, 'R164Q': 1, 'RUNX1-RUNX1T1 Fusion': 1, 'LIMA1-ROS1 Fusion': 1, 'V1306I': 1, 'Q984K': 1, 'L248V': 1, 'R1726G': 1, 'L607I': 1, 'E355A': 1, 'E17K': 3, 'KIAA1549-BRAF Fusion': 1, 'E322K': 1, 'L507P': 1, 'H570R': 1, 'E709V': 1, 'L1198F': 1, 'E709Q': 1, 'G12V': 4, 'F346V': 1, 'P1859R': 1, 'G12R': 1, 'G12S': 2, 'E709G': 1, 'L1198P': 1, 'E709A': 1, 'G12F': 1, 'G12D': 2, 'G12C': 2, 'S186Y': 1, 'KIF5B-RET Fusion': 1, 'L344P': 1, 'L344R': 1, 'C611Y': 1, 'V1565I': 1, 'D130A': 1, 'E746_T751delinsVA': 1, 'N276S': 1, 'L983F': 1, 'A40E': 1, 'D422N': 1, 'E31K': 1, 'R130A': 1, 'Amplification': 71, 'R130G': 1, 'V1736G': 1, 'R130K': 1, 'G508S': 1, 'R130L': 1, 'R802*': 1, 'X963_splice': 1, 'E804G': 1, 'S376F': 1, 'E79Q': 1, 'G207E': 1, 'EWSR1-ETV4 Fusion': 1, 'R479Q': 1, 'R1835P': 1, 'R154W': 1, 'D1280V': 1, 'F241S': 1, 'K656E': 1, 'R479H': 1, 'S453fs*': 1, 'R479L': 1, 'Y234H': 1, 'E79K': 1, 'T41A': 1, 'Q201H': 1, 'V430M': 1, 'G70D': 1, 'Q58L': 1, 'V555_L576del': 1, 'Q2858R': 1, 'N238S': 1, 'F367S': 1, 'MPRIP-NTRK1 Fusion': 1, 'R2842H': 1, 'E77K': 1, 'S729C': 1, 'P1749R': 1, 'S858R': 1, 'I1307K': 1, 'C1156F': 1, 'N1236K': 1, 'R130*': 1, 'S1002R': 1, 'C121S': 1, 'H1966Y': 1, 'G591V': 1, 'L32P': 1, 'N535K': 1, 'K320E': 1, 'G637W': 1, 'R283Q': 1, 'C1156Y': 1, 'RUNX1-EVI1 Fusion': 1, 'A113_splice': 1, 'H214N': 1, 'R640G': 1, 'S1140G': 1, 'V1576E': 1, 'K373E': 1, 'P306H': 1, 'H214R': 1, 'H214Q': 1, 'Y69H': 1, 'G245D': 1, 'G245A': 1, 'TFG-ROS1 Fusion': 1, 'I195T': 1, 'Q477E': 1, 'G245S': 1, 'N372H': 1, 'E1346K': 1, 'R631C': 1, 'L117P': 1, 'I251S': 1, 'E40N': 1, 'E40L': 1, 'E40K': 1, 'T771R': 1, 'E40W': 1, '963_D1010splice': 1, 'K291E': 1, 'E40Q': 1, 'K342N': 1, 'P491S': 1, 'V294M': 1, '1_2009trunc': 1, 'C136R': 1, 'I219V': 1, 'C136Y': 1, 'R1095H': 1, 'N857S': 1, 'I1616N': 1, 'I1616T': 1, 'T992I': 1, 'N653H': 1, 'Q59E': 1, 'C228T': 1, 'G1809R': 1, 'L46R': 1, 'D257N': 1, 'D300H': 1, 'R183P': 1, 'K117R': 1, 'D300N': 1, 'I26N': 1, 'L46F': 1, 'E586K': 1, 'T241P': 1, 'K117N': 2, 'G1809K': 1, 'E1051K': 1, 'R183G': 1, 'S45P': 1, 'P253R': 1, 'S1653F': 1, 'S45Y': 1, 'S10N': 1, 'S45A': 1, 'S45F': 1, 'L702H': 1, 'E606G': 1, 'S240R': 1, 'S1653P': 1, 'G1123S': 1, 'E1794D': 1, 'M774_A775insAYVM': 1, 'L601_K602insREYEYDL': 1, 'D331G': 1, 'G1123D': 1, 'E579K': 1, 'V794M': 1, 'F212Y': 1, 'H114Y': 1, '550_592del': 1, 'T338I': 1, 'E221Q': 1, 'I68K': 1, 'F522C': 1, 'D86N': 1, 'R324L': 1, 'K1062M': 1, 'V726M': 1, 'G469V': 1, 'I326V': 1, 'S2215F': 1, '256_286trunc': 1, 'G13R': 1, 'Q1503P': 1, 'G13V': 2, 'G469E': 1, 'R265C': 1, 'I204T': 1, 'G469A': 1, 'S2215Y': 1, 'T710A': 1, 'A504_Y505ins': 1, 'CD74-NTRK1 Fusion': 1, 'G13D': 2, 'E330K': 2, 'K558N': 1, 'V1838E': 1, 'I1807S': 1, 'E172K': 1, 'D820A': 1, 'D820G': 1, 'E330G': 1, 'D820E': 1, 'M133T': 1, 'F1088Sfs*2': 1, 'E452K': 1, 'D423N': 1, 'K507A': 1, 'F1061W': 1, 'A211D': 1, 'C634Y': 1, 'R1276Q': 1, 'C634W': 1, 'C634S': 1, 'C634R': 1, 'A532H': 1, 'F1174L': 1, 'F1174I': 1, 'E483*': 1, 'W802*': 1, 'H1421Y': 1, 'V773A': 1, 'G881D': 1, 'I767M': 1, 'S1670A': 1, 'S1088F': 1, 'S257W': 1, 'M117I': 1, 'T196A': 1, 'F1524V': 1, 'E14*': 1, 'ATG7-RAF1 Fusion': 1, 'S1172L': 1, 'Y849S': 1, 'V1673F': 1, 'D153V': 1, 'V559G': 1, 'C64G': 1, 'R337H': 1, 'D806H': 1, 'G810S': 1, 'X434_splice': 1, 'R38H': 1, 'A1789T': 1, 'L493V': 1, 'A34D': 1, 'A1789S': 1, 'R1896M': 1, 'A750P': 1, 'G116S': 1, 'R342W': 1, 'N848K': 1, 'L230P': 1, 'W1502A': 1, 'K935I': 1, 'V555M': 1, 'D108H': 1, 'G118D': 1, 'ACPP-PIK3CB Fusion': 1, 'C39S': 1, 'C39R': 1, 'W117C': 1, 'R776C': 1, 'C39Y': 1, 'V1653M': 1, 'R5Q': 1, 'Y113*': 1, 'G39E': 1, 'R481G': 1, 'G2430A': 1, 'Fusions': 34, 'D140G': 1, 'ROS1-CD74 Fusion': 1, 'T670I': 1, 'S2670L': 1, 'P33S': 1, 'A1131T': 1, 'A1020V': 1, 'G382D': 1, 'A1752P': 1, 'EGFRvII': 1, 'D835A': 1, 'ETV6-PDGFRB Fusion': 1, 'T1087I': 1, 'D1352Y': 1, 'P1139S': 1, 'EGFRvIV': 1, 'W557G': 1, 'K292I': 1, 'Wildtype': 1, 'P1771L': 1, 'R158H': 1, 'D845A': 1, 'F133V': 1, 'V777M': 1, 'V777A': 1, 'M1663K': 1, 'V277D': 1, 'S256F': 1, 'G1202R': 1, 'G623R': 1, 'S462Y': 1, 'L57del': 1, 'F133L': 1, 'G430C': 1, 'V391I': 1, 'K648N': 1, 'F1088Lfs*5': 1, 'D600_L601insFREYEYD': 1, 'D162G': 1, 'W1038C': 1, 'D162H': 1, 'I89T': 1, 'G266R': 1, 'G284R': 1, 'A2770T': 1, 'Y599_D600insPAPQIMSTSTLISENMNIA': 1, 'M237K': 1, 'I89N': 1, 'G266E': 1, 'D67N': 1, 'T131A': 1, 'M1K': 1, 'S362L': 1, 'R1758G': 1, 'G701S': 1, 'S1297del': 1, 'CAD-ALK Fusion': 1, 'E1021K': 1, 'M1R': 2, 'M1T': 1, 'D67Y': 1, 'M1V': 1, 'S783P': 1, 'G253C': 1, 'GOLGA4-PDGFRB Fusion': 1, 'I843del': 1, 'I47F': 1, 'R249M': 1, 'T485K': 1, 'R732Q': 1, 'K62R': 1, 'E627D': 1, 'F1734S': 1, 'Truncating Mutations in the PEST Domain': 1, 'F31I': 1, 'M1783T': 1, 'E501G': 1, 'Q120E': 1, 'YAP1-MAMLD1 Fusion': 1, 'T875N': 1, 'M1652T': 1, 'E501K': 1, 'A134D': 1, 'P305L': 1, 'M1652K': 1, 'M1783L': 1, 'T58I': 3, 'M1783I': 1, 'P449T': 1, 'G244D': 1, 'N542_E543del': 1, 'DNA binding domain deletions': 1, 'F102C': 1, 'G244R': 1, 'G244S': 1, 'R725C': 1, 'E565G': 1, 'H1620R': 1, 'A290T': 1, 'M1775K': 1, 'A347T': 1, 'A636P': 1, 'H845Y': 1, 'R2430M': 1, 'E921K': 1, 'G724S': 1, 'S70fsX93': 1, 'K526E': 1, 'S142I': 1, 'T1203K': 1, 'L830R': 1, 'R331P': 1, 'K101M': 1, 'S476I': 1, 'A8S': 1, 'C628Y': 1, 'CUX1-FGFR1 Fusion': 1, 'L747S': 1, 'W535L': 1, 'S250P': 1, 'L747F': 1, 'R506_K507insVLR': 1, 'H538Q': 1, 'P214L': 1, 'A1708E': 1, 'S59R': 1, 'G1567D': 1, 'A500T': 1, 'A205T': 1, 'G2748D': 1, 'H115N': 1, 'M1775V': 1, 'R182W': 1, 'L749P': 1, 'L1204F': 1, 'H115R': 1, 'L67P': 1, 'D1692N': 1, 'N1044K': 1, 'EWSR1-FLI1 Fusion': 1, 'T24A': 1, 'SSBP2-JAK2 Fusion': 1, 'E554_V559del': 1, 'S2483N': 1, 'K601N': 1, 'M887I': 1, 'SPAG9-JAK2 Fusion': 1, 'K601E': 1, 'M2327I': 1, 'Y553N': 1, 'EGFRvV': 1, 'T389K': 1, 'K601Q': 1, 'L1600P': 1, 'Y3092C': 1, 'A767_V769del': 1, 'M37K': 1, 'P114L': 1, 'P114S': 1, 'V379I': 1, 'L996F': 1, 'BCR-JAK2 Fusion': 1, 'S869L': 1, 'P848L': 1, 'K499E': 1, 'L1460P': 1, 'H193P': 1, 'Y220C': 1, 'V658A': 1, 'A389T': 1, 'H193N': 1, 'E255K': 1, 'R213Q': 1, 'K38N': 1, 'S1651F': 1, 'E279K': 1, 'Q233*': 1, 'STRN-ALK Fusion': 1, 'E1099K': 1, 'H1402Y': 1, 'G1079D': 1, 'E453A': 1, 'D821N': 1, 'C176F': 1, 'N1125I': 1, 'R879C': 1, 'R1464*': 1, 'L703P': 1, 'T488_P492del': 1, 'P577_D579del': 1, 'E88K': 1, 'N1228D': 1, 'M713I': 1, 'CD74-ROS1 Fusion': 1, 'N2113S': 1, 'R11K': 1, 'EWSR1-DDIT3 Fusion': 1, 'SEC16A1-NOTCH1 Fusion': 1, 'Y236S': 1, 'H36P': 1, 'EWSR1-ETV1 Fusion': 2, 'L550P': 1, 'L399V': 1, 'Y236D': 1, 'Y236C': 1, 'S1512I': 1, 'P94A': 1, 'FAM131B-BRAF Fusion': 1, 'F1245C': 1, 'R1751P': 1, 'R1751Q': 1, 'R273G': 1, 'E836K': 1, 'K1026E': 1, 'H1904R': 1, 'EWSR1-YY1 Fusion': 1, 'W509R': 1, 'E717K': 1, 'F1245V': 1, 'DNMT3B7': 1, 'C1483F': 1, 'C18Y': 1, 'C1483R': 1, 'L790F': 1, 'C1483W': 1, 'S46I': 1, 'C1483Y': 1, 'E996K': 1, 'C582F': 1, 'D770_N771insVDSVDNP': 1, 'C1385': 1, 'P480L': 1, 'L410R': 1, 'N564K': 1, 'I157T': 1, 'Y238A': 1, 'L1267S': 1, 'G1232D': 1, 'N564D': 1, 'V654A': 1, 'Y1853*': 1, 'Exon 20 insertion': 1, 'A2425T': 1, 'D24Y': 1, 'R724H': 1, 'NSD1-NUP98 Fusion': 1, 'V220F': 1, 'R156C': 1, 'Y1853C': 1, 'TMPRSS2-ERG Fusion': 1, 'L115R': 1, 'H773Y': 1, 'M552_W557del': 1, 'I463S': 1, 'L469V': 1, 'R139G': 1, 'N1178H': 1, 'H773L': 1, 'S335C': 1, 'V3079I': 1, 'L108P': 1, 'FGFR2-KIAA1967 Fusion': 1, 'D1399Y': 1, 'S1463F': 1, 'D761Y': 1, 'V1075F': 1, 'D846Y': 1, 'V211D': 1, 'P2415del': 1, 'I744_K745delinsKIPVAI': 1, 'L747_T751delinsP': 1, 'G87R': 1, 'E116K': 1, 'F158C': 1, 'D357Y': 1, 'L64P': 1, 'ATF7IP-JAK2 Fusion': 1, 'M587V': 1, 'D1420Y': 1, 'L915M': 1, 'S72R': 1, '533_534del': 1, 'K162D': 1, 'A146V': 2, 'H123Y': 1, 'P1502L': 1, 'N2436I': 1, 'E664K': 1, 'R181L': 1, 'H123Q': 1, 'R181H': 1, 'R1192P': 1, 'R611Q': 1, 'R611W': 1, 'H123D': 1, 'L1780P': 1, 'D1067Y': 1, 'D1067V': 1, 'A148T': 1, 'IGH-BCL2 Fusion': 1, 'L188Q': 1, 'L188V': 1, 'Y842C': 1, 'T319del': 1, 'V536E': 1, 'D1067A': 1, 'L617F': 1, 'H2428Q': 1, 'FGFR2-FAM76A Fusion': 1, 'Y652H': 1, 'CEP85L-PDGFRB Fusion': 1, 'L142P': 1, 'S459F': 1, 'Y1035C': 1, 'W731L': 1, 'E626K': 1, 'V550E': 1, 'T341P': 1, 'L726F': 1, 'L726I': 1, 'L2431P': 1, 'E255V': 1, 'R243A': 1, 'S33C': 1, 'E1705A': 1, 'S891A': 1, 'S492R': 1, 'E1705K': 1, 'Y1045W': 1, 'C77F': 1, 'K39N': 1, 'A4419S': 1, 'R1343L': 1, 'L844R': 1, 'TEL-RUNX1 Fusion': 1, 'V714M': 1, 'H1106D': 1, 'R389*': 1, 'L833V': 1, 'M2676T': 1, 'S297F': 1, 'K975E': 1, 'F468C': 1, 'A246P': 1, 'G776_V777insYVMA': 1, "3' Deletion": 1, 'E768D': 1, 'V579A': 1, 'K590R': 1, 'H105R': 1, 'L2721H': 1, 'S310F': 1, 'Q56_V60del': 1, 'I1349T': 1, 'BRD4-NUT Fusion': 1, 'S247Y': 1, 'R115L': 1, 'T1151dup': 1, 'R246K': 1, 'D119N': 1, 'Y139D': 1, 'R201H': 1, 'V343E': 1, 'F12L': 1, 'A1099T': 1, 'I638F': 1, 'W383R': 1, 'P838L': 1, 'V851A': 1, 'A36P': 1, 'E1210K': 1, 'I2500M': 1, 'N2875K': 1, 'S846I': 1, 'N219D': 1, 'V299G': 1, 'I2500F': 1, 'L165P': 1, 'G114R': 1, 'Y472C': 1, 'Exon 20 insertions/deletions': 1, 'Y472H': 1, 'G478C': 1, 'R978*': 1, 'W531C': 1, 'T417_D419delinsRG': 1, 'V560E': 1, 'V557I': 1, 'D1203N': 1, 'E321K': 1, 'S1290A': 1, 'E321G': 1, 'H355M': 1, 'R487W': 1, 'Y98H': 1, 'SDC4-ROS1 Fusion': 1, 'V274F': 1, 'K753E': 1, '534_536del': 1, 'K753A': 1, 'Q96P': 1, 'K753M': 1, 'F1662S': 1, 'P525L': 1, 'Y371S': 1, 'K1452N': 1, 'A1022E': 1, 'AR-V7': 1, 'T798M': 1, 'R2336H': 1, 'R177*': 1, 'G380R': 1, 'K341A': 1, 'M552_K558del': 1, 'A1669S': 1, 'L861F': 1, 'R505C': 1, 'L1596H': 1, 'V755I': 1, 'W557_V559delinsC': 1, 'E709K': 1, 'W349C': 1, 'G12A': 2, 'Y647C': 1, 'V1398D': 1, 'D2665G': 1, 'Q429*': 1, 'P551_V555del': 1, 'I491M': 1, 'K82T': 1, 'R177Q': 1, 'S1473P': 1, 'D2512Y': 1, 'G936R': 1, 'K659N': 1, 'V705E': 1, 'D2512G': 1, 'L221R': 1, 'Y237A': 1, 'K659E': 1, 'F57C': 1, 'RANBP1-ALK Fusion': 1, 'A149P': 1, 'E137K': 1, 'K442Nfs*': 1, 'F57L': 1, 'R669G': 1, 'F57V': 1, 'E168D': 1, 'E554_K558del': 1, 'R669C': 1, 'D65N': 1, 'L145R': 1, 'E120Q': 1, 'I1170S': 1, 'I1170N': 1, 'V1578del': 1, 'T340A': 1, 'H233N': 1, 'C238F': 1, 'W1718C': 1, 'F590G': 1, 'T1691K': 1, 'S222D': 2, 'K459_S460delinsN': 1, 'Delta-NTRK1 Fusion': 1, 'E157G': 1, 'R838Q': 1, 'R368C': 1, 'H694R': 1, 'F79S': 1, 'D887N': 1, 'L1947R': 1, 'Exon 19 insertion': 1, 'L118R': 1, 'L118P': 1, 'Y353L': 1, 'R1200W': 1, 'V597A': 1, 'D2870A': 1, 'Q472H': 1, 'K181M': 1, 'A767_V769dup': 1, 'N588D': 1, 'IGH-FGFR3 Fusion': 1, 'S478P': 1, 'V544_L545insAVLVLLVIVIISLI': 1, 'S646F': 1, 'R1275L': 1, 'W1610G': 1, 'H94Y': 1, 'I1250T': 1, 'L1433S': 1, 'A209T': 1, 'S4F': 1, 'T574_R588delinsL': 1, 'H83Y': 1, 'S270L': 1, 'E746_A750delinsQ': 1, 'S259A': 1, 'H83N': 1, 'S4P': 1, 'K539L': 1, 'C1767S': 1, 'S252W': 1, 'S645C': 1, 'E2419K': 1, 'D1818G': 1, 'V659E': 1, 'S653C': 1, 'N1730S': 1, 'T1623I': 1, 'V1714G': 1, 'L49H': 1, 'K428A': 1, 'T244R': 1, 'T3349A': 1, 'E1799K': 1, 'Q337*': 1, 'D390Y': 1, 'K558_V559del': 1, 'N540S': 1, 'N454D': 1, 'R897Q': 1, 'L348F': 1, 'H1746Q': 1, 'N45S': 1, 'G67W': 1, 'V1676D': 1, 'R206H': 1, 'G67R': 2, 'H396R': 1, 'H396P': 1, 'F351L': 1, 'D927G': 1, 'Y532C': 1, 'P449_L455del': 1, 'L348S': 1, 'H1746N': 1, 'A19V': 1, 'L785F': 1, 'Q2384K': 1, 'E326L': 1, 'A883T': 1, 'P261L': 1, 'W1291R': 1, 'A57V': 1, 'A677G': 1, 'N81Y': 1, 'V191D': 1, 'L1301R': 1, 'P551_E554del': 1, 'S614R': 1, 'A883F': 1, 'DNA binding domain insertions': 1, 'R683S': 1, 'V32G': 1, 'R268A': 1, 'G466V': 1, 'G466R': 1, 'L485_Q494del': 1, 'K56N': 1, 'G466E': 1, 'R342P': 1, 'R342Q': 1, 'G466A': 1, 'R922*': 1, 'H191D': 1, 'A232V': 1, 'D560Y': 1, 'A75P': 1, 'D770_N771insD': 1, 'Y34C': 1, 'V804L': 1, 'Q1500P': 1, 'E734Q': 1, 'Q2223K': 1, 'N550H': 1, 'M53I': 1, 'M269R': 1, 'G2032R': 1, 'L1904V': 1, 'E102_I103del': 1, 'C396R': 1, 'R420Q': 1, 'R833C': 1, 'L1152R': 1, 'FGFR1OP1-FGFR1 Fusion': 1, 'L1152P': 1, 'V742A': 1, 'G52R': 1, 'G857E': 1, 'G857A': 1, 'N48K': 1, 'Y234C': 1, 'Y791F': 1, 'P539R': 1, 'R776H': 1, 'G602R': 1, 'L225LI': 1, 'F81V': 1, 'L792R': 1, 'R1753T': 1, 'S243C': 1, 'V1736A': 1, 'K310R': 1, 'D245V': 1, 'T910M': 1, 'R174*': 1, 'R82P': 1, 'G1128S': 1, 'E466K': 1, 'P123M': 1, 'H398Y': 1, 'D609E': 1, 'D1709E': 1, 'D107Y': 1, 'D1709A': 1, 'A1830T': 1, 'T80A': 1, 'R64P': 1, 'L370fs': 1, 'T2681R': 1, 'NIN-PDGFRB Fusion': 1, 'P428L': 1, 'WDR48-PDGFRB Fusion': 1, 'X582_splice': 1, 'S243N': 1, 'C242S': 1, 'S100F': 1, 'V126D': 1, 'E1282V': 1, 'R659P': 1, 'C528S': 1, 'C242F': 1, 'R659L': 1, 'MIR143-NOTCH1 Fusion': 1, 'R158L': 1, 'Exon 20 insertions': 1, 'T582P': 1, 'D544H': 1, 'A146T': 2, 'Q1064R': 1, 'T283A': 1, 'R158C': 1, 'M232T': 1, 'A727V': 1, 'FGFR3 - BAIAP2L1 Fusion': 1, 'R659*': 1, 'I255F': 1, 'L1657P': 1, 'M1775E': 1, 'I538V': 1, 'Y406H': 1, 'V354E': 1, 'G1201E': 1, 'L536H': 1, 'Y646H': 1, 'R2842C': 1, 'L536R': 1, 'L536Q': 1, 'L536P': 1, 'S34Y': 1, 'Y646C': 1, 'V217D': 1, 'Y646F': 1, 'T674I': 1, 'G85R': 1, 'Q546K': 1, 'H662R': 1, 'H662Q': 1, 'Q79R': 1, 'V658F': 1, 'Q546E': 1, 'C481S': 1, 'Q79K': 1, 'Q276*': 1, 'Q546P': 1, 'Q546R': 1, 'Q79E': 1, 'F691L': 1, 'E746_S752delinsI': 1, 'E746_S752delinsA': 1, 'L838P': 1, 'R167W': 1, 'T205A': 1, 'F21A': 1, 'C277R': 1, 'L838V': 1, 'C277Q': 1, 'R1190W': 1, 'C277W': 1, 'E633K': 1, 'D594A': 1, 'D594G': 1, 'D594E': 1, 'L611V': 1, 'E875G': 1, 'A126G': 1, 'D594N': 1, 'A126D': 1, 'S387Y': 1, 'D594V': 1, 'A126S': 1, 'L251P': 1, 'D594Y': 1, 'N53A': 1, 'R1515H': 1, 'A126V': 1, 'V534E': 1, 'Q164R': 1, 'A617T': 1, 'Exon 19 deletion/insertion': 1, 'G356A': 1, 'I1171T': 1, 'F129L': 1, 'Q58_Q59insL': 1, 'I1171N': 1, 'R173P': 1, 'R505L': 1, 'V804M': 1, 'C456_R481del': 1, 'C609Y': 1, 'H1047Y': 1, 'V804G': 1, 'H1047R': 1, 'L390F': 1, 'E218*': 1, 'H1047L': 1, 'V2006L': 1, 'L826P': 1, 'ZC3H7B-BCOR Fusion': 1, 'V2006I': 1, 'G465E': 1, 'K57E': 1, 'D1546N': 1, 'K57T': 1, 'T576del': 1, 'R258H': 1, 'L622H': 1, 'L866M': 1, 'S2G': 1, 'P1856S': 1, 'P1856T': 1, 'R258C': 1, 'EWSR1-FEV Fusion': 1, 'CASP8L': 1, 'R88Q': 1, 'K467T': 1, 'Y280H': 1, 'W406A': 1, 'K525E': 1, 'R976H': 1, 'T263P': 1, 'KANK1-PDGFRB Fusion': 1, 'G106_R108del': 1, 'Hypermethylation': 1, 'W131A': 1, 'W131G': 1, 'T574insTQLPYD': 1, 'S56I': 1, 'I18V': 1, 'EBF1-PDGFRB Fusion': 1, 'S1613C': 1, 'R498L': 1, 'S1613G': 1, 'N564_Y578del': 1, 'N382H': 1, 'G20E': 1, 'R117G': 1, 'P48L': 1, 'E50K': 1, 'L747_E749del': 1, 'STRN-PDGFRA Fusion': 1, 'C1787S': 1, 'D401N': 1, 'FGFR2-CCDC6 Fusion': 1, 'T1481fs': 1, 'E554_I571del': 1, 'L82P': 1, 'CIC-DUX4 Fusion': 1, 'D106A': 1, 'R100*': 1, 'p61BRAF': 1, 'E29V': 1, 'G665A': 1, 'C360R': 1, 'TGFBR1*6A': 1, 'S119N': 1, 'H1918Y': 1, 'R100T': 1, 'G1194D': 1, 'V600D_K601insFGLAT': 1, 'N1878K': 1, 'V559D': 1, 'K1690N': 1, 'V559C': 1, 'D839G': 1, 'V559A': 1, 'L412F': 1, 'K550_K558del': 1, 'R3052Q': 1, 'P262H': 1, 'G123R': 1, 'G123S': 1, 'L358R': 1, 'E490K': 1, 'H206D': 1, 'E311_K312del': 1, 'C382R': 1, 'L448P': 1, 'D61N': 1, 'Y373C': 1, 'X1007_splice': 1, 'R683K': 1, 'D171G': 1, 'L747_A750delinsP': 1, 'D171N': 1, 'P463L': 1, 'K642E': 1, 'D96N': 1, 'R421*': 1, 'R592H': 1, 'TPR-NTRK1 Fusion': 1, 'F328V': 1, 'Y510C': 1, 'V1070E': 1, 'V559del': 1, 'L1273F': 1, 'BCR-PDGFRA Fusion': 1, 'V45L': 1, 'G44S': 1, 'R418G': 1, 'G434R': 1, 'L210R': 1, 'K289E': 1, 'R175L': 1, 'R836C': 1, 'A864T': 1, 'R175H': 1, 'G909R': 1, 'R175C': 1, 'E872K': 1, 'E135K': 1, 'R2505Q': 1, 'E23fs': 1, 'E142D': 1, 'W742C': 1, 'S384F': 1, 'P316L': 1, 'R634W': 1, 'R100A': 1, 'D595V': 1, 'R647A': 1, 'P336S': 1, 'L2230V': 1, 'M1689R': 1, 'M1689T': 1, 'T779fs': 1, 'R228K': 1, 'N463S': 1, 'E1356G': 1, 'T417_D419delinsI': 1, 'A614D': 1, 'W279*': 1, 'S330A': 1, 'Q1756C': 1, 'F317L': 1, 'W24S': 1, 'K376N': 1, 'P691S': 1, 'F248S': 1, 'E518K': 1, 'G248V': 1, 'E545Q': 1, 'LMNA-NTRK1 Fusion': 1, 'EGFR-PURB Fusion': 1, 'S584L': 1, 'E518A': 1, 'K129E': 1, 'N132K': 1, 'E545G': 1, 'K97M': 1, 'E545A': 1, 'L747_T751del': 1, 'E545K': 1, 'G322S': 1, 'L52F': 1, 'V648G': 1, 'T733I': 1, 'D74N': 1, 'T1852S': 1, 'V648I': 1, 'E758G': 1, 'R282W': 1, 'D74Y': 1, 'W257C': 1, 'L52R': 1, 'C334S': 1, 'R3052W': 1, 'Y553_K558del': 1, 'M18K': 1, 'A723D': 1, 'K291Q': 1, 'Q50P': 1, 'Q395*': 1, 'R601Q': 1, 'A1200V': 1, 'M664I': 1, 'M18T': 1, 'L147F': 1, 'Q538P': 1, 'D29H': 1, 'Q50*': 1, 'C2060G': 1, 'H168R': 1, 'L63F': 1, 'D29Y': 1, 'H231R': 1, 'H168N': 1, 'L63P': 1, 'L63V': 1, 'D252G': 1, 'S1164I': 1, 'I290A': 1, 'E552K': 1, 'N546K': 1, 'S1986Y': 1, 'FLT3 internal tandem duplications': 1, 'I347M': 1, 'E40T': 1, 'I290R': 1, 'V769_D770insGVV': 1, 'D770_P772dup': 1, 'V84L': 1, 'D1778Y': 1, 'R93W': 1, 'D3095E': 1, 'R93Q': 1, 'A77T': 1, 'P96Q': 1, 'A77P': 1, 'A77S': 1, 'D1778H': 1, 'Q324L': 1, 'N198_F199delinsI': 1, 'R200W': 1, 'D1778N': 1, 'P573_D579del': 1, 'R248Q': 1, 'EZR-ROS1 Fusion': 1, 'A2351G': 1, 'T507K': 1, 'H773dup': 1, 'K575M': 1, 'R248C': 1, 'R159G': 1, 'R248L': 1, 'R248K': 1, 'L858Q': 1, 'L858R': 1, 'T50I': 1, 'D557H': 1, 'I32del': 1, 'Y375_K455del': 1, 'G751R': 1, 'K2472T': 1, 'S463P': 1, 'L858M': 1, 'V1741G': 1, 'G13C': 2, 'A60V': 1, 'G464E': 1, 'E1682V': 1, 'K50E': 1, 'L23F': 1, 'E380Q': 1, 'K50L': 1, 'T1720I': 1, 'G464V': 1, 'G464R': 1, 'R1040L': 1, 'M136R': 1, 'D537E': 1, 'E1682K': 1, 'I111P': 1, 'I111R': 1, 'R1563S': 1, 'A627T': 1, 'R161G': 1, 'Q556_K558del': 1, 'T1343I': 1, 'I111A': 1, 'IGH-MYC Fusion': 1, 'I28T': 1, 'S566_E571delinsR': 1, 'P38L': 1, 'BCR-FGFR1 Fusion': 1, 'I111N': 1, 'Y803N': 1, 'K288Q': 1, 'G106D': 1, 'R48W': 1, 'R172S': 1, 'K765R': 1, 'R24C': 1, 'TRIM24-BRAF Fusion': 1, 'R172K': 1, 'R172M': 1, 'G106V': 1, 'L481F': 1, 'W237_Y242del': 1, 'R24P': 1, 'H773insLGNP': 1, 'R172G': 1, 'R1060H': 1, 'R110P': 1, 'Y63C': 1, 'A859_L883delinsV': 1, 'V60E': 1, 'R80L': 1, 'R80C': 1, 'V60M': 1, 'E459K': 1, 'G23D': 1, 'N296I': 1, 'T117M': 1, '596_619splice': 1, 'R80P': 1, 'R544W': 1, 'V319D': 1, 'R110L': 1, 'D287H': 1, 'R544S': 1, 'S1841A': 1, 'E439del': 1, 'GIT2-PDGFRB Fusion': 1, 'T73I': 2, 'S1498N': 1, 'R957Q': 1, 'L747_A750del': 1, 'A39P': 1, 'I35S': 1, 'D402Y': 1, 'S1841R': 1, 'CHTOP-NTRK1 Fusion': 1, 'G1706A': 1, 'P70R': 1, 'L576P': 1, 'G1706E': 1, 'V11A': 1, 'P1819S': 1, 'F808L': 1, 'F1761S': 1, 'N655K': 1, 'F161L': 1, 'D1010H': 1, 'F1761I': 1, 'A530T': 1, 'Y27S': 1, 'A530V': 1, 'L128F': 1, 'T878A': 1, 'E542Q': 1, 'G719S': 1, 'K375A': 1, 'E542V': 1, 'E612_F613insGYVDFREYEYDLKWEFRPRENLEF': 1, 'H878Y': 1, 'R961W': 1, 'G719A': 1, 'R183Q': 1, 'G719C': 1, 'G719D': 1, 'E542G': 1, 'T241M': 1, 'N826S': 1, 'E542K': 2, 'G376R': 1, 'N1387A': 1, 'K413E': 1, 'V271L': 1, 'V271A': 1, 'V600R': 1, 'T47D': 1, 'V600K': 1, 'V600M': 1, 'V2098I': 1, 'V600E': 1, 'V600D': 1, 'V600G': 1, 'S921R': 1, 'R143Q': 1, 'Y591D': 1, 'V750E': 1, 'K641N': 1, 'K641R': 1, 'R415G': 1, 'C105F': 1, 'I162M': 1, 'R873Q': 1, 'G216R': 1, 'R79P': 1, 'L283_D294del': 1, 'Y24C': 1, 'P395A': 1, 'E285K': 1, 'F1704S': 1, 'L146R': 1, 'Q249E': 1, 'A151T': 1, 'E285V': 1, 'Y68H': 1, 'C275S': 1, 'S36Y': 1, 'T352M': 1, 'R680*': 1, 'S1303N': 1, 'F154L': 1, 'R844H': 1, 'Y68D': 1, 'EGFR-RAD51 Fusion': 1, 'G334R': 1, 'Q1554H': 1, 'D641N': 1, 'D1853N': 1, 'F958S': 1, 'I834V': 1, 'F958V': 1, 'Y599_D600insSTDNEYFYVDFREYEY': 1, 'A120S': 1, 'P417A': 1, 'E143K': 1, 'Q809R': 1, 'W719*': 1, 'D60N': 1, 'I853T': 1, 'Y1414C': 1, 'X1008_splice': 1, 'R671Q': 1, 'K550_V555delinsI': 1, 'K128Q': 1, 'I42V': 1, 'K128N': 1, 'S123T': 1, 'Q816*': 1, 'R290H': 1, 'D816V': 1, 'BCOR-RARA Fusion': 1, 'C456_N468del': 1, 'R290G': 1, 'D186A': 1, 'F2108L': 1, 'V560D': 1, 'P173S': 1, 'R267P': 1, 'R267Q': 1, 'L1224F': 1, 'FGFR2?PPHLN1 Fusion': 1, 'V356R': 1, 'L1574P': 1, 'L755S': 1, 'L755P': 1, 'E207K': 1, 'P1675L': 1, 'R487*': 1, 'K483E': 1, 'V842I': 1, 'R304*': 1, 'K483M': 1, 'S214C': 1, 'T730S': 1, 'L704N': 1, 'R361C': 1, 'H78Q': 1, 'H1805P': 1, 'Q347_A348del': 1, 'P577_W582delinsPYD': 1, 'Y772_A775dup': 1, 'K1434I': 1, 'Y555C': 1, 'M117V': 1, 'A391E': 1, 'V422del': 1, 'R1699L': 1, 'P1776S': 1, 'D816E': 1, 'A633T': 1, 'W742L': 1, 'A633V': 1, 'MKRN1-BRAF Fusion': 1, 'I852M': 1, 'P2417A': 1, 'N1026S': 1, 'V487_P492delinsA': 1, 'S768I': 1, 'S206C': 1, 'ATF7IP-PDGFRB Fusion': 1, 'E365K': 1, 'Y418_D419del': 1, 'T1977R': 1, 'K550_W557del': 1, 'M1293A': 1, 'E1214K': 1, 'G1269S': 1, 'V343L': 1, 'H845_N848delinsP': 1, 'A263V': 1, 'L617M': 1, 'G1269A': 1, 'L747_P753delinsS': 1, 'Promoter Mutations': 1, 'S151A': 1, 'I408V': 1, 'P95L': 1, 'C618R': 2, 'R287A': 1, 'P95H': 1, 'Y1703H': 1, 'H875Y': 1, 'C311R': 1, 'L274P': 1, 'R201Q': 1, 'R2784W': 1, 'P4309A': 1, 'E267G': 1, 'V705M': 1, 'R249W': 1, 'V35M': 1, 'R249S': 1, 'L597V': 1, 'H1382Y': 1, 'F354L': 1, 'L597R': 1, 'L597S': 1, 'L597Q': 1, 'L1705P': 1, 'ARv567es': 1, 'V194M': 1, 'D572A': 1, 'N71I': 1, 'SPTBN1-PDGFRB Fusion': 1, 'D1733G': 1, 'I103N': 1, 'E70K': 1, 'CUL1-BRAF Fusion': 1, 'R796G': 1, 'V270A': 1, 'Y901C': 1, 'E49K': 1, 'N233Y': 1, 'G785S': 1, 'R441P': 1, 'G863D': 1, 'T80R': 1, 'E812K': 1, 'W563L': 1, 'W398V': 1, 'M244V': 1, 'M391R': 1, 'T1977K': 1, 'T167A': 2, 'E81K': 1, 'G863S': 1, 'A750_E758del': 1, 'T80K': 1, 'S1497A': 1, 'Y1235D': 1, 'S1025C': 1, 'S765P': 1, 'M253K': 1, 'N676S': 1, 'G31A': 1, 'C24Y': 1, 'N676K': 1, 'G914R': 1, 'C24R': 1, 'N78S': 1, 'G31V': 1, 'N676D': 1, 'G31R': 1, 'A23E': 1, 'S37Y': 1, 'L455M': 1, 'A598V': 1, 'R834Q': 1, 'A598T': 1, 'T783A': 1, 'FUS-ERG Fusion': 1, 'C712R': 1, 'K218T': 1, 'R173H': 1, 'E946*': 1, 'R49L': 1, 'W1837R': 1, 'R173C': 2, 'L28P': 1, 'C324Y': 1, 'P798L': 1, 'A2034V': 1, 'D61Y': 1, 'A121P': 1, 'A121V': 1, 'F53S': 1, 'G186R': 1, 'N987I': 1, 'V1092I': 1, 'F53C': 1, 'KIF5B-ALK Fusion': 1, 'A121E': 1, 'D737N': 1, 'K120M': 1, 'F53L': 1, 'Q157P': 1, 'S432L': 1, 'C250T': 1, 'R181C': 1, 'T1685I': 1, 'D717V': 1, 'E161del': 1, 'A18D': 1, 'A919V': 1, 'P2273S': 1, 'R47Q': 1, 'P551_W557delinsL': 1, 'M299I': 1, 'E1660G': 1, 'S37F': 1, 'V143A': 1, 'I562M': 1, 'R1598P': 1, 'N1100Y': 1, 'T1685A': 1, 'G375C': 1, 'S37C': 1, 'L485_P490delinsY': 1, 'S308A': 2, 'EZR-ERBB4 Fusion': 1, 'L485_P490delinsF': 1, 'P596L': 1, 'G375P': 1, 'H179R': 1, 'H179Q': 1, 'V1180L': 1, 'H179Y': 1, 'S279Y': 1, 'S505N': 1, 'M1411T': 1, 'Y823D': 1, 'H179N': 1, 'L770V': 1, 'H179L': 1, 'Y285C': 1, 'EGFR-KDD': 1, 'W24R': 1, 'H284P': 1, 'T630I': 1, 'Y297A': 1, 'T315I': 1, 'D473G': 1, 'D2312V': 1, 'N553S': 1, 'Q56P': 1, 'H284N': 1, 'S567L': 1, 'D1778G': 1, 'A298T': 1, 'R264C': 1, 'T1977S': 1, 'Copy Number Loss': 1, 'A459V': 1, 'R304W': 1, 'D325A': 1, 'E124Q': 1, 'Q545A': 1, 'P1637L': 1, 'L929S': 1, 'CEP110-FGFR1 Fusion': 1, 'F341V': 1, 'Exon 9 mutations': 1, 'T1977I': 1, 'TP53BP1-PDGFRB Fusion': 1, 'G1529R': 1, 'V104M': 1, 'FGFR2-BICC1 Fusion': 1, 'E139D': 1, 'A717G': 1, 'E636K': 1, 'R1262A': 1, 'L181P': 1, 'D617G': 1, 'S840_N841insGS': 1, 'S37A': 1, 'S2215T': 1, 'R280A': 1, 'R202S': 1, 'L611_E612insCSSDNEYFYVDFREYEYDLKWEFPRENL': 1, 'R482Q': 1, 'D473H': 1, 'A707T': 1, 'R280K': 1, 'R886W': 1, 'V465M': 1, 'R280T': 1, 'T195I': 1, 'P655R': 1, 'F341C': 1, 'E1060A': 1, 'ZNF198-FGFR1 Fusion': 1, 'R1097H': 1, 'K517R': 1, 'G13E': 1, 'S723F': 1, 'N480del': 1, 'T599_V600insEAT': 1, 'FGFR2-AHCYL1 Fusion': 1, 'M737I': 1, 'C49Y': 1, 'G419V': 1, 'S1206R': 1, 'Y65C': 1, 'L1593P': 1, 'E2014K': 1, 'L330R': 1, 'P648S': 1, 'S1206Y': 1, 'Y87C': 1, 'K745_A750del': 1, 'A95D': 1, 'P648L': 1, 'S247F': 1, 'C1365Y': 1, 'Y835F': 1, 'R1391S': 1, 'E746_A750del': 1, 'S227F': 1, 'R1594Q': 1, 'S227A': 1, 'P753S': 1, 'G1596V': 1, 'R1391G': 1, 'A763_Y764insFQEA': 1, 'KIAA1509-PDGFRB Fusion': 1, 'V843I': 1, 'K52R': 1, 'ESR1-YAP1 Fusion': 1, 'P169H': 1, 'N659K': 1, 'C1265S': 1, 'C47G': 1, 'V1665M': 1, 'K382E': 1, 'R2336P': 1, 'K507Q': 1, 'G161V': 1, 'C27A': 1, 'G776delinsVC': 1, 'K2729N': 1, 'N505I': 1, 'V894I': 1, 'K335I': 1, 'P531L': 1, 'S459del': 1, 'G853D': 1, 'S501_A502dup': 1, 'P130S': 2, 'Y163C': 1, 'L535P': 1, 'PAX8-PPAR? Fusion': 1, 'R273C': 1, 'N387K': 1, 'I1766S': 1, 'G831E': 1, 'P83L': 1, 'C482R': 1, 'R273H': 1, 'R273L': 1, 'E362H': 1, 'EWSR1-ATF1 Fusion': 1, 'ETV6-NTRK3 Fusion': 2, 'LRIG3-ROS1 Fusion': 1, 'G14V': 1, 'E3002K': 1, 'S768_V769insVAS': 1, 'H61R': 1, 'C71Y': 1, 'L246V': 1, 'H61D': 1, 'A1170V': 1, 'M980T': 1, 'P2476L': 1, 'D404G': 1, 'E462G': 1, 'A1065T': 1, 'Y87N': 1, 'ESR1-CCDC170 Fusion': 1, 'Y3098H': 1, 'F74S': 1, 'G2101A': 1, 'A1843T': 1, 'A1843P': 1, 'W1782C': 1, 'L1844R': 1, 'T413N': 1, 'P531S': 1, 'L747P': 1, 'I2285V': 1, 'D1344H': 1, 'K650E': 1, 'KDELR2-ROS1 Fusion': 1, 'R1276P': 1, 'D1384V': 1, 'T798I': 1, 'K650N': 1, 'K650T': 1, 'F170I': 1, 'C141Y': 1, 'K650Q': 1, 'E1836K': 1, 'K650R': 1, 'EWSR1-ERG Fusion': 1, 'L461V': 1, 'P375S': 1, 'A829P': 1, 'P48R': 1, 'R170W': 2, 'R170Q': 1, 'C381A': 1, 'V173L': 1, 'N822Y': 1, 'E5K': 1, 'S1206C': 1, 'V173E': 1, 'W1837G': 1, 'Y570H': 1, 'D324N': 1, 'T74P': 1, 'P48T': 1, 'W368H': 1, 'R1608S': 1, 'P291Qfs*51': 1, 'A1701P': 1, 'T3211K': 1, 'V765A': 1, 'Deletion': 74, 'G81D': 1, 'K78I': 1, 'K78A': 1, 'T725M': 1, 'F156L': 1, 'S1301R': 1, 'R1209W': 1, 'G81R': 1, 'A122*': 1, 'T34_A289del': 1, 'L180P': 1, 'H597Y': 1, 'S34F': 1, 'P1709L': 1, 'R1093*': 1, 'A1823T': 1, 'T1691I': 1, 'L19F': 1, 'V2969M': 1, 'E1286V': 1, 'R755S': 1, 'V155A': 1, 'E571K': 1, 'K83N': 1, 'V665A': 1, 'Q110R': 1, 'K83E': 1, 'R715G': 1, 'I31M': 2, 'T1151M': 1, 'N551K': 1, 'G1803A': 1, 'V109G': 1, 'R265S': 1, 'N549K': 1, 'W257G': 1, 'Y64A': 2, 'R370C': 1, 'N549S': 1, 'EWSR1-CREB1 Fusion': 1, 'S746fs': 1, 'N549T': 1, 'S1655F': 1, 'Q1826H': 1, 'L576del': 1, 'S23R': 1, 'P168T': 1, 'D2723G': 1, 'H174R': 1, 'V471F': 1, 'D2723H': 1, 'M541L': 1, 'Y174N': 1, 'Y35N': 1, 'IGL-MYC Fusion': 1, 'Y1248F': 1, 'S196N': 1, 'D661Y': 1, 'D661V': 1, 'R378A': 1, 'L708P': 1, 'G325R': 1, 'V1804A': 1, 'T2250A': 1, 'S451E': 1, 'S451F': 1, 'K65M': 1, 'G325A': 1, 'L57V': 1, 'A11_G12insGA': 1, 'TMPRSS2-ETV1 Fusion': 2, 'H132Y': 1, 'A171V': 1, 'M134L': 1, 'V561_I562insER': 1, '981_1028splice': 1, 'K1436Q': 1, 'HMGA2-RAD51B Fusion': 1, 'D842_M844del': 1, 'G1763V': 1, 'R87P': 1, 'G127E': 1, 'G127N': 1, 'L209F': 1, 'R272C': 1, 'R514C': 1, 'EP300-MLL Fusion': 1, 'V370D': 1, 'D935N': 1, 'Y806C': 1, 'T468M': 1, 'R87L': 1, 'P44L': 1, 'M1628V': 1, 'T599I': 1, 'P44A': 1, 'D450E': 1, 'I1183T': 1, 'D450H': 1, 'A1066V': 1, 'E839K': 1, 'R201W': 1, 'P186S': 1, 'L272F': 1, 'S1715C': 1, 'R1088C': 1, 'V839G': 1, 'P654L': 1, 'S1715N': 1, 'S259F': 1, 'Exon 2 mutations': 1, 'R203C': 1, 'V1673D': 1, 'S1715R': 1, 'A72V': 1, 'S259P': 1, 'A72S': 1, 'W557R': 1, 'A339V': 1, 'K22A': 1, 'D835H': 1, 'NFASC-NTRK1 Fusion': 1, 'D835E': 1, 'E746_T751insIP': 1, 'W1456R': 1, 'E265K': 1, 'L265P': 1, 'MLL-TET1 Fusion': 1, 'L325F': 1, 'D835Y': 1, 'T1219I': 1, 'TMPRSS2-ETV4 Fusion': 1, 'L37P': 1, 'P26S': 1, 'N181S': 1, 'C47S': 1, 'N235D': 1, 'R841K': 2, 'Y179C': 1, 'N771_H773dup': 1, 'R841Q': 1, 'L1019V': 1, 'V272L': 1, 'R1627': 1, 'C124N': 1, 'Y220S': 1, 'T131I': 1, 'HIP1-PDGFRB Fusion': 1, 'R140Q': 1, 'T131L': 1, 'E709_T710delinsD': 1, 'T131S': 1, 'C809G': 1, 'CEP85L-ROS1 Fusion': 1, 'C124S': 1, 'C124R': 1, 'T28I': 1, 'R162*': 1, 'T82A': 1, 'L1026F': 1, 'Q635E': 1, 'W515L': 1, 'W515K': 1, 'I563_L576del': 1, 'N56T': 1, 'I33del': 1, 'G1770V': 1, 'R574fs': 1, 'N486_P490del': 1, 'IGK-MYC Fusion': 1, 'V128del': 1, 'R292A': 1, 'L97R': 1, 'G101S': 1, 'L582F': 1, 'G101W': 1, 'Y846C': 1, 'R683T': 1, 'F1888L': 1, 'L668F': 1, 'F1888I': 1, 'H701P': 1, 'PAPSS1-BRAF Fusion': 1, 'L493P': 1, 'R683G': 1, 'M199del': 1, 'F1888V': 1, 'L112P': 1, 'L112R': 1, 'H1094R': 1, 'S310Y': 1, 'H1094Y': 1, 'Q61R': 3, 'K147E': 1, 'Q61L': 3, 'TEL-JAK2 Fusion': 1, 'Q61K': 2, 'Q61H': 3, 'F594_R595insSDNEYFYVDF': 1, 'E106G': 1, 'D83V': 1, 'H1094L': 1, 'Y364H': 1, 'V569_L576del': 1, 'K2411T': 1, 'I130M': 1, 'D579del': 1, 'D2033N': 1, 'Y572C': 1, 'N1102Y': 1, 'G503V': 1, 'T844M': 1, 'E190K': 1, 'G373R': 1, 'G2274V': 1, 'C91A': 1, 'Y646N': 1, 'L1854P': 1, 'L2865V': 1, 'R69C': 1, 'L2106P': 1, 'E541K': 1, 'C91S': 1, 'T75M': 1, 'F123I': 1, 'Y1703S': 1, 'W2626C': 1, 'G1128A': 1, 'I49S': 1, '385_418del': 1, 'K125R': 1, 'K125L': 1, 'K125M': 1, 'I668V': 1, 'E160*': 1, 'S35Q': 1, 'S273C': 1, 'I289M': 1, 'K125E': 1, 'R866C': 1, 'D326N': 1, 'CCND1-IGH Fusion': 1, 'G1286R': 1, 'P490_Q494del': 1, 'V1075Yfs*2': 1, 'D1810A': 1, 'W308C': 1, 'Y646S': 1, 'L485_P490del': 1, 'D258N': 1, 'R339W': 1, 'ETV6-PDGFRA Fusion': 1, 'M90I': 1, 'D350G': 1, 'D387V': 1, 'R561C': 1, 'S1986F': 1, 'P704S': 1, 'D108N': 1, 'Q510E': 1, 'S267_D273dup': 1, 'A1459P': 1, 'I559_D560insDKRMNS': 1, 'V1643A': 1, 'Promoter Hypermethylation': 2, 'E286K': 1, 'G596R': 1, 'S502T': 1, 'G596V': 1, 'H1862L': 1, 'K189N': 1, 'R2659T': 1, 'R661W': 1, 'S68W': 1, 'K79E': 1, 'G596C': 1, 'D842_H845del': 1, 'R2659K': 1, 'C554W': 1, 'G1971E': 1, 'Q209L': 2, 'C278F': 1, 'P278A': 1, 'P278L': 1, 'C238S': 1, 'T1700A': 1, 'R112G': 1, 'E1552del': 1, 'L362R': 1, 'P278S': 1, 'H492R': 1, 'K508M': 1, 'R282Q': 1, 'N1819S': 1, 'N1819Y': 1, 'Y599_D600insEYEYEYEY': 1, 'W714*': 1, 'W1718L': 1, 'M1043V': 1, 'M1043I': 1, 'R571W': 1, 'R348*': 1, 'PTPRZ1-MET Fusion': 1, 'N387P': 1, 'Exon 13 deletion': 1, 'D835N': 1, 'C620R': 1, 'K420A': 1, 'R262T': 1, 'C620Y': 1, 'PDE4DIP-PDGFRB Fusion': 1, 'H133Q': 1, 'V509A': 1, 'I2627F': 1, 'Y32S': 1, 'E239A': 1, 'G17A': 1, 'P1311T': 1, 'R2502H': 1, 'E1250K': 1, 'A349P': 1, 'D513Y': 1, 'R2502C': 1, 'K5N': 1, 'L239R': 1, 'R380A': 1, 'G34V': 1, 'K2950N': 1, 'G660D': 1, 'S214A': 1, 'R1699W': 1, 'R1699Q': 1, 'F877L': 1, 'C378R': 1, 'D816Y': 1, 'D816F': 1, 'D816G': 1, 'G34E': 1, 'I843_D846del': 1, 'H1686R': 1, 'H1686Q': 1, 'D816A': 1, 'D816N': 1, 'S214T': 1, 'D816H': 1, 'R20Q': 1, 'Y149D': 1, 'TFG-NTRK1 Fusion': 1, 'C420R': 1, 'V560G': 1, 'V592A': 1, 'L585I': 1, 'Y130C': 1, 'Q2405Rfs*17': 1, 'E1322*': 1, 'K618T': 1, 'Y112H': 1, 'A59T': 1, 'BTBD1-NTRK3 Fusion': 1, 'FGFR2-MGEA5 Fusion': 1, 'W603_E604insDREYEYDLKW': 1, 'V197L': 1, 'R248W': 1, 'P85L': 1, 'Y1045*': 1, 'A59G': 1, 'R497H': 1, 'R515G': 1, 'Y489C': 1, 'V995M': 1, 'Y931C': 1, 'K398A': 1, 'R2973C': 1, 'R306S': 1, 'R99P': 1, 'K607T': 1, 'P29S': 1, 'R109*': 1, 'V348L': 1, 'I15T': 1, 'E1071W': 1, 'P29L': 1, 'P152L': 1, 'V710A': 1, 'F537_K539delinsL': 1, 'D769A': 1, 'A1752V': 1, 'P81T': 1, 'E82D': 1, 'E82G': 1, 'D769H': 1, 'P531A': 1, 'L232LI': 1, 'W290_I291delinsC': 1, 'R748G': 1, 'E82V': 1, 'D769Y': 1, 'N659R': 1, 'A546D': 1, 'R71G': 1, 'Truncating Mutations': 93, 'Q1811R': 1, 'E746_T751delinsA': 1, 'K292T': 1, 'K379E': 1, 'P1614S': 1, 'P81L': 1, 'T150I': 1, 'N822I': 1, 'N822H': 1, 'N234I': 1, 'L485F': 1, 'CCDC6-ROS1 Fusion': 1, 'H41R': 1, 'P577S': 1, 'D408Y': 1, 'Epigenetic Silencing': 1, 'C1697R': 1, 'K745M': 1, 'D408E': 1, 'TRB-NKX2-1 Fusion': 1, 'D1091N': 1, 'D408H': 1, 'Q579_L581del': 1, 'R462E': 1, 'C125S': 1, 'R428A': 1, 'E35*': 1, 'V1810G': 1, 'W557_V560del': 1, 'T19N': 1, 'E846K': 1, 'T2722R': 1, 'R689S': 1, 'D1010Y': 1, 'K179M': 1, 'H876Q': 1, 'E127G': 1, 'S428F': 1, 'E281K': 1, 'D1010N': 1, 'Q22R': 1, 'Y40A': 1, 'P286H': 1, 'S1722F': 1, 'P286R': 1, 'V1534M': 1, 'N1333Gfs*': 1, 'D1029Y': 1, 'EGFRvIII': 1, 'I279P': 1, 'N517S': 1, 'V157D': 1, 'V777L': 1, 'V157F': 1, 'E219K': 1, 'R320Q': 1, 'R462C': 1, 'I168F': 1, 'I151S': 1, 'L122R': 1, 'R462I': 1, 'L1764P': 1, 'K650M': 1, 'Q60K': 1, 'G309A': 1, 'G309E': 1, 'EWSR1-NR4A3 Fusion': 1}
In [44]:
plt.figure()
x=np.arange(len(unique_variation_gene_count))
plt.bar(x,unique_variation_gene_count.values(),align='center')
plt.xticks(x,unique_variation_gene_count.keys(),rotation=90)
ymax=max(unique_variation_gene_count.values())+1
plt.ylim(0,ymax)

plt.title("No. of Variations per Gene")

plt.show()

Observation

From this plot we can conclude that most of the variation per gene is having value as 1.

In [47]:
# number of unique genes
print "the number of unique genes =", len(unique_genes)
the number of unique genes = 264
In [48]:
# assigning unique numbers for each unique genes
number_gene={}
ctr=1
for i in set(unique_genes):
    number_gene[ctr]=i
    ctr=ctr+1
In [49]:
# assigning unique numbers for each unique variation
number_variation={}
ctr=1
for i in set(unique_variation):
    number_variation[ctr]=i
    ctr=ctr+1
In [50]:
# creating a new dataframe with the numerical values
import pandas as pd
dfnum_traino=train.copy(deep=True)
#for gene

for index, row in dfnum_traino.iterrows():
    for j in dict(number_gene):
        if(row['Gene']==number_gene[j]):
            dfnum_traino.loc[index,'Gene']=j
In [51]:
#for variation   

for index, row in dfnum_traino.iterrows():
    for j in dict(number_variation):
        if(row['Variation']==number_variation[j]):
            dfnum_traino.loc[index,'Variation']=j                
print dfnum_traino
        ID Gene Variation  Class  \
0        0  121      2500      1   
1        1   60       913      2   
2        2   60      2069      2   
3        3   60      2311      3   
4        4   60      1164      4   
5        5   60       651      4   
6        6   60       756      5   
7        7   60      2517      1   
8        8   60      1362      4   
9        9   60       117      4   
10      10   60      1312      4   
11      11   60      2422      4   
12      12   60      1542      4   
13      13   60       116      4   
14      14   60      1579      5   
15      15   60      1481      4   
16      16   60      2500      1   
17      17   60       549      4   
18      18   60      2473      5   
19      19   60      1366      4   
20      20   60      1453      6   
21      21   60      1541      4   
22      22   60      1999      4   
23      23   60       739      4   
24      24   60      2091      4   
25      25   60      1569      4   
26      26  216      1699      4   
27      27  107      2701      4   
28      28  107       824      7   
29      29  107       239      4   
...    ...  ...       ...    ...   
3291  3291   65      2833      7   
3292  3292   65      2483      7   
3293  3293   65      1680      1   
3294  3294   65      2374      7   
3295  3295   65       359      4   
3296  3296   65      2211      7   
3297  3297   35      2485      4   
3298  3298   35      2643      4   
3299  3299   35      2692      4   
3300  3300   35      2544      4   
3301  3301   35      1391      4   
3302  3302   35       529      4   
3303  3303   35      2901      4   
3304  3304   35      2189      4   
3305  3305   35      1070      4   
3306  3306   35      1826      1   
3307  3307   35      2500      1   
3308  3308   35      1564      7   
3309  3309   35      1779      1   
3310  3310   35       730      7   
3311  3311   35       781      4   
3312  3312   35      1296      4   
3313  3313   35      2138      4   
3314  3314   35       518      6   
3315  3315   35       689      4   
3316  3316   35      1782      4   
3317  3317   35      2528      1   
3318  3318   35       960      1   
3319  3319   35      1983      4   
3320  3320   35      2547      4   

                                                   TEXT  LenText  Text_count  
0     Cyclin-dependent kinases (CDKs) regulate a var...    39765        6089  
1      Abstract Background  Non-small cell lung canc...    36831        5722  
2      Abstract Background  Non-small cell lung canc...    36831        5722  
3     Recent evidence has demonstrated that acquired...    36308        5572  
4     Oncogenic mutations in the monomeric Casitas B...    41427        6202  
5     Oncogenic mutations in the monomeric Casitas B...    41427        6202  
6     Oncogenic mutations in the monomeric Casitas B...    41427        6202  
7     CBL is a negative regulator of activated recep...    99183       14683  
8      Abstract Juvenile myelomonocytic leukemia (JM...    80070       11923  
9      Abstract Juvenile myelomonocytic leukemia (JM...    38642        5721  
10    Oncogenic mutations in the monomeric Casitas B...    41427        6202  
11    Noonan syndrome is an autosomal dominant conge...    83577       12174  
12    Noonan syndrome is an autosomal dominant conge...   173915       25594  
13    Noonan syndrome is an autosomal dominant conge...    83577       12174  
14    Oncogenic mutations in the monomeric Casitas B...    41427        6202  
15    Noonan syndrome is an autosomal dominant conge...    42149        5972  
16    To determine if residual cylindrical refractiv...    54603        8118  
17    Acquired uniparental disomy (aUPD) is a common...    20770        2992  
18    Oncogenic mutations in the monomeric Casitas B...    41427        6202  
19    Acquired uniparental disomy (aUPD) is a common...    20770        2992  
20     Abstract Background  Non-small cell lung canc...    78259       11924  
21    Oncogenic mutations in the monomeric Casitas B...    41427        6202  
22    Oncogenic mutations in the monomeric Casitas B...    41427        6202  
23    Recent evidence has demonstrated that acquired...    36308        5572  
24    Recent evidence has demonstrated that acquired...    36308        5572  
25    Recent evidence has demonstrated that acquired...    36308        5572  
26     Abstract N-myristoylation is a common form of...    45247        6648  
27    Heterozygous mutations in the telomerase compo...     9298        1412  
28    Sequencing studies have identified many recurr...    91254       13783  
29    Heterozygous mutations in the telomerase compo...     9298        1412  
...                                                 ...      ...         ...  
3291  We investigatedthe transformingactivityofthe r...    12246        1913  
3292  We investigatedthe transformingactivityofthe r...    12246        1913  
3293  RET is a transmembrane tyrosine kinase partici...    25840        3866  
3294  Introduction  Inherited germ line activating m...    35755        5291  
3295  Many missense mutations in the RET proto-oncog...    32424        4733  
3296  The RET proto-oncogene encodes a receptor tyro...   122733       18629  
3297  THE AML1 GENE IS KNOWN as the most frequent ta...   102856       15079  
3298  Introduction  Myelodysplastic syndromes (MDS) ...    40363        6068  
3299  The BCR-ABL fusion protein generated by t(9;22...    43466        6332  
3300  The most frequent mutations associated with le...    67372        9818  
3301  The most frequent mutations associated with le...    75167       11001  
3302  Familial platelet disorder with predisposition...    38887        5799  
3303  Introduction  Myelodysplastic syndromes (MDS) ...    40363        6068  
3304  Familial platelet disorder with predisposition...    38887        5799  
3305  Introduction  Myelodysplastic syndromes (MDS) ...    40363        6068  
3306  Here we report two new RUNX1 mutations in one ...    11999        1764  
3307  The RUNX genes have come to prominence recentl...    99482       14964  
3308   Familial platelet disorder with propensity to...    41189        6058  
3309  The BCR-ABL fusion protein generated by t(9;22...    43466        6332  
3310  RUNX proteins belong to a family of metazoan t...    65747        9117  
3311  The AML1/EVI-1 chimeric gene is generated by t...    56551        7898  
3312  Balanced chromosomal translocations are freque...    33648        4960  
3313  The BCR-ABL fusion protein generated by t(9;22...    43466        6332  
3314  Introduction  Myelodysplastic syndromes (MDS) ...    94605       14210  
3315  The RUNX gene family includes three evolutiona...    81503       11803  
3316  Introduction  Myelodysplastic syndromes (MDS) ...    74278       11021  
3317  Introduction  Myelodysplastic syndromes (MDS) ...    40363        6068  
3318  The Runt-related transcription factor 1 gene (...    36384        5122  
3319  The RUNX1/AML1 gene is the most frequent targe...    32568        4885  
3320  The most frequent mutations associated with le...    67372        9818  

[3321 rows x 7 columns]
In [52]:
dfnum_traino.shape
Out[52]:
(3321, 7)
In [53]:
import seaborn as sns
import matplotlib.pyplot as plt
import numpy as np
plt.figure(figsize=(200,150))
sns.FacetGrid(dfnum_traino, hue="Class", size=12) \
   .map(plt.scatter, "Gene", "Variation") \
   .add_legend();
# sns.FacetGrid(dfnum_traino.loc[dfnum_traino['Class']==7], hue="Class", size=12) \
#    .map(plt.scatter, "Gene", "Variation") \
#    .add_legend();
plt.ylim(0, 3000)
plt.xlim(0,265)
plt.show();
<matplotlib.figure.Figure at 0x1b8b1748>

Takeaway from this plot

On a first look this plot gives nothing but a feel of randomness. But on careful observation, we can see that there are some verical as well as horizontal relationship existing here.

For instance Class 1 have almost every unique genes for a particular variation. Class 7 has almost every variation for a particular gene. We can find similar relation for class 4 as well.

We can see that class 7 dominates over most of the genes and variation

In [54]:
dfnum_train2=dfnum_traino.copy(deep=True)
In [55]:
dfnum_train2.loc[:,'ClassName'] =dfnum_traino['Class'].astype(str)
In [57]:
del dfnum_train2['Class']
In [60]:
dfnum_train2.head()
Out[60]:
ID Gene Variation TEXT LenText Text_count ClassName
0 0 121 2500 Cyclin-dependent kinases (CDKs) regulate a var... 39765 6089 1
1 1 60 913 Abstract Background Non-small cell lung canc... 36831 5722 2
2 2 60 2069 Abstract Background Non-small cell lung canc... 36831 5722 2
3 3 60 2311 Recent evidence has demonstrated that acquired... 36308 5572 3
4 4 60 1164 Oncogenic mutations in the monomeric Casitas B... 41427 6202 4
In [61]:
sns.pairplot(dfnum_train2, hue="ClassName", size=3, diag_kind="kde");
plt.show()
In [37]:
#checking max occurance of genes
gene_group = train.groupby("Gene")['Gene'].count()
print gene_group.sort_values(ascending=False)[:15]
Gene
BRCA1     264
TP53      163
EGFR      141
PTEN      126
BRCA2     125
KIT        99
BRAF       93
ERBB2      69
ALK        69
PDGFRA     60
PIK3CA     56
CDKN2A     52
FGFR2      50
FLT3       49
TSC2       47
Name: Gene, dtype: int64

Top dominating genes for Class 1

In [258]:
train1=train[train.Class==1]
gene_group = train1.groupby("Gene")['Gene'].count()
cls1=gene_group.sort_values(ascending=False)[:10]
print cls1
Gene
TP53     83
BRCA1    60
TSC2     23
BRCA2    18
ERCC2    17
VHL      15
SMAD4    13
PTPRT    13
SMAD3    11
SMAD2    11
Name: Gene, dtype: int64
In [259]:
cls1.plot(kind='bar')
Out[259]:
<matplotlib.axes._subplots.AxesSubplot at 0x3dac4d30>

Top dominating genes for Class 2

In [260]:
train2=train[train.Class==2]
gene_group = train2.groupby("Gene")['Gene'].count()
cls2=gene_group.sort_values(ascending=False)[:10]
print cls2
Gene
EGFR      45
KIT       44
BRAF      31
ABL1      24
ROS1      18
HRAS      15
PDGFRB    14
ERBB2     14
PIK3CA    14
NOTCH1    13
Name: Gene, dtype: int64
In [261]:
cls2.plot(kind='bar')
Out[261]:
<matplotlib.axes._subplots.AxesSubplot at 0x21d41550>

Top dominating genes for Class 3

In [262]:
train3=train[train.Class==3]
gene_group = train3.groupby("Gene")['Gene'].count()
cls3=gene_group.sort_values(ascending=False)[:10]
print cls1
Gene
TP53     83
BRCA1    60
TSC2     23
BRCA2    18
ERCC2    17
VHL      15
SMAD4    13
PTPRT    13
SMAD3    11
SMAD2    11
Name: Gene, dtype: int64
In [263]:
cls3.plot(kind='bar')
Out[263]:
<matplotlib.axes._subplots.AxesSubplot at 0x1d8550f0>

Top dominating genes for Class 4

In [264]:
train4=train[train.Class==4]
gene_group = train4.groupby("Gene")['Gene'].count()
cls4=gene_group.sort_values(ascending=False)[:10]
print cls4
Gene
PTEN      115
TP53       67
CDKN2A     47
BRCA1      46
TSC2       23
VHL        23
CBL        16
RUNX1      16
PIK3R1     15
SPOP       15
Name: Gene, dtype: int64
In [265]:
cls4.plot(kind='bar')
Out[265]:
<matplotlib.axes._subplots.AxesSubplot at 0x48fc2f98>

Top dominating genes for Class 5

In [266]:
train5=train[train.Class==5]
gene_group = train5.groupby("Gene")['Gene'].count()
cls5=gene_group.sort_values(ascending=False)[:10]
print cls5
Gene
BRCA1     90
BRCA2     20
FGFR3     10
BRAF       8
PIK3CA     8
MLH1       6
ERBB2      6
IDH1       6
FGFR2      5
PMS2       5
Name: Gene, dtype: int64
In [267]:
cls5.plot(kind='bar')
Out[267]:
<matplotlib.axes._subplots.AxesSubplot at 0x48fdb438>

Top dominating genes for Class 6

In [268]:
train6=train[train.Class==6]
gene_group = train6.groupby("Gene")['Gene'].count()
cls6=gene_group.sort_values(ascending=False)[:10]
print cls6
Gene
BRCA2    84
BRCA1    53
JAK2     14
ERBB2    12
MLH1     11
TET2      6
TP53      5
KRAS      4
PIM1      4
ERBB4     4
Name: Gene, dtype: int64
In [269]:
cls6.plot(kind='bar')
Out[269]:
<matplotlib.axes._subplots.AxesSubplot at 0xf838e10>

Top dominating genes for Class 7

In [29]:
train7=train[train.Class==7]
gene_group = train7.groupby("Gene")['Gene'].count()
cls7=gene_group.sort_values(ascending=False)[:10]
print cls7
Gene
EGFR      84
KIT       52
BRAF      48
ALK       47
PDGFRA    42
FLT3      37
KRAS      33
PIK3CA    31
ERBB2     30
FGFR2     27
Name: Gene, dtype: int64
In [271]:
cls7.plot(kind='bar')
Out[271]:
<matplotlib.axes._subplots.AxesSubplot at 0x4909cb70>

Top dominating genes for Class 8

In [272]:
train8=train[train.Class==8]
gene_group = train8.groupby("Gene")['Gene'].count()
cls8=gene_group.sort_values(ascending=False)[:10]
print cls8
Gene
BCOR      4
H3F3A     3
IDH2      2
AKT1      2
SF3B1     1
RHOA      1
RAD51B    1
PTPN11    1
IDH1      1
ERBB2     1
Name: Gene, dtype: int64
In [273]:
cls8.plot(kind='bar')
Out[273]:
<matplotlib.axes._subplots.AxesSubplot at 0x17404208>

Top dominating genes for Class 9

In [274]:
train9=train[train.Class==9]
gene_group = train9.groupby("Gene")['Gene'].count()
cls9=gene_group.sort_values(ascending=False)[:10]
print cls9
Gene
SF3B1    15
EZH2      6
IDH1      4
U2AF1     3
IDH2      3
AKT2      2
TP53      1
MED12     1
CTCF      1
CIC       1
Name: Gene, dtype: int64
In [275]:
cls9.plot(kind='bar')
Out[275]:
<matplotlib.axes._subplots.AxesSubplot at 0x1b00a550>

Observation from the above plots:

The 9 plots for each class shows the distribution of the highest occuring genes for each class. This would help us in understanding and thus would help us in featurization.

In [39]:
import re
import nltk
from nltk.corpus import stopwords
from wordcloud import WordCloud
import string
import matplotlib.pyplot as plt
import collections
def create_word_corpus(class_value):
    text=train[train.Class==int(class_value)].TEXT.tolist()
    for i in range (len(text)):
        for k in text[i].split("\n"):
            text[i]=re.sub(r"[^a-zA-Z0-9]+", ' ', k)
    
    word_dict=[]
    stopword=set(stopwords.words("english"))
    stopword.update((u'fig','1','figure'))
    stopword.update(str(i) for i in range(2,1000))
    for val in text:
        tokens =[]
        for element in val.split(" "):
            tokens.append(element)
    word_dict = [word for word in tokens if word.lower() not in stopword] 
    counter=collections.Counter(word_dict)
    top_1000_words=counter.most_common(1000)
    top_words=[]
    for i in range(0,len(top_1000_words)):
        top_words.append(top_1000_words[i][0])
    print top_words
    corpus=""
    for element in top_words:
        corpus=corpus+element+" "
    return corpus
In [15]:
# Plot cloud
def plot_Cloud(wordCloud):
    plt.figure( figsize=(20,10), facecolor='w')
    plt.imshow(wordCloud)
    plt.axis("off")
    plt.tight_layout(pad=0)
    plt.show()
    plt.savefig('wordclouds.png', facecolor='w', bbox_inches='tight')
In [40]:
# Generate a word cloud image
class1_wordclou = WordCloud(width=900, height=500).generate(create_word_corpus(str(1)))
['RUNX1', 'q22', 'AML', 'gene', 'RUNX1T1', 'M2', 'fusion', 'genes', 'protein', 'RHD', 'translocations', 'translocation', 'q13', 'involving', 'domain', 'chromosome', 'TAD', 'leukemia', 'patients', 'three', 'partner', 'identified', 'transcription', 'frame', 'ins', 'cases', 'p22', 'region', 'Table', 'ETV6', 'M1', 'MDS', 'located', 'two', 'part', 'q24', 'q12', 'activation', 'RP11', 'p13', 'one', 'transcript', 'amino', 'n', 'q26', 'truncated', 'cell', 'reported', 'band', 'q21', 'way', 'different', 'zinc', 'involved', 'chimeric', 'including', 'formation', 'chromosomes', 'binding', 'complex', 'q22q22', 'MECOM', 'M4', 'X', 'finger', 'function', 'type', 'AML1', 'target', 'analysis', 'homology', 'terminal', 'FISH', 'transcriptional', 'described', 'q21q22', 'Runt', 'RPL22L1', 'sequence', 'PRDM16', 'myeloid', 'breakpoint', 'retaining', 'acids', 'However', 'four', 'CMML', 'known', 'alias', 'hematopoietic', 'exons', 'leading', 'B', 'leads', 'also', 'using', 'cells', 'associated', 'includes', 'PCR', 'products', 'intron', 'family', 'CBFA2T3', 'p36', 'bCML', 'involve', 'acts', 'ZFPM2', 'Complex', 'shown', 'first', 'molecular', 'probe', 'q31', 'fused', 'codon', 'Acute', 'case', 'hematopoiesis', 'wild', 'rearrangement', 'retains', 'DNA', 'LPXN', 'entire', 'insertion', 'lymphoblastic', 'leukemogenesis', 'studies', 'repressor', 'q23', 'syndrome', '3q26', 'six', 'Fusion', 'material', 'proteins', 'YTHDF2', 'et', 'occurs', 'could', 'coding', 'another', 'showed', 'effect', 'USP42', 'factor', 'dominant', 'result', 'C', 'transcripts', 'almost', 'acute', 'clones', 'made', 'al', 'generated', 'insertions', 'splicing', 'MACROD1', 'pre', 'GATA', 'patient', 'acid', 'Ref', 'even', 'SH3D19', 'p35', 'occurred', 'TRPS1', 'BAC', 'regions', 'variant', 'marrow', 'cytogenetics', 'ZFP64', 'p12', 'exon', 'generation', 'acting', 'chromosomal', 'Disease', 'rearrangements', 'retained', 'unknown', 'major', 'already', 'myelomonocytic', 'PRDX4', 'relapse', 'normal', 'stop', 'negative', 'q34', 'p21', 'AFF3', 'cryptic', 'Chronic', 'q11', 'expression', 'sequences', 'N', 'Furthermore', 'recurrent', 'member', 'resulting', 'p11', 'differentiation', 'remaining', 'bands', 'q14', 'lead', 'USP16', 'TEL', 'fuses', '21q22', 'role', 'coded', 'generates', 'fusing', 'ZNF687', 'level', 'biological', 'new', 'q15', 'splice', 'others', 'unidentified', 'CBP', 'total', 'conventional', 'would', 'following', 'CPNE8', 'establishment', 'may', 'revealed', 'third', 'corepressor', 'blastic', 'FGA7', 'RT', 'several', 'involvement', 'characterized', 'mRNAs', 'cytogenetic', 'related', 'factors', 'crucial', '8q22', 'mechanisms', 'consequence', 'retain', 'regulation', 'signaling', 'b', 'likely', '20q', 'abnormalities', '77G18', 'phase', 'kb', 'normally', 'partners', '184L6', 'covering', 'RUNX2', 'activity', 'leukemogenic', 'p300', 'among', 'unusual', 'inv', 'seven', 'dual', 'Translocation', 'containing', 'promoters', 'Myelodysplastic', 'domains', 'fuse', 'without', 'alternate', 'color', 'definitive', 'five', 'Therefore', 'RUNX1b', 'q25', 'q28', 'stem', 'found', 'rare', 'remains', 'bone', 'chronic', 'coactivators', 'encodes', 'functions', 'owing', 'Chromosomal', 'p15', 'promotor', 'consists', 'aliases', 'contributing', 'contained', 'ten', 'recruitment', 'second', 'M6', 'proliferation', 'CBF', 'constitutive', 'usually', 'Three', 'distinct', 'contains', 'thus', 'cooperate', 'disruption', 'nine', 'VIII', 'example', 'performed', 'control', 'allowed', 'exclude', 'Several', 'end', 'prognosis', 'activator', 'AMP19', 'clone', 'blood', 'q', 'Abbott', 'presence', 'non', 'good', 'remission', 'Analysis', 'disrupted', '20q13', '135L8', 'elucidated', 'karyotype', 'termination', 'localization', 'CSF', 'length', 'frequent', 'embryo', 'Translocations', 'distal', 'least', 'open', 'interact', 'peripheral', 'HDACs', 'M5', 'regulator', 'well', 'subtype', 'majority', 'consequences', '434P20', 'malignant', 'translocated', 'histone', 'Whether', 'q33', 'q32', 'cannot', 'q35', 'remove', 'RUNX3', 'common', 'achieved', 'observed', 'full', 'p23', 'VWRPY', 'Molecular', 'breakpoints', 'various', 'reading', 'importance', 'suppresses', 'key', 'analyzed', 'many', 'presumably', 'regulates', 'spans', 'ETO', 'addition', 'contribute', 'search', 'conserved', 'inhibiting', 'Unfortunately', '245C23', 'Complete', 'FAB', 'higher', 'literature', 'driven', 'hemopathies', 'myeloblastic', 'spliced', 'identify', 'MDS1', 'except', 'confirmed', 'EVI1', 'affinity', 'real', 'early', 'PEBP2b', 'variants', 'nonfunctional', 'rarely', 'repression', 'HDAC', 'core', 'Copine', 'although', 'analyses', 'repressors', 'act', 'mSin3A', 'within', 'Two', 'locus', 'receptor', 'inhibitors', 'RUNX1a', 'elsewhere', 'portions', 'Truncated', 'us', 'similar', 'classic', 'single', 'codes', 'whereas', 'nuclear', 'Structure', 'nucleus', 'pathways', 'included', 'structure', 'myelodysplastic', 'therapy', 'portion', 'compete', 'time', 'code', 'illustrates', 'Sakamoto', 'results', 'nonrandomly', 'demonstrate', '0', 'preserved', 'include', 'applied', 'belonging', 'woman', 'downstream', 'activated', 'induce', 'CoR', 'putting', 'GM', 'categories', 'Cytogenetic', 'harbor', 'fewer', 'large', 'MTG16', 'Much', 'q11q22', 'small', 'prevent', 'Conventional', 'CDKN1A', 'repetitive', 'translating', 'promoter', 'bacterial', 'situated', 'plays', 'contributions', 'fingers', 'shares', 'peroxiredoxin', 'learned', 'contributes', 'overexpressed', 'recruits', 'hundreds', 'focused', 'strong', 'interval', 'obtained', 'inhibitor', 'study', 'larger', 'acetyltransferase', 'suggests', 'pediatric', 'Alternative', 'composed', 'named', 'followed', 'fluorescent', 'maturation', 'therefore', 'encode', 'tricho', 'central', 'high', 'heterogeneity', 'M7', 'intronic', 'yearold', 'repressing', 'male', 'ca', 'give', 'loci', 'share', 'ERF', 'lacks', 'leupaxin', 'subtelomere', 'information', 'provide', 'classifi', 'causing', 'possibly', 'significance', 'instead', 'ubiquitin', 'mesenchymal', 'M0', 'product', 'tried', 'MACRO', 'leukemic', 'thymopoiesis', 'types', 'efficiently', 'bind', 'therapyrelated', 'One', 'complexity', '6p21', 'q36', 'Unusual', 'cellular', 'insight', 'FOG2', 'TIS11b', 'years', 'facilitates', 'MECOMRPL22L1', 'group', 'late', 'RAS', 'clusters', 'junction', 'fragment', 'break', 'effects', 'Hospital', 'half', 'accumulate', 'mimicking', 'structural', 'lineage', 'Indeed', 'classified', 'multitype', 'referred', 'retrieve', 'Transcription', 'RHDs', 'year', 'try', 'Executive', 'AF4', 'Hybridization', 'G', 'retrieved', 'University', 'possibility', 'standard', 'YTH', 'generate', 'interaction', 'created', 'Rungis', 'Normal', 'Chan', 'paxillin', 'due', 'SMRT', '1p36', 'activators', 'ATG', 'possessing', 'number', 'FLT3', 'stimulation', 'approximately', 'L22', 'chemotherapy', 'friend', 'adaptor', 'always', 'XX', 'TLE', 'Gene', 'dependent', 'p53', '19q13', 'binds', 'determined', 'negatively', 'perspective', '20p', 'dualcolor', 'close', 'apparently', 'dozen', 'latter', 'responsible', 'nucleotide', 'mechanism', 'potential', 'pharyngeal', 'peroxidase', 'Unlike', 'arguing', 'complexes', 'eight', 'significant', 'quantitative', 'contain', 'Bone', 'probes', 'evolutionarily', 'disease', 'introns', 'banded', 'subsequent', 'usual', 'heterodimerization', 'precise', 'ending', 'discovered', 'expressed', 'aggressive', 'data', 'based', 'knowledge', 'untranslated', 'receptors', 'molecules', 'de', 'UTRs', 'q37', 'HAT', 'recurrently', 'peptidase', 'superfamily', 'runt', 'q13q22', 'seldom', 'colocalization', 'summary', 'implication', 'France', 'culture', 'see', 'arm', 'best', 'PR', 'probable', '4q31', 'Compilation', 'currently', 'artificial', 'Nguyen', 'trans', 'future', 'Members', 'Clones', 'consecutive', 'approach', 'available', 'Src', 'ability', 'however', 'rarer', 'WIG1', 'concentrated', 'Among', 'last', 'termed', 'according', 'transplantation', 'BLK', 'cancer', 'point', 'appeared', 'ubiquitinspecific', 'better', 'neurogenesis', 'MTG8', 'capable', 'intracellular', 'fusions', 'belongs', 'initiation', 'expected', 'generating', 'osteogenesis', 'versa', 'LAF4', 'understand', 'suspected', 'mapped', 'signing', 'present', 'exception', 'novel', 'subtelomeric', 'premature', 'replaced', 'deregulation', 'phenotypic', 'RNA', 'studied', 'cont', 'induced', 'site', 'surface', 'cluster', 'FMR2', 'situ', 'recruitingThe', 'preferentially', 'split', '1q21', 'subunit', 'development', 'upon', 'differing', 'IL', 'frequently', 'abnormality', 'shortly', 'Almost', 'depending', '1973', 'antiapoptotic', 'BCL2', 'translational', 'CBFA2', 'M5b', 'Given', 'proximal', 'ribosomal', 'useful', 'yet', 'previous', 'classification', 'candidate', 'signals', 'Although', 'Five', 'extensive', 'MYB', 'hybridization', 'putative', 'showing', 'possible', 'inhibition', 'PEBP2aB', 'apart', 'mRNA', 'like', 'identifying', 'alternatively', 'signal', 'represses', '12p13', 'specific', 'ectropic', 'Insertion', 'essential', 'old', 'often', 'Since', 'growth', 'q27', 'altered', 'organizing', 'viral', 'demonstrates', 'NOTCH1', 'integration', 'rhino', 'deacetylase', 'masked', 'AMP', 'fragments', 'Conclusion', 'demonstrated', 'myelodysplasia', 'assembly', 'broken', 'p32', 'STAT3', 'First', 'MPO', 'recognized', 'Still', 'protease', 'previously', '1990s', 'theoretically', 'novo', 'terminated', 'interacted', 'assumed', 'RUNX1c', 'Upon', 'long', '1989', 'neoplasm', 'EEN', 'Rowley', 'corepressors', 'wildtype', 'form', 'laboratory', 'Agerstam', 'myeloproliferative', 'megabases', 'Partner', 'true', 'vice', 'Groucho', 'exerts', 'Brest', 'fulllength', 'bearing', 'codons', 'TCRA', 'totaling', 'TCRB', 'evidence', 'Raimondi', 'accounting', 'p14', 'metaphases', 'EAP', 'poor', 'CtBP', 'LRP16', 'translated', 'searched', 'Future', 'Identification', 'homolog', 'Drosophila', 'Thus', 'presenting']

Word Cloud for Class 1

In [57]:
a=create_word_corpus(str(1))
word=[]
for element in a.split(" "):
    word.append(element)
b=set(word)
print b
['RUNX1', 'q22', 'AML', 'gene', 'RUNX1T1', 'M2', 'fusion', 'genes', 'protein', 'RHD', 'translocations', 'translocation', 'q13', 'involving', 'domain', 'chromosome', 'TAD', 'leukemia', 'patients', 'three', 'partner', 'identified', 'transcription', 'frame', 'ins', 'cases', 'p22', 'region', 'Table', 'ETV6', 'M1', 'MDS', 'located', 'two', 'part', 'q24', 'q12', 'activation', 'RP11', 'p13', 'one', 'transcript', 'amino', 'n', 'q26', 'truncated', 'cell', 'reported', 'band', 'q21', 'way', 'different', 'zinc', 'involved', 'chimeric', 'including', 'formation', 'chromosomes', 'binding', 'complex', 'q22q22', 'MECOM', 'M4', 'X', 'finger', 'function', 'type', 'AML1', 'target', 'analysis', 'homology', 'terminal', 'FISH', 'transcriptional', 'described', 'q21q22', 'Runt', 'RPL22L1', 'sequence', 'PRDM16', 'myeloid', 'breakpoint', 'retaining', 'acids', 'However', 'four', 'CMML', 'known', 'alias', 'hematopoietic', 'exons', 'leading', 'B', 'leads', 'also', 'using', 'cells', 'associated', 'includes', 'PCR', 'products', 'intron', 'family', 'CBFA2T3', 'p36', 'bCML', 'involve', 'acts', 'ZFPM2', 'Complex', 'shown', 'first', 'molecular', 'probe', 'q31', 'fused', 'codon', 'Acute', 'case', 'hematopoiesis', 'wild', 'rearrangement', 'retains', 'DNA', 'LPXN', 'entire', 'insertion', 'lymphoblastic', 'leukemogenesis', 'studies', 'repressor', 'q23', 'syndrome', '3q26', 'six', 'Fusion', 'material', 'proteins', 'YTHDF2', 'et', 'occurs', 'could', 'coding', 'another', 'showed', 'effect', 'USP42', 'factor', 'dominant', 'result', 'C', 'transcripts', 'almost', 'acute', 'clones', 'made', 'al', 'generated', 'insertions', 'splicing', 'MACROD1', 'pre', 'GATA', 'patient', 'acid', 'Ref', 'even', 'SH3D19', 'p35', 'occurred', 'TRPS1', 'BAC', 'regions', 'variant', 'marrow', 'cytogenetics', 'ZFP64', 'p12', 'exon', 'generation', 'acting', 'chromosomal', 'Disease', 'rearrangements', 'retained', 'unknown', 'major', 'already', 'myelomonocytic', 'PRDX4', 'relapse', 'normal', 'stop', 'negative', 'q34', 'p21', 'AFF3', 'cryptic', 'Chronic', 'q11', 'expression', 'sequences', 'N', 'Furthermore', 'recurrent', 'member', 'resulting', 'p11', 'differentiation', 'remaining', 'bands', 'q14', 'lead', 'USP16', 'TEL', 'fuses', '21q22', 'role', 'coded', 'generates', 'fusing', 'ZNF687', 'level', 'biological', 'new', 'q15', 'splice', 'others', 'unidentified', 'CBP', 'total', 'conventional', 'would', 'following', 'CPNE8', 'establishment', 'may', 'revealed', 'third', 'corepressor', 'blastic', 'FGA7', 'RT', 'several', 'involvement', 'characterized', 'mRNAs', 'cytogenetic', 'related', 'factors', 'crucial', '8q22', 'mechanisms', 'consequence', 'retain', 'regulation', 'signaling', 'b', 'likely', '20q', 'abnormalities', '77G18', 'phase', 'kb', 'normally', 'partners', '184L6', 'covering', 'RUNX2', 'activity', 'leukemogenic', 'p300', 'among', 'unusual', 'inv', 'seven', 'dual', 'Translocation', 'containing', 'promoters', 'Myelodysplastic', 'domains', 'fuse', 'without', 'alternate', 'color', 'definitive', 'five', 'Therefore', 'RUNX1b', 'q25', 'q28', 'stem', 'found', 'rare', 'remains', 'bone', 'chronic', 'coactivators', 'encodes', 'functions', 'owing', 'Chromosomal', 'p15', 'promotor', 'consists', 'aliases', 'contributing', 'contained', 'ten', 'recruitment', 'second', 'M6', 'proliferation', 'CBF', 'constitutive', 'usually', 'Three', 'distinct', 'contains', 'thus', 'cooperate', 'disruption', 'nine', 'VIII', 'example', 'performed', 'control', 'allowed', 'exclude', 'Several', 'end', 'prognosis', 'activator', 'AMP19', 'clone', 'blood', 'q', 'Abbott', 'presence', 'non', 'good', 'remission', 'Analysis', 'disrupted', '20q13', '135L8', 'elucidated', 'karyotype', 'termination', 'localization', 'CSF', 'length', 'frequent', 'embryo', 'Translocations', 'distal', 'least', 'open', 'interact', 'peripheral', 'HDACs', 'M5', 'regulator', 'well', 'subtype', 'majority', 'consequences', '434P20', 'malignant', 'translocated', 'histone', 'Whether', 'q33', 'q32', 'cannot', 'q35', 'remove', 'RUNX3', 'common', 'achieved', 'observed', 'full', 'p23', 'VWRPY', 'Molecular', 'breakpoints', 'various', 'reading', 'importance', 'suppresses', 'key', 'analyzed', 'many', 'presumably', 'regulates', 'spans', 'ETO', 'addition', 'contribute', 'search', 'conserved', 'inhibiting', 'Unfortunately', '245C23', 'Complete', 'FAB', 'higher', 'literature', 'driven', 'hemopathies', 'myeloblastic', 'spliced', 'identify', 'MDS1', 'except', 'confirmed', 'EVI1', 'affinity', 'real', 'early', 'PEBP2b', 'variants', 'nonfunctional', 'rarely', 'repression', 'HDAC', 'core', 'Copine', 'although', 'analyses', 'repressors', 'act', 'mSin3A', 'within', 'Two', 'locus', 'receptor', 'inhibitors', 'RUNX1a', 'elsewhere', 'portions', 'Truncated', 'us', 'similar', 'classic', 'single', 'codes', 'whereas', 'nuclear', 'Structure', 'nucleus', 'pathways', 'included', 'structure', 'myelodysplastic', 'therapy', 'portion', 'compete', 'time', 'code', 'illustrates', 'Sakamoto', 'results', 'nonrandomly', 'demonstrate', '0', 'preserved', 'include', 'applied', 'belonging', 'woman', 'downstream', 'activated', 'induce', 'CoR', 'putting', 'GM', 'categories', 'Cytogenetic', 'harbor', 'fewer', 'large', 'MTG16', 'Much', 'q11q22', 'small', 'prevent', 'Conventional', 'CDKN1A', 'repetitive', 'translating', 'promoter', 'bacterial', 'situated', 'plays', 'contributions', 'fingers', 'shares', 'peroxiredoxin', 'learned', 'contributes', 'overexpressed', 'recruits', 'hundreds', 'focused', 'strong', 'interval', 'obtained', 'inhibitor', 'study', 'larger', 'acetyltransferase', 'suggests', 'pediatric', 'Alternative', 'composed', 'named', 'followed', 'fluorescent', 'maturation', 'therefore', 'encode', 'tricho', 'central', 'high', 'heterogeneity', 'M7', 'intronic', 'yearold', 'repressing', 'male', 'ca', 'give', 'loci', 'share', 'ERF', 'lacks', 'leupaxin', 'subtelomere', 'information', 'provide', 'classifi', 'causing', 'possibly', 'significance', 'instead', 'ubiquitin', 'mesenchymal', 'M0', 'product', 'tried', 'MACRO', 'leukemic', 'thymopoiesis', 'types', 'efficiently', 'bind', 'therapyrelated', 'One', 'complexity', '6p21', 'q36', 'Unusual', 'cellular', 'insight', 'FOG2', 'TIS11b', 'years', 'facilitates', 'MECOMRPL22L1', 'group', 'late', 'RAS', 'clusters', 'junction', 'fragment', 'break', 'effects', 'Hospital', 'half', 'accumulate', 'mimicking', 'structural', 'lineage', 'Indeed', 'classified', 'multitype', 'referred', 'retrieve', 'Transcription', 'RHDs', 'year', 'try', 'Executive', 'AF4', 'Hybridization', 'G', 'retrieved', 'University', 'possibility', 'standard', 'YTH', 'generate', 'interaction', 'created', 'Rungis', 'Normal', 'Chan', 'paxillin', 'due', 'SMRT', '1p36', 'activators', 'ATG', 'possessing', 'number', 'FLT3', 'stimulation', 'approximately', 'L22', 'chemotherapy', 'friend', 'adaptor', 'always', 'XX', 'TLE', 'Gene', 'dependent', 'p53', '19q13', 'binds', 'determined', 'negatively', 'perspective', '20p', 'dualcolor', 'close', 'apparently', 'dozen', 'latter', 'responsible', 'nucleotide', 'mechanism', 'potential', 'pharyngeal', 'peroxidase', 'Unlike', 'arguing', 'complexes', 'eight', 'significant', 'quantitative', 'contain', 'Bone', 'probes', 'evolutionarily', 'disease', 'introns', 'banded', 'subsequent', 'usual', 'heterodimerization', 'precise', 'ending', 'discovered', 'expressed', 'aggressive', 'data', 'based', 'knowledge', 'untranslated', 'receptors', 'molecules', 'de', 'UTRs', 'q37', 'HAT', 'recurrently', 'peptidase', 'superfamily', 'runt', 'q13q22', 'seldom', 'colocalization', 'summary', 'implication', 'France', 'culture', 'see', 'arm', 'best', 'PR', 'probable', '4q31', 'Compilation', 'currently', 'artificial', 'Nguyen', 'trans', 'future', 'Members', 'Clones', 'consecutive', 'approach', 'available', 'Src', 'ability', 'however', 'rarer', 'WIG1', 'concentrated', 'Among', 'last', 'termed', 'according', 'transplantation', 'BLK', 'cancer', 'point', 'appeared', 'ubiquitinspecific', 'better', 'neurogenesis', 'MTG8', 'capable', 'intracellular', 'fusions', 'belongs', 'initiation', 'expected', 'generating', 'osteogenesis', 'versa', 'LAF4', 'understand', 'suspected', 'mapped', 'signing', 'present', 'exception', 'novel', 'subtelomeric', 'premature', 'replaced', 'deregulation', 'phenotypic', 'RNA', 'studied', 'cont', 'induced', 'site', 'surface', 'cluster', 'FMR2', 'situ', 'recruitingThe', 'preferentially', 'split', '1q21', 'subunit', 'development', 'upon', 'differing', 'IL', 'frequently', 'abnormality', 'shortly', 'Almost', 'depending', '1973', 'antiapoptotic', 'BCL2', 'translational', 'CBFA2', 'M5b', 'Given', 'proximal', 'ribosomal', 'useful', 'yet', 'previous', 'classification', 'candidate', 'signals', 'Although', 'Five', 'extensive', 'MYB', 'hybridization', 'putative', 'showing', 'possible', 'inhibition', 'PEBP2aB', 'apart', 'mRNA', 'like', 'identifying', 'alternatively', 'signal', 'represses', '12p13', 'specific', 'ectropic', 'Insertion', 'essential', 'old', 'often', 'Since', 'growth', 'q27', 'altered', 'organizing', 'viral', 'demonstrates', 'NOTCH1', 'integration', 'rhino', 'deacetylase', 'masked', 'AMP', 'fragments', 'Conclusion', 'demonstrated', 'myelodysplasia', 'assembly', 'broken', 'p32', 'STAT3', 'First', 'MPO', 'recognized', 'Still', 'protease', 'previously', '1990s', 'theoretically', 'novo', 'terminated', 'interacted', 'assumed', 'RUNX1c', 'Upon', 'long', '1989', 'neoplasm', 'EEN', 'Rowley', 'corepressors', 'wildtype', 'form', 'laboratory', 'Agerstam', 'myeloproliferative', 'megabases', 'Partner', 'true', 'vice', 'Groucho', 'exerts', 'Brest', 'fulllength', 'bearing', 'codons', 'TCRA', 'totaling', 'TCRB', 'evidence', 'Raimondi', 'accounting', 'p14', 'metaphases', 'EAP', 'poor', 'CtBP', 'LRP16', 'translated', 'searched', 'Future', 'Identification', 'homolog', 'Drosophila', 'Thus', 'presenting']
set(['', 'splicing', 'code', 'evidence', 'illustrates', 'Sakamoto', 'activated', 'results', 'integration', 'four', 'zinc', 'consists', 'fusing', 'aliases', 'repression', 'intron', 'acting', 'TAD', '0', 'preserved', 'differing', 'include', 'applied', 'belonging', 'woman', 'downstream', 'activation', 'ZNF687', 'induce', 'putting', 'MACROD1', 'GM', 'categories', 'entire', 'pre', 'Cytogenetic', 'level', 'harbor', 'fewer', 'occurs', 'large', 'MTG16', 'Much', 'demonstrated', 'small', 'biological', 'prevent', 'Conventional', 'approximately', 'ten', 'PCR', 'CDKN1A', 'repetitive', 'TCRB', 'M5', 'recruitment', 'second', 'M6', 'M1', 'M0', 'insertion', 'M2', 'lymphoblastic', 'assembly', 'bacterial', '77G18', 'Ref', 'situated', 'even', 'plays', 'q22q22', 'contributions', 'disease', 'fingers', 'shares', 'cell', 'peroxiredoxin', 'new', 'learned', 'contributes', 'overexpressed', 'proliferation', 'recruits', 'genes', 'hundreds', 'q15', 'reported', 'q11', 'q13', 'q12', 'strong', 'obtained', 'unidentified', 'male', 'CBF', 'SH3D19', 'involved', 'study', 'larger', 'acetyltransferase', 'suggests', 'CBP', 'resulting', 'constitutive', 'pediatric', 'usually', 'Alternative', 'gene', 'studies', 'nonrandomly', 'repressor', 'composed', 'named', 'followed', 'family', 'belongs', 'RUNX1c', 'Three', 'Therefore', 'fluorescent', 'total', 'maturation', 'conventional', 'would', 'remains', 'repressors', 'contains', 'negative', 'therefore', 'encode', 'MDS', 'First', 'type', 'loci', 'AML1', 'binding', 'occurred', 'cooperate', 'disruption', 'known', 'central', 'cases', 'potential', 'high', 'heterogeneity', 'M7', 'created', 'intronic', 'nine', 'VIII', 'MPO', 'following', 'repressing', 'inhibitor', 'example', 'BAC', 'control', 'give', 'tricho', 'share', 'involve', 'ERF', 'performed', 'allowed', 'exclude', 'Several', 'leupaxin', 'subtelomere', 'AMP19', 'lacks', 'end', 'syndrome', 'provide', 'six', 'classifi', 'regions', 'prognosis', 'Fusion', 'possibly', 'significance', 'activator', 'instead', 'ubiquitin', 'establishment', 'translocation', 'mesenchymal', 'product', 'tried', 'information', 'may', 'MACRO', 'clone', 'variant', 'Disease', 'leukemic', 'blood', 'marrow', 'thymopoiesis', 'revealed', 'codes', 'types', 'efficiently', 'third', 'cytogenetics', 'bind', 'therapyrelated', '3q26', 'q31', 'One', 'q', 'alias', 'complexity', 'cannot', '6p21', 'chimeric', 'blastic', 'hematopoietic', 'Unusual', 'acts', 'cellular', 'Hospital', 'ZFPM2', 'FGA7', 'insight', 'FOG2', 'TIS11b', 'years', 'including', 'facilitates', 'RT', 'MECOMRPL22L1', 'Abbott', 'group', 'located', 'mSin3A', 'late', 'RAS', 'clusters', 'ZFP64', 'RP11', 'non', 'good', 'junction', 'remission', 'fragment', 'material', 'Analysis', 'break', 'band', 'p12', 'p13', 'exon', 'p15', 'receptor', 'half', 'accumulate', 'locus', '20q13', 'structural', 'lineage', 'subunit', '135L8', 'Indeed', 'classified', 'proteins', 'identified', 'multitype', 'found', 'RHD', 'characterized', 'referred', 'mRNAs', 'domain', 'cytogenetic', 'retrieve', 'exons', 'generation', 'Transcription', 'elucidated', 'RHDs', 'related', 'YTHDF2', 'Complex', 'year', 'et', 'karyotype', 'try', 'shown', 'termination', 'factors', 'crucial', 'Brest', 'FLT3', 'contained', 'Executive', 'AF4', 'induced', 'GATA', 'G', 'localization', 'retrieved', 'University', 'CSF', 'mechanisms', 'possibility', 'leukemia', 'formation', 'YTH', 'chromosomal', 'generate', 'rearrangements', 'interaction', 'retained', 'RUNX3', 'Rungis', 'could', 'interacted', 'Chan', 'paxillin', 'length', 'due', 'leading', 'consequence', 'retain', 'SMRT', '1p36', 'activators', 'frequent', 'first', 'embryo', 'ATG', 'major', 'already', 'Translocations', 'distal', 'possessing', 'coding', 'number', 'one', 'least', 'another', 'open', '8q22', 'MECOM', 'unknown', 'interact', 'bCML', 'stimulation', 'quantitative', 'regulation', 'myelomonocytic', 'molecular', 'peripheral', 'friend', 'HDACs', 'B', 'growth', 'adaptor', 'always', 'XX', 'part', 'signaling', 'TLE', 'chemotherapy', 'Gene', 'p53', 'translating', '19q13', 'b', 'target', 'showed', 'M4', 'likely', 'regulator', 'binds', 'patients', 'determined', 'negatively', 'aggressive', 'perspective', '20q', '20p', 'banded', 'dualcolor', 'CMML', 'hemopathies', 'close', 'abnormalities', 'apparently', 'dozen', 'BLK', 'acid', 'relapse', 'depending', 'transcription', 'latter', 'responsible', 'mechanism', 'also', 'causing', 'pharyngeal', 'peroxidase', 'chromosomes', 'Unlike', 'normal', 'arguing', 'codons', 'complexes', 'involving', 'eight', 'significant', 'breakpoint', 'phase', 'contain', 'transcript', 'Bone', 'probes', 'kb', 'evolutionarily', 'consequences', 'inhibitors', 'introns', 'subsequent', 'usual', 'normally', 'heterodimerization', '434P20', 'precise', 'probe', 'ending', 'discovered', 'expressed', 'malignant', 'ca', 'breakpoints', 'data', 'homology', 'partners', 'based', 'knowledge', 'untranslated', 'CBFA2T3', 'terminal', 'molecules', 'factor', 'translocated', 'dominant', 'histone', 'Whether', '184L6', 'de', 'stop', 'assumed', 'UTRs', 'abnormality', 'q33', 'q32', 'corepressor', 'made', 'q37', 'q36', 'q35', 'q34', 'X', 'HAT', 'recurrently', 'peptidase', 'superfamily', 'runt', 'covering', 'approach', 'seldom', 'colocalization', 'remove', 'summary', 'RUNX2', 'subtype', 'RUNX1', 'common', 'activity', 'implication', 'leukemogenic', 'often', 'achieved', 'frame', 'France', 'observed', 'culture', 'see', 'full', 'p21', 'result', 'p23', 'p22', 'VWRPY', 'arm', 'best', 'PR', 'MDS1', 'probable', 'nuclear', 'Compilation', 'currently', 'AFF3', 'artificial', 'USP42', 'Nguyen', 'trans', 'future', 'various', 'finger', 'Members', 'Clones', 'consecutive', 'reading', 'q13q22', 'available', 'Src', 'C', 'ability', 'importance', 'suppresses', 'however', 'key', 'rarer', 'analyzed', 'WIG1', 'concentrated', 'Among', 'cryptic', 'last', 'q14', 'many', 'region', 'termed', 'according', 'Chronic', 'transplantation', 'present', 'focused', 'expression', 'p300', 'among', 'presumably', 'regulates', 'splice', 'cancer', 'point', 'color', 'FISH', 'others', 'codon', 'spans', 'sequences', 'dependent', 'transcriptional', 'ETO', 'amino', 'better', 'unusual', 'fused', 'neurogenesis', 'leads', 'capable', 'addition', 'intracellular', 'three', 'TCRA', 'fuses', 'initiation', 'expected', 'totaling', 'Acute', 'generating', 'PRDX4', 'osteogenesis', 'Although', 'LAF4', 'search', 'N', 'understand', 'suspected', 'mapped', 'signing', 'standard', 'case', 'hematopoiesis', 'exception', 'novel', 'subtelomeric', 'single', 'promoter', 'n', 'replaced', 'deregulation', 'wild', 'phenotypic', 'transcripts', 'exerts', 'RNA', 'q21q22', 'seven', 'studied', 'cont', 'almost', 'Furthermore', 'Raimondi', 'thus', 'site', 'surface', 'cluster', 'RUNX1T1', 'dual', 'inhibiting', 'partner', 'protein', 'FMR2', 'situ', 'Translocation', 'Unfortunately', '245C23', 'containing', 'Complete', 'recruitingThe', 'recurrent', 'preferentially', 'member', 'complex', 'split', '1q21', 'FAB', 'products', 'several', 'higher', 'development', 'literature', 'Runt', 'promoters', 'upon', 'effect', 'Myelodysplastic', 'driven', 'domains', 'fuse', 'IL', 'frequently', 'CoR', 'shortly', 'p11', 'Almost', 'well', 'analysis', 'antiapoptotic', 'without', 'Table', 'effects', 'myeloblastic', 'CBFA2', 'M5b', 'Given', 'spliced', 'identify', 'proximal', 'ribosomal', 'differentiation', 'useful', 'Molecular', 'yet', 'previous', 'RPL22L1', 'mimicking', 'classification', 'candidate', 'majority', 'alternate', 'except', 'confirmed', 'signals', 'appeared', 'EVI1', 'affinity', 'versa', 'role', 'extensive', 'MYB', 'remaining', 'hybridization', 'real', 'STAT3', 'definitive', 'putative', 'bands', 'showing', 'possible', 'early', 'PEBP2b', 'fusion', 'inhibition', 'five', 'premature', 'disrupted', 'EAP', 'using', 'variants', 'apart', 'mRNA', 'acute', 'like', 'identifying', 'alternatively', 'signal', 'represses', '12p13', 'specific', 'ectropic', 'Insertion', 'nonfunctional', 'rearrangement', '4q31', 'essential', 'old', 'RUNX1a', 'RUNX1b', 'truncated', 'translational', 'q21', 'q22', 'q23', 'q24', 'q25', 'q26', 'q27', 'q28', 'altered', 'organizing', 'viral', 'rarely', 'AML', 'lead', 'demonstrates', 'NOTCH1', 'However', 'rhino', 'contributing', 'deacetylase', 'masked', 'HDAC', 'AMP', 'fragments', 'Conclusion', 'q11q22', 'myelodysplasia', 'core', 'patient', 'retains', 'Copine', 'stem', 'broken', 'although', 'p32', 'described', 'analyses', 'p36', '1973', 'p35', 'PRDM16', 'rare', 'MTG8', 'ETV6', 'yearold', 'recognized', 'distinct', 'myeloid', 'bone', 'act', 'ubiquitinspecific', 'nucleus', 'Still', 'protease', 'presence', 'previously', 'within', 'Two', 'two', 'chronic', 'involvement', '1990s', 'Five', 'theoretically', 'Hybridization', 'novo', 'terminated', 'TEL', 'coactivators', 'Normal', 'nucleotide', 'sequence', 'Upon', 'long', '1989', 'neoplasm', 'L22', 'EEN', 'way', 'Rowley', 'corepressors', 'wildtype', 'function', 'Since', 'DNA', 'elsewhere', 'form', 'Agerstam', 'receptors', '21q22', 'encodes', 'myeloproliferative', 'megabases', 'Partner', 'clones', 'true', 'TRPS1', 'portions', 'Truncated', 'cells', 'Drosophila', 'leukemogenesis', 'different', 'Groucho', 'owing', 'retaining', 'demonstrate', 'similar', 'fulllength', 'bearing', 'interval', 'associated', 'LPXN', 'classic', 'fusions', 'inv', 'contribute', 'al', 'ins', 'conserved', 'laboratory', 'accounting', 'chromosome', 'functions', 'CPNE8', 'translocations', 'us', 'whereas', 'p14', 'metaphases', 'coded', 'Chromosomal', 'PEBP2aB', 'Structure', 'poor', 'CtBP', 'generates', 'LRP16', 'translated', 'searched', 'includes', 'generated', 'Future', 'USP16', 'pathways', 'Identification', 'included', 'homolog', 'structure', 'insertions', 'vice', 'acids', 'Thus', 'myelodysplastic', 'therapy', 'portion', 'compete', 'presenting', 'BCL2', 'time', 'promotor'])
In [279]:
plot_Cloud(class1_wordclou)
<matplotlib.figure.Figure at 0x19fa6320>