数据挖掘与数据分析数据可视化试题.docx
数据挖掘与数据分析,数据可视化试题1. DataMiningisalsoreferredtoasdataanalysisdatadiscoverydatarecoveryDatavisualization2. DataMiningisamethodandtechniqueinclusiveofdataanalysis.datadiscoveryDatavisualizationdatarecovery3. InwhichstepofDataScienceconsumeAlmost80%oftheworkperiodoftheprocedure.AccumulatingthedataAnalyzingthedataWranglingthedataRecapitulationoftheData4. WhichStepofDataScienceallowsthemodeltoconsistentlyimproveandprovidepunctualperformanceanddeliverapproximateresults.WranglingthedataAccumulatingthedataAnalyzingthedata5. WhichtoolofDataScienceisrobustmachinelearninglibrary,whichallowstheimplementationofdeeplearning7algorithms.STableauD3.jsApacheSparkTensorFlow6. WhatisthemainaimofDataMining?toobtaindatafromalessnumberofsourcesandtotransformitintoamoreusefulversionofitself.toobtaindatafromalessnumberofsourcesandtotransformitintoalessusefulversionofitself.toobtaindatafromagreatnumberofsourcesandtotransformitintoalessusefulversionofitself.toobtaindatafromagreatnumberofsourcesandtotransformitintoamoreusefulversionofitself.7. Inwhichstepofdataminingtheirrelevantpatternsareeliminatedtoavoidcluttering?CleaningthedataEvaluatingthedataConversionofthedataIntegrationofdata8. DataSciencetismainlyusedforpurposes.Dataminingismainlyusedforpurposes.scientific,businessbusiness,scientificscientific,scientificNone9. Pandasisaonedimensionallabeledarraycapableofholdingdataofanytype(integer,string,float,pythonobjects,etc.).SeriesFramePanelNone10. HowmanyprincipalcomponentsPandasDataFrameconsistsof?421311. Importantdatastructureofpandasis/areSeriesDataFrameBoth.Noneoftheabove12. Whichofthefollowingcommandisusedtoinstallpandas?pipinstallpandasinstallpandaspippandas13. Whichofthefollowingfunction/methodhelptocreateSeries?series()Series()(CreateSeries()Noneoftheabove14. NumPYstandsfor?NumberingPythonNumberInPythonNumericalPythonNoneOftheabove15. Whichofthefollowingisnotcorrectsub-packagesofSciPy?scipy.integratescipy.sourcescipy.interpolatescipy.signal16. HowtoimportConstantsPackageinSciPy?importscipy.constantsfromscipy.constantsimportscipy.constants.packagefromscipy.constants.package17. involveslookingatanddescribingthedatasetfromdifferentanglesandthensummarizingit?DataFrameDataVisualizationEDAiAlloftheabove18. whatinvolvesthepreparationofdatasetsforanalysisbyremovingirregularitiesinthedatasothattheseirregularitiesdonotaffectfurtherstepsintheprocessofdataanalysisandmachinelearningmodelbuilding?DataAnalysisEDA!DataFrameNoneoftheabove19. WhatisnotUtilityofEDA?MaximizetheinsightinthedatasetDetectoutliersandanomaliesVisualizationofdataTestunderlyingassumptions20. whatcanhamperthefurtherstepsinthemachinelearningmodelbuildingprocessIfnotperformedproperly?RecapitulationoftheDataAccumulatingthedataEDA(正确答案)Noneoftheabove21. WhichplotforEDAtocheckthedependencybetweentwovariables?HistogramsScatterplotsMapsTimeseriesplots22. Whatfunctionwilltellyouthetoprecordsinthedataset?shapehead(正确答案)showalloftheaboce23. whattypeofdataisusefulforinternalpolicymakingandbusinessstrategybuildingforanorganization?publicdataprivatedatabothNoneoftheabove24. Thefunctioncan“fillin"NAvalueswithnon-nulldata?headfillnashapealloftheabove25. Ifyouwanttosimplyexcludethemissingvalues,thenwhatfunctionalongwiththeaxisargumentwillbeuse?llnareplacedropnaisnull26. WhichofthefollowingattributeofDataFrameisusedtodisplaydatatypeofeachcolumninDataFrame?DtypesDTypesdlypesdatatypes27. WhichofthefollowingfunctionisusedtoloadthedatafromtheCSVfileintoaDataFrame?read.csv()readcsv()read_csv()(正确涔案)Read_csv()28. howtoDisplayfirstrowofdataframe'DF'?print(DF.head(1)print(DF0:1)print(DF.iloc0:1)Alloftheabove29. Spreadfunctionisknownasinspreadsheets?pivotunpivotcastorder30. extractasubsetofrowsfromadataframbasedonlogicalconditions?renamefiltersetsubset31. WccanshifttheDataFrame,sindexbyacertainnumberofperiodsusingtheMethod?melt()merge()tail()shift()俚确答案)32. WecanjoinmeltedDataFramesintooneAnalyticalBaseTableusingthefunction.join()append()merge()truncate()33. Whatmethosisusedtoconcatenatedatasetsalonganaxis?concatenate()Oz-*Ph'i、(正确答案)addOmerge()34. Rowscanbeifthenumberofmissingvaluesisinsignificant,asthiswouldnotimpacttheoverallanalysisresults.deletedupdatedaddedall35. Thereisaspecificreasonbehindthemissingvalue.WhatstandsforMissingnotatrandomMCARMARMNARNoneoftheabove36. Whileplottingdata,somevaluesofonevariablemaynotliebeyondtheexpectedrange,butwhenyouplotthedatawithsomeothervariable,thesevaluesmayliefarfromtheexpectedvalue.Identifythetypeofoutliers?UnivariateoutliersMultivariateoutliers1;)ManyVariateOutlinersNoneoftheabove37. ifnumericvaluesarestoredasstrings,thenitwouldnotbepossibletoCalculatemetricssu