




版權(quán)說明:本文檔由用戶提供并上傳,收益歸屬內(nèi)容提供方,若內(nèi)容存在侵權(quán),請(qǐng)進(jìn)行舉報(bào)或認(rèn)領(lǐng)
文檔簡(jiǎn)介
《大數(shù)據(jù)專業(yè)英語教程》(機(jī)械工業(yè)出版社)參考試卷命題人:張強(qiáng)華司愛俠參考試卷一、寫出以下單詞的中文意思(每小題0.5分,共10分)1accumulate11authentication2operation12malware3complexity13ransomware4filtering14vulnerability5leakage15process6engine16validity7recovery17interpretation8storage18classification9ensure19element10accumulate20executable二、根據(jù)給出的中文意思,寫出英文單詞(每小題0.5分,共10分)1n.元數(shù)據(jù)11n.并發(fā)(性)2n.特性;屬性12n.數(shù)據(jù)庫3n.服務(wù)器13adj.程序的,過程的4n.推薦引擎;推薦系統(tǒng)14n.倉庫;貯藏室5n.標(biāo)準(zhǔn),規(guī)格15vt.收集;采集6adj.定性的16n.聚集;集成;集結(jié)7n.登記,注冊(cè)17adj.跨平臺(tái)的8n.備份18vt.取得,獲得;實(shí)現(xiàn)9n.容量;性能19n.體系結(jié)構(gòu);(總體、層次)結(jié)構(gòu)10n.冗余;過多,過剩20v.擔(dān)保;確保n.保證;保修單1dataflow2datamart3datamining4datasharing5datadefinition6datastorage7datavisualization8operatingsystem9semi-structureddata10sampledata四、根據(jù)給出的中文意思,寫出英文短語(每小題1分,共10分)1非結(jié)構(gòu)化數(shù)據(jù)2層次數(shù)據(jù)模型,分級(jí)數(shù)據(jù)模型3文本分析4數(shù)據(jù)點(diǎn)5數(shù)據(jù)收集6自治數(shù)據(jù)庫7數(shù)據(jù)倉庫8混合云9機(jī)器學(xué)習(xí)10非關(guān)系數(shù)據(jù)庫五、寫出以下縮略語的完整形式和中文意思(每小題1分,共10分)縮略語完整形式中文意思1AI2BDF3CMS4API5DDL6DML7DQL8ELT9JVM10SLA六、閱讀短文,回答問題(每小題2分,共10分)TheImportanceofClusteringandClassificationinDataScienceThepurposeofclusteringandclassificationalgorithmsistomakesenseofandextractvaluefromlargesetsofstructuredandunstructureddata.Ifyou’reworkingwithhugevolumesofunstructureddata,itonlymakessensetotrytopartitionthedataintosomesortoflogicalgroupingsbeforeattemptingtoanalyzeit.Clusteringandclassificationallowsyoutotakeasweepingglanceofyourdataenmasse,andthenformsomelogicalstructuresbasedonwhatyoufindtherebeforegoingdeeperintothenuts-and-boltsanalysis.Intheirsimplestform,clustersaresetsofdatapointsthatsharesimilarattributes,andclusteringalgorithmsarethemethodsthatgroupthesedatapointsintodifferentclustersbasedontheirsimilarities.You’llseeclusteringalgorithmsusedfordiseaseclassificationinmedicalscience,butyou’llalsoseethemusedforcustomerclassificationinmarketingresearchandforenvironmentalhealthriskassessmentinenvironmentalengineering.Therearedifferentclusteringmethods,dependingonhowyouwantyourdatasettobedivided.Thetwomaintypesofclusteringalgorithmsare:Hierarchical:Algorithmscreateseparatesetsofnestedclusters,eachintheirownhierarchallevel.Partitional:Algorithmscreatejustasinglesetofclusters.Youmighthaveheardofclassificationandthoughtthatclassificationisthesamethingasclustering.Manypeopledo,butthisisnotthecase.Inclassification,beforeyoustart,youalreadyknowthenumberofclassesintowhichyourdatashouldbegroupedandyoualreadyknowwhatclassyouwanteachdatapointtobeassigned.Inclassification,thedatainthedatasetbeinglearnedfromislabeled.Whenyouuseclusteringalgorithms,ontheotherhand,youhavenopredefinedconceptforhowmanyclustersareappropriateforyourdata,andyourelyupontheclusteringalgorithmstosortandclusterthedatainthemostappropriateway.Withclusteringtechniques,you’relearningfromunlabeleddata.Tobetterillustratethenatureofclassification,though,takealookatTwitteranditshash-taggingsystem.Sayyoujustgotholdofyourfavoritedrinkintheentireworld:anicedcaramellattefromStarbucks.You’resohappytohaveyourdrinkthatyoudecidetotweetaboutitwithaphotoandthephrase“ThisisthebestlatteEVER!#StarbucksRocks.”Well,ofcourse,youinclude“#StarbucksRocks”inyourtweetsothatthetweetgoesintothe#StarbucksRocksstreamandisclassifiedtogetherwithalltheothertweetsthathavebeenlabeledas#StarbucksRocks.YouruseofthehashtaglabelinyourtweettoldTwitterhowtoclassifyyourdataintoarecognizableandaccessiblegroup,orcluster.Whatisthepurposeofclusteringandclassificationalgorithms?Whatareclustersintheirsimplestform?Whatareclusteringalgorithms?3.Howmanymaintypesofclusteringalgorithmsarethere?Whatarethey?4.Whatdoyoualreadyknowinclaasification?5.Howcanyoubetterillustratethenatureofclassification?將下列詞填入適當(dāng)?shù)奈恢茫吭~只用一次)。(每小題10分,共20分)填空題1供選擇的答案:uniquehierarchicalprocessesincludeacceptableinvolvesaccuracyhackerslinkedissuesTypesofDataIntegrityTherearetwotypesofdataintegrity:physicalintegrityandlogicalintegrity.Bothareacollectionofprocessesandmethodsthatenforcedataintegrityinboth___1___andrelationaldatabases.PhysicalintegrityPhysicalintegrityistheprotectionofdata’swholenessand___2___asit’sstoredandretrieved.Whennaturaldisastersstrike,powergoesout,orhackersdisruptdatabasefunctions,physicalintegrityiscompromised.Humanerror,storageerosion,andahostofother___3___canalsomakeitimpossiblefordataprocessingmanagers,systemprogrammers,applicationsprogrammers,andinternalauditorstoobtainaccuratedata.LogicalintegrityLogicalintegritykeepsdataunchangedasit’susedindifferentwaysinarelationaldatabase.Logicalintegrityprotectsdatafromhumanerrorand___4___aswell,butinamuchdifferentwaythanphysicalintegritydoes.Therearefourtypesoflogicalintegrity.2.1EntityintegrityEntityintegrityreliesonthecreationofprimarykeys,or___5___valuesthatidentifypiecesofdata,toensurethatdataisn’tlistedmorethanonceandthatnofieldinatableisnull.It’safeatureofrelationalsystemswhichstoredataintablesthatcanbe___6___andusedinavarietyofways.2.2ReferentialintegrityReferentialintegrityreferstotheseriesof___7___thatmakesuredataisstoredanduseduniformly.Rulesembeddedintothedatabase’sstructureabouthowforeignkeysareusedensurethatonlyappropriatechanges,additions,ordeletionsofdataoccur.Rulesmay___8___constraintsthateliminatetheentryofduplicatedata,guaranteethatdataisaccurate,and/ordisallowtheentryofdatathatdoesn’tapply.2.3DomainintegrityDomainintegrityisthecollectionofprocessesthatensuretheaccuracyofeachpieceofdatainadomain.Inthiscontext,adomainisasetof___9___valuesthatacolumnisallowedtocontain.Itcanincludeconstraintsandothermeasuresthatlimittheformat,type,andamountofdataentered.2.4User-definedintegrityUser-definedintegrity___10___therulesandconstraintscreatedbytheusertofittheirparticularneeds.Sometimesentity,referential,anddomainintegrityaren’tenoughtosafeguarddata.Often,specificbusinessrulesmustbetakenintoaccountandincorporatedintodataintegritymeasures.填空題2供選擇的答案:programsarchitecturelayerhandlingcreatecenterinfrastructurenetworksstoragemachinesBigDataCloudReferenceArchitectureThecloudarchitectureforbigdataisefficienttomanagecomplicatedcomputingscalability,storage,andnetworkinginfrastructure.Theinfrastructureasserviceprovidersmainlydealswithservers,___1___,inadditiontostorageapplicationsandoffersfacilitiessuchasvirtualization,basicmonitoringandsafety,operatingsystem,serverinadata___2___,andstorageservices.Thefourlayersofbigdatacloudarchitecturearediscussedbelow:BigDataAnalytics-SoftwareasaService(BDA-SaaS):Theanalyticsofbigdataofferedasservicegivesusersthecapabilitytoquicklyworkonanalyticswithoutspendingon___3___andpayforthefacilitiesused.Thefunctionsofthislayerare:?Arrangementofsoftwareapplicationsrepository?Software___4___deploymentontheinfrastructure?Resultdeliverytotheusers.BigDataAnalytics-PlatformasaService(BPaaS):Thisisthesecondlayerofthe___5___.Itisthecorelayerthatprovidesplatform-relatedservicestoworkwithstoredbigdataandcomputing.Datamanagementtools,schedulers,andprogrammingenvironmentsfordata-intensiveanddataprocessingtasks,whichareconsideredasmiddlewaremanagementtoolsresideinthisregion.This___6___responsiblefordevelopingsoftwaredevelopmentkitsandtoolsnecessaryforanalytics.BigDataFabric(BDF):Thisisthefabriclayerofbigdata,responsibleforaddressingtoolsandAPIsthatsupportthe___7___ofdata,datacomputation,andaccesstodifferentapplicationservices.ThislayercomprisesAPIsandinteroperableprotocoldesignedtoconnectthespecifiedmultiplecloudinfrastructuralstandards.CloudInfrastructure(CI):Thecloudinfrastructureisresponsiblefor___8___theinfrastructurefordatastorageandcomputationasservices.TheservicesofferedbyCIlayerareasfollows:●Tocreatelarge-scaleelasticinfrastructureforbigdatastorage,capableofon-demanddeployment.●Tosetupdynamicvirtual___9___.●Togenerateson-demandstoragefacilitiesthatrelatetobigdatamanagementforfile,block,andobject-based.●Toenableseamlesspassageofdataacrossthestoragerepositories.●To___10___virtualmachinesandtomountthefilesystemwiththecomputenode.短文翻譯(每小題10分,共20分)翻譯題1DataCleaningWhatisdatacleaning?Datacleaningistheprocessoffixingorremovingincorrect,corrupted,incorrectlyformatted,duplicate,orincompletedatawithinadataset.Datacleaning,whichisalsoreferredtoasdatacleansinganddatascrubbing,isoneofthemostimportantstepsforyourorganizationifyouwanttocreateaculturearoundqualitydatadecision-making.Datacleaningisnotsimplyabouterasinginformationtomakespacefornewdata,butratherfindingawaytomaximizeadataset’saccuracywithoutnecessarilydeletinginformation.Datacleaningincludesmoreactionsthanremovingdata,suchasfixingspellingandsyntaxerrors,standardizingdatasets,andcorrectingmistakessuchasemptyfields,missingcodes,andidentifyingduplicatedatapoints.Mostimportantly,thegoalofdatacleaningistocreatedatasetsthatarestandardizedanduniformtoallowbusinessintelligenceanddataanalyticstoolstoeasilyaccessandfindtherightdataforeachquery.Whatisthedifferencebetweendatacleaninganddatatransformation?Datacleaningistheprocessthatremovesdatathatdoesnotbelonginyourdataset.Datatransformationistheprocessofconvertingdatafromoneformatorstructureintoanother.Transformationprocessescanalsobereferredtoasdatawrangling,ordatamunging,transformingandmappingdatafromone"raw"dataformintoanotherformatforwarehousingandanalyzing.BenefitsofdatacleaningHavingcleandatawillultimatelyincreaseoverallproductivityandallowforthehighestqualityinformationinyourdecision-making.Thebenefitsinclude:●Removaloferrorswhenmultiplesourcesofdataareatplay.●Fewererrorsmakeforhappierclientsandless-frustratedemployees.●Abilitytomapthedifferentfunctionsandwhatyourdataisintendedtodo.●Monitoringerrorsandbetterreportingtoseewhereerrorsarecomingfrom,makingiteasiertofixincorrectorcorruptdataforfutureapplications.●Usingtoolsfordatacleaningwillmakeformoreefficientbusinesspracticesandquickerdecision-making.翻譯題2DataVisualization??Datavisualizationisthepracticeoftranslatinginformationintoavisualcontext,suchasamaporgraph,tomakedataeasierforthehumanbraintounderstandandpullinsightsfrom.Themaingoalofdatavisualizationistomakeiteasiertoidentifypatterns,trendsandoutliersinlargedatasets.Thetermisoftenusedinterchangeablywithothers,includinginformationgraphics,informationvisualizationandstatisticalgraphics.Datavisualizationisoneofthestepsofthedatascienceprocess,whichstatesthatafterdatahasbeencollected,processedandmodeled,itmustbevisualizedforconclusionstobemade.Datavisualizationisalsoanelementofthebroaderdatapresentationarchitecture(DPA)discipline,whichaimstoidentify,locate,manipulate,formatanddeliverdatainthemostefficientwaypossible.Datavisualizationisimportantforalmosteverycareer.Itcanbeusedbyteacherstodisplaystudenttestresults,bycomputerscientistsexploringadvancementsinartificialintelligence(AI)orbyexecutiveslookingtoshareinformationwithstakeholders.Italsoplaysanimportantroleinbigdataprojects.Asbusinessesaccumulatedmassivecollectionsofdataduringtheearlyyearsofthebigdatatrend,theyneededawaytoquicklyandeasilygetanoverviewoftheirdata.Visualizationtoolswereanaturalfit.Visualizationiscentraltoadvancedanalyticsforsimilarreasons.Whenadatascientistiswritingadvancedpredictiveanalyticsormachinelearning(ML)algorithms,itbecomesimportanttovisualizetheoutputstomonitorresultsandensurethatmodelsareperformingasintended.Thisisbecausevisualizationsofcomplexalgorithmsaregenerallyeasiertointerpretthannumericaloutputs.Datavisualizationprovidesaquickandeffectivewaytocommunicateinformationinauniversalmannerusingvisualinformation.Thepracticecanalsohelpbusinessesidentifywhichfactorsaffectcustomerbehavior;pinpointareasthatneedtobeimprovedorneedmoreattention;makedatamorememorableforstakeholders;understandwhenandwheretoplacespecificproducts;andpredictsalesvolumes.Otherbenefitsofdatavisualizationinclude:●theabilitytoabsorbinformationquickly,improveinsightsandmakefasterdecisions;●anincreasedunderstandingofthenextstepsthatmustbetakentoimprovetheorganization;●animprovedabilitytomaintaintheaudience'sinterestwithinformationtheycanunderstand;●aneasydistributionofinformationthatincreasestheopportunitytoshareinsightswitheveryoneinvolved;●eliminatingtheneedfordatascientistssincedataismoreaccessibleandunderstandable;and●anincreasedabilitytoactonfindingsquicklyand,therefore,achievesuccesswithgreaterspeedandlessmistakes.
參考試卷答案一、寫出以下單詞的中文意思(每小題0.5分,共10分)1accumulatev.堆積,積累11authenticationn.身份驗(yàn)證;認(rèn)證2operationn.操作;運(yùn)算12malwaren.惡意軟件,流氓軟件3complexityn.復(fù)雜性13ransomwaren.勒索軟件4filteringn.過濾14vulnerabilityn.弱點(diǎn);脆弱性5leakagen.漏出;泄露15processvt.加工;處理6enginen.引擎,發(fā)動(dòng)機(jī)16validityn.有效性,合法性7recoveryn.恢復(fù),復(fù)原17interpretationn.解釋,說明8storagen.貯存18classificationn.分類,歸類9ensurevt.確保19elementn.元素;要素;原理10accumulatev.堆積,積累20executableadj.可執(zhí)行的;實(shí)行的二、根據(jù)給出的中文意思,寫出英文單詞(每小題0.5分,共10分)1n.元數(shù)據(jù)metadata11n.并發(fā)(性)concurrency2n.特性;屬性property12n.數(shù)據(jù)庫database3n.服務(wù)器server13adj.程序的,過程的procedural4n.推薦引擎;推薦系統(tǒng)recommender14n.倉庫;貯藏室repository5n.標(biāo)準(zhǔn),規(guī)格standard15vt.收集;采集gather6adj.定性的qualitative16n.聚集;集成;集結(jié)aggregation7n.登記,注冊(cè)registration17adj.跨平臺(tái)的cross-platform8n.備份backup18vt.取得,獲得;實(shí)現(xiàn)achieve9n.容量;性能capacity19n.體系結(jié)構(gòu);(總體、層次)結(jié)構(gòu)architecture10n.冗余;過多,過剩redundancy20v.擔(dān)保;確保n.保證;保修單guarantee1dataflow數(shù)據(jù)流2datamart數(shù)據(jù)集市3datamining數(shù)據(jù)挖掘4datasharing數(shù)據(jù)共享5datadefinition數(shù)據(jù)定義6datastorage數(shù)據(jù)存儲(chǔ)7datavisualization數(shù)據(jù)可視化8operatingsystem操作系統(tǒng)9semi-structureddata半結(jié)構(gòu)化數(shù)據(jù)10sampledata樣本數(shù)據(jù)四、根據(jù)給出的中文意思,寫出英文短語(每小題1分,共10分)1非結(jié)構(gòu)化數(shù)據(jù)unstructureddata2層次數(shù)據(jù)模型,分級(jí)數(shù)據(jù)模型hierarchicaldatamodel3文本分析textanalysis4數(shù)據(jù)點(diǎn)datapoint5數(shù)據(jù)收集datacollection6自治數(shù)據(jù)庫autonomousdatabases7數(shù)據(jù)倉庫datawarehouse8混合云hybridcloud9機(jī)器學(xué)習(xí)machinelearning10非關(guān)系數(shù)據(jù)庫nonrelationaldatabase五、寫出以下縮略語的完整形式和中文意思(每小題1分,共10分)縮略語完整形式中文意思1AIArtificialIntelligence人工智能2BDFBigDataFabric大數(shù)據(jù)結(jié)構(gòu)3CMSContentManagementSystem內(nèi)容管理系統(tǒng)4APIApplicationProgrammingInterface應(yīng)用程序編程接口5DDLDataDefinitionLanguage數(shù)據(jù)定義語言6DMLDataManipulationLanguage數(shù)據(jù)操作語言7DQLDataQueryLanguage數(shù)據(jù)查詢語言8ELTExtract,Load,Transform提取、加載、轉(zhuǎn)換9JVMJavaVirtualMachineJava虛擬機(jī)10SLAServiceLevelAgreement服務(wù)等級(jí)協(xié)議,服務(wù)級(jí)別協(xié)議六、閱讀短文,回答問題(每小題2分,共10分)Thepurposeofclusteringandclassificationalgorithmsistomakesenseofandextractvaluefromlargesetsofstructuredandunstructureddata.Intheirsimplestform,clustersaresetsofdatapointsthatsharesimilarattributes,andclusteringalgorithmsarethemethodsthatgroupthesedatapointsintodifferentclustersbasedontheirsimilarities.Therearetwomaintypesofclusteringalgorithms.Theyarehierarchicalalgorithmsandpartitionalalgorithms.Inclassification,beforeyoustart,youalreadyknowthenumberofclassesintowhichyourdatashouldbegroupedandyoualreadyknowwhatclassyouwanteachdatapointtobeassigned.Tobetterillustratethenatureo
溫馨提示
- 1. 本站所有資源如無特殊說明,都需要本地電腦安裝OFFICE2007和PDF閱讀器。圖紙軟件為CAD,CAXA,PROE,UG,SolidWorks等.壓縮文件請(qǐng)下載最新的WinRAR軟件解壓。
- 2. 本站的文檔不包含任何第三方提供的附件圖紙等,如果需要附件,請(qǐng)聯(lián)系上傳者。文件的所有權(quán)益歸上傳用戶所有。
- 3. 本站RAR壓縮包中若帶圖紙,網(wǎng)頁內(nèi)容里面會(huì)有圖紙預(yù)覽,若沒有圖紙預(yù)覽就沒有圖紙。
- 4. 未經(jīng)權(quán)益所有人同意不得將文件中的內(nèi)容挪作商業(yè)或盈利用途。
- 5. 人人文庫網(wǎng)僅提供信息存儲(chǔ)空間,僅對(duì)用戶上傳內(nèi)容的表現(xiàn)方式做保護(hù)處理,對(duì)用戶上傳分享的文檔內(nèi)容本身不做任何修改或編輯,并不能對(duì)任何下載內(nèi)容負(fù)責(zé)。
- 6. 下載文件中如有侵權(quán)或不適當(dāng)內(nèi)容,請(qǐng)與我們聯(lián)系,我們立即糾正。
- 7. 本站不保證下載資源的準(zhǔn)確性、安全性和完整性, 同時(shí)也不承擔(dān)用戶因使用這些下載資源對(duì)自己和他人造成任何形式的傷害或損失。
最新文檔
- 2025至2030中國(guó)白色家電行業(yè)市場(chǎng)運(yùn)行分析及競(jìng)爭(zhēng)格局與投資方向報(bào)告
- 2025至2030中國(guó)男士商務(wù)正裝行業(yè)深度研究及發(fā)展前景投資評(píng)估分析
- 2025至2030中國(guó)用于食品和飲料的金屬罐行業(yè)產(chǎn)業(yè)運(yùn)行態(tài)勢(shì)及投資規(guī)劃深度研究報(bào)告
- 2025至2030中國(guó)玻璃門行業(yè)深度研究及發(fā)展前景投資評(píng)估分析
- 2025至2030中國(guó)玫瑰花露行業(yè)供需分析及發(fā)展前景報(bào)告
- 2025至2030中國(guó)物理治療軟件行業(yè)市場(chǎng)深度研究及發(fā)展前景投資可行性分析報(bào)告
- 商業(yè)培訓(xùn)中激勵(lì)措施的心理機(jī)制研究
- 商業(yè)環(huán)境中殘疾人餐具使用的培訓(xùn)與指導(dǎo)
- 招聘技巧培訓(xùn)課件
- 智能教育設(shè)備應(yīng)用中的隱私保護(hù)問題研究
- JCT1041-2007 混凝土裂縫用環(huán)氧樹脂灌漿材料
- SPA水療管理手冊(cè)
- 充電樁工程施工方案解決方案
- 7、煤礦安全管理二級(jí)質(zhì)量標(biāo)準(zhǔn)化驗(yàn)收標(biāo)準(zhǔn)
- USSF-美國(guó)太空部隊(duì)數(shù)字服務(wù)遠(yuǎn)景(英文)-2021.5-17正式版
- 靜配中心應(yīng)急預(yù)案處理流程
- 江蘇省射陽中等專業(yè)學(xué)校工作人員招聘考試真題2022
- 廣東英語中考必背1600詞
- 醫(yī)療器械銷售代表工作計(jì)劃工作總結(jié)述職報(bào)告PPT模板下載
- 壓力分散型預(yù)應(yīng)力錨索張拉計(jì)算書 附張拉表
評(píng)論
0/150
提交評(píng)論