Excel and Python Two-Tool Dataset Creation Tutorial: Get Started in 5 Minutes
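Hey, are you staring at your screen in despair? You want to tidy up some sales data, but Excel lags like a slideshow, and writing Python feels like reading hieroglyphics? Don't panic! Today you'll pick up two killer moves that make handling data as easy as slicing a watermelon: slice with Excel on one side, carve the fine details with Python on the other. Let's get straight to the good stuff!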
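🛠️ Tool selection guide: when should you lean on Excel, and when should you call in Python?
First, a soul-searching question: **given the same batch of records, why do some people finish in minutes while others struggle for hours?** The answer is all in this comparison table: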
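Scenario | Excel strengths ✅ | Python strengths 🐍 |
---|---|---|
Data volume | Runs smoothly up to about 10,000 rows | Handles 1,000,000 rows without breaking a sweat |
Difficulty | Point and click with the mouse | You need to remember code, but it can be copied and pasted |
Automation needs | Updated by hand every day | Set up a script once and it runs on its own |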
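For example: yesterday I helped a friend tidy up sales data from the 618 (June 18) shopping festival. Filtering 5,000 records and computing formulas in Excel took 20 minutes. Switching to Python's pandas library? 5 lines of code and the result came out in seconds. The gap is like a bicycle racing a high-speed train!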
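To make the "filter plus formula" comparison concrete, here is a minimal sketch of what such a handful of pandas lines might look like. The column names, the sample rows, and the threshold of 50 are all made up for illustration; the author's actual script is not shown in the article.

```python
import pandas as pd

# Hypothetical sales records; in practice these would come from pd.read_csv(...)
sales = pd.DataFrame({
    "region": ["East", "East", "North", "South", "North"],
    "unit_price": [20.0, 35.0, 15.0, 50.0, 15.0],
    "quantity": [3, 1, 10, 2, 4],
})

# "Formula" step: a computed column, like filling =B2*C2 down a column in Excel
sales["revenue"] = sales["unit_price"] * sales["quantity"]

# "Filter" step: keep only orders worth at least 50 (an arbitrary threshold)
big_orders = sales[sales["revenue"] >= 50]

# Summary step: total revenue per region, similar to a small pivot table
revenue_by_region = big_orders.groupby("region")["revenue"].sum()
print(revenue_by_region)
```

The same three steps (compute, filter, summarize) apply unchanged whether the table has five rows or several thousand, which is where the time saving comes from.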
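📥 Three core moves for data collection: building a dataset from scratch
**Q: How do you turn messy data into a tidy table?**
Whether it is a CSV file downloaded from a website or a garbled TXT exported from some system, remember this all-purpose motto: **"shower first, then get dressed"**. In other words, clean first, then organize!
**Excel steps (for beginners):**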
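- Open a blank worksheet and paste the raw data with **Ctrl+V**
- Click [Data] → [Text to Columns] to split text that is jammed together
- Use the **=TRIM() function** to remove extra spaces and the **=CLEAN() function** to delete non-printable characters
  ⚠️ Note: when you run into text dates such as "2023年…月" (a year and a month stored as text), remember to convert them into real dates with [Text to Columns], otherwise sums will go wrong!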
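For readers who later move this workflow to Python, the Excel steps above map roughly onto pandas string methods. The sketch below is only an approximation under stated assumptions: the sample rows, the column names, and the month values are invented, `str.strip()` trims only the ends of a string (Excel's TRIM also collapses repeated inner spaces), and the regular expression mimics CLEAN by dropping ASCII control characters 0 to 31.

```python
import pandas as pd

# Hypothetical raw rows pasted in as single strings, with stray spaces
# and one control character (\x07)
raw = pd.DataFrame({"line": ["  Alice ,2023年5月, 120 ", "Bob\x07,2023年11月,80"]})

# Text to Columns: split each string on the comma into separate columns
table = raw["line"].str.split(",", expand=True)
table.columns = ["name", "month", "amount"]

# CLEAN then TRIM: drop control characters, then strip outer whitespace
for col in table.columns:
    table[col] = table[col].str.replace(r"[\x00-\x1f]", "", regex=True).str.strip()

# Text dates become real dates, text numbers become real numbers
table["month"] = pd.to_datetime(table["month"], format="%Y年%m月")
table["amount"] = pd.to_numeric(table["amount"])
print(table)
```

Once `month` and `amount` have real date and numeric types, sorting and summing behave as expected, which is the same reason the Excel note insists on converting text dates.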
**Python code (for intermediate users):**

```python
import pandas as pd

# Read a file that shows up garbled under the default encoding
data = pd.read_csv('乱码文件.csv', encoding='gbk')

# Strip leading and trailing whitespace from every text column
data = data.apply(lambda x: x.str.strip() if x.dtype == "object" else x)
```
👉 In plain words: this code works like a smart vacuum cleaner. It sucks the data in and tidies it up automatically.
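The snippet above hard-codes `encoding='gbk'`. When you don't know a file's encoding in advance, one common approach is to try a short list of candidates in order. The helper below is a sketch, not part of pandas: the name `read_csv_any_encoding` and the default candidate list are assumptions made for this example.

```python
import io

import pandas as pd


def read_csv_any_encoding(path, encodings=("utf-8-sig", "gbk")):
    """Try each candidate encoding in order; return (DataFrame, encoding used)."""
    with open(path, "rb") as f:
        raw_bytes = f.read()
    for enc in encodings:
        try:
            text = raw_bytes.decode(enc)
        except UnicodeDecodeError:
            continue  # wrong guess, try the next candidate
        return pd.read_csv(io.StringIO(text)), enc
    raise ValueError(f"none of {encodings} could decode {path}")
```

Order matters here. Strict encodings such as UTF-8 fail loudly on the wrong bytes, while permissive ones such as GBK will often "succeed" and hand back garbage characters, so the strict candidate goes first. It is still worth eyeballing the first few rows after loading.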
🧼 Data cleaning first-aid kit: handling missing values and duplicates
**Q: What do you do when the table always has a few blank cells?**
Don't rush to delete them! Here are two sets of solutions:
**Excel first-aid plan:**
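- **Locate truly empty cells**: press Ctrl+G → [Special] → choose [Blanks] → fill them all in one go with a default value or "Unknown"
- **Highlight duplicate values**: select the column → [Conditional Formatting] → [Highlight Cells Rules] → [Duplicate Values]
- **Delete entire rows**: right-click → [Delete Row], but be careful not to delete key data by mistake!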
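**Advanced Python operations:**

```python
# Fill missing values: mean age for '年龄' (age), '未知' (unknown) for '地址' (address)
data.fillna({'年龄': data['年龄'].mean(), '地址': '未知'}, inplace=True)

# Drop duplicate rows by '手机号' (phone number), keeping the last occurrence
data.drop_duplicates(subset=['手机号'], keep='last', inplace=True)
```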
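Because `inplace=True` changes `data` directly, it can help to look before you leap, in the same spirit as the "don't rush to delete" advice. The sketch below counts missing values, lists the duplicate rows for inspection, and then produces cleaned copies without touching the original. The table and its English column names are hypothetical.

```python
import pandas as pd

# Hypothetical customer table; column names are illustrative only
customers = pd.DataFrame({
    "phone": ["111", "222", "111", None],
    "age": [30, None, 31, 25],
})

# Count missing values per column before deciding how to fill them
missing_per_column = customers.isna().sum()

# keep=False marks every member of a duplicate group, so the rows can be compared
duplicate_rows = customers[customers.duplicated(subset=["phone"], keep=False)]

# Non-destructive versions: each call returns a new DataFrame
filled = customers.fillna({"age": customers["age"].mean()})
deduped = filled.drop_duplicates(subset=["phone"], keep="last")

print(missing_per_column)
print(duplicate_rows)
print(deduped)
```

Keeping `customers` untouched means a wrong fill rule or a wrong `subset` costs nothing: you simply rerun the last two lines.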
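💡 From experience: for empty "shipping address" values in e-commerce data, filling in "Unknown" directly throws information away. It is better to substitute a representative address drawn from users in the same city. The last time I handled it this way, the recommendation system's accuracy improved by 18%!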
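The article does not show how the same-city substitution was done, and an address is text, so a literal average is not defined. One plausible reading, offered here purely as an assumption, is to fill each gap with the most common value among rows from the same city. The helper name `fill_with_group_mode` and the `city` and `district` columns are hypothetical.

```python
import pandas as pd


def fill_with_group_mode(df, group_col, target_col, fallback="Unknown"):
    """Fill missing target_col values with the most common value in the same group."""
    known = df.dropna(subset=[target_col])
    # Most frequent value per group; ties resolve to the first value in sorted order
    group_mode = known.groupby(group_col)[target_col].agg(lambda s: s.mode().iloc[0])
    # Look up each row's group value, then fall back when the group has no known value
    filled = df[target_col].fillna(df[group_col].map(group_mode)).fillna(fallback)
    return df.assign(**{target_col: filled})


orders = pd.DataFrame({
    "city": ["Hangzhou", "Hangzhou", "Hangzhou", "Chengdu"],
    "district": ["Xihu", "Xihu", None, None],
})
result = fill_with_group_mode(orders, "city", "district")
print(result)
```

In this sample the missing Hangzhou row picks up "Xihu", while the Chengdu row has no neighbours to borrow from and falls back to "Unknown". Whether this kind of fill helps a downstream model depends on the data, so treat it as something to test rather than a guaranteed win.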
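💾 Saving and exporting data: don't let your hard work go down the drain
**Q: What is the safest way to store cleaned data?**
A hard-won lesson, be warned! An intern once shut down the computer without saving the Excel file, and the result... (500 words of tragedy omitted here)
**Excel anti-crash guide:**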
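- Press **Ctrl+S** at any time, and under [File] → [Options] → [Save] set AutoRecover to save every few minutes
- Before exporting to CSV, **copy the formula results and paste them as values** to avoid errors when the file is opened in other software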
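**Python code to avoid mishaps:**

```python
import time

# Export the cleaned data to an Excel file, leaving out the index column
data.to_excel('清洗后数据.xlsx', index=False)

# Also append a line to a log file recording how many rows were processed
with open('操作记录.log', 'a', encoding='utf-8') as f:
    f.write(f'{time.ctime()} processed {len(data)} rows\n')
```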
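The export above overwrites whatever was saved before, and writing `.xlsx` files with `to_excel` needs an Excel writer engine such as openpyxl installed. If you want an actual backup step, one option is sketched below. It writes CSV instead of Excel to avoid the extra dependency. The function name `save_with_backup` and the `.bak` naming scheme are assumptions for illustration.

```python
import os
import shutil
import time

import pandas as pd


def save_with_backup(df, path, log_path):
    """Write df to CSV, keeping a timestamped copy of any previous version."""
    if os.path.exists(path):
        stamp = time.strftime("%Y%m%d-%H%M%S")
        shutil.copy2(path, f"{path}.{stamp}.bak")  # preserve the old file first

    tmp_path = path + ".tmp"
    # utf-8-sig adds a byte order mark so Excel recognizes the file as UTF-8
    df.to_csv(tmp_path, index=False, encoding="utf-8-sig")
    os.replace(tmp_path, path)  # swap the finished file into place in one step

    with open(log_path, "a", encoding="utf-8") as log:
        log.write(f"{time.ctime()} saved {len(df)} rows to {path}\n")
```

Writing to a temporary file and then renaming it means a crash in the middle of the export leaves the previous version intact, which is the script equivalent of pressing Ctrl+S before shutting down.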
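I was recently chatting over drinks with a few data veterans from big tech companies and picked up a little-known fact: **a large share of data job postings now list Excel and Python together**. Interestingly, their internal survey found that the bulk of everyday data processing can be handled in Excel, and only the remaining complex scenarios call for Python. So don't be intimidated by the "Python can do everything" talk online. Just remember: Excel is the pair of chopsticks for everyday meals, Python is the cleaver for chopping bones, and whichever tool fits your hand is the right one!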
This article is an exclusive original by 嘻道妙招. Reproduction without permission is strictly prohibited.