Solved: add commas to a CSV in pandas

 

Working with CSV files is a common task in data manipulation and analysis. One problem that comes up often is the need to add commas to a CSV file so that the data fields are separated properly. In this article, we walk through how to add commas to a CSV file using Pandas, a powerful Python library. We give a step-by-step explanation of the code, followed by a closer look at the related libraries and functions involved. So let's dive in and make your data more organized and accessible!

Solution to the problem

To add commas to a CSV file, we can rely on the Pandas library, which makes CSV handling fast, clean, and efficient. The first step is to install Pandas if you don't already have it, which you can do by running the following command in your terminal:

pip install pandas

Once Pandas is installed, it's time to load your CSV file, add commas as needed, and create a new CSV file with the updated data.

Step-by-step explanation of the code

1. Start by importing the Pandas library:

import pandas as pd

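2. Load your CSV file using the pd.read_csv() function. Make sure you replace "input_file.csv" with the path to your own file. If the file uses a delimiter other than a comma, such as a semicolon or a tab, pass it through the sep parameter (for example sep=";"); otherwise the fields will not be split correctly.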

csv_data = pd.read_csv("input_file.csv")

3. Now that the CSV file is loaded into a Pandas DataFrame, you can manipulate it as needed. In this case, you want to add commas to separate the data fields. You can do this with the to_csv() method, which lets you specify the delimiter of the output CSV file.

csv_data.to_csv("output_file.csv", sep=",", index=False)

4. Finally, the updated CSV file is saved as "output_file.csv" with the appropriate commas added.
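The steps above can be put together in a short sketch. It assumes, purely for illustration, that the source data is semicolon-delimited. The sample data and the convert_delimiter helper are hypothetical and not part of Pandas, and an in-memory buffer stands in for the files on disk.

```python
import io

import pandas as pd


def convert_delimiter(source, in_sep=";", out_sep=","):
    """Read delimited text and return it rewritten with a new delimiter.

    `source` may be a path or any file-like object accepted by pd.read_csv.
    This helper is a hypothetical illustration, not a Pandas function.
    """
    df = pd.read_csv(source, sep=in_sep)
    # When no path is given, to_csv returns the CSV text as a string.
    return df.to_csv(sep=out_sep, index=False)


# Hypothetical sample data: semicolon-delimited, one field contains a comma.
raw = "name;city;age\nAlice;Durban;30\nBob;Cape Town, WC;25\n"

result = convert_delimiter(io.StringIO(raw))
print(result)
```

Note that a field which already contains a comma is wrapped in double quotes on output, so it is still read back as a single field.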

Now, let's look at some related concepts, libraries, and functions.

Pandas: The Powerhouse Library for Data Manipulation

Pandas is an open-source library that provides data manipulation and analysis tools for Python. Designed specifically for working with tabular data, it offers data structures such as Series and DataFrame for handling data efficiently. Pandas is built on top of other robust and efficient Python libraries such as NumPy, and it provides a high-level interface for working with data sources such as CSV files, Excel files, and SQL databases.

  • Pandas DataFrame: A DataFrame is a two-dimensional data structure whose columns can have different types. It is the primary data manipulation tool that Pandas offers and is designed to handle a wide variety of data formats.
  • Pandas Series: A Series is a one-dimensional labeled array that can hold any data type. It is designed to handle a single column of data and serves as the building block of a DataFrame.
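As a small illustration of how the two structures relate, the sketch below builds a DataFrame from hypothetical sample data and selects one of its columns, which comes back as a Series.

```python
import pandas as pd

# Hypothetical sample data, for illustration only.
df = pd.DataFrame({"name": ["Alice", "Bob"], "age": [30, 25]})

# Selecting a single column of a DataFrame yields a Series.
ages = df["age"]

print(type(df).__name__)    # DataFrame
print(type(ages).__name__)  # Series
print(df.shape)             # (2, 2): two rows, two columns
print(df.dtypes)            # each column carries its own type
```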

Python csv Module: An Alternative to Pandas

While Pandas makes it easy to work with CSV files for complex tasks, Python also ships with a built-in module called csv that provides functionality for reading from and writing to CSV files.

The main functions you can work with in the csv module are:

  • csv.reader: Returns an iterator over a CSV file that yields each row as a list of strings.
  • csv.writer: Returns a writer object that provides methods for writing rows to a CSV file.

Although it is not as powerful as Pandas, the csv module can be a suitable alternative for simple tasks that don't require advanced data manipulation, or when you don't want to add an external dependency to your project.
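For comparison, here is a minimal sketch of the same delimiter change using only the csv module. As before, it assumes a semicolon-delimited input. The convert_with_csv_module helper and the sample data are hypothetical, and in-memory buffers stand in for real files.

```python
import csv
import io


def convert_with_csv_module(src, dst, in_delim=";", out_delim=","):
    """Copy rows from src to dst, changing the field delimiter.

    Hypothetical helper for illustration; src and dst are text file objects.
    """
    reader = csv.reader(src, delimiter=in_delim)
    writer = csv.writer(dst, delimiter=out_delim, lineterminator="\n")
    for row in reader:  # each row is a list of strings
        writer.writerow(row)


# Hypothetical sample data: semicolon-delimited, one field contains a comma.
raw = "name;city;age\nAlice;Durban;30\nBob;Cape Town, WC;25\n"

out = io.StringIO()
convert_with_csv_module(io.StringIO(raw), out)
print(out.getvalue())
```

When working with real files rather than in-memory buffers, the csv documentation recommends opening them with newline="" so that line endings are handled by the module itself.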

In conclusion, adding commas to a CSV file is an important task in data manipulation and analysis. Using a powerful Python library such as Pandas makes the process simple, straightforward, and efficient. Pandas provides a wealth of features and methods that let you manipulate data effectively and seamlessly. Alternatively, for simpler tasks, Python's built-in csv module provides the tools needed to work with CSV files. Whichever approach you choose, working with well-organized data is key to successful data analysis and manipulation.
