Solved: use a dict to replace missing values in Pandas

In the world of data manipulation and analysis, handling missing values is an essential task. Pandas, a widely used Python library, lets us handle missing data efficiently. One common approach is to use a dictionary that maps each column to the value that should replace its missing entries. In this article, we look at how to use Pandas and Python dictionaries to replace missing values in a dataset.

Solution

The main solution we will explore is the fillna() method used together with a dictionary. This approach lets us replace the missing values in each column with the corresponding value from the dictionary we specify.

Step-by-step explanation of the code

To illustrate the process, suppose we have a dataset describing various fashion styles, including garments, colors, and historical context. Some entries in this dataset may be missing.

First, import the required library and create a sample DataFrame:

import pandas as pd

data = {
    'style': ['Grunge', 'Bohemian', 'Preppy', None, 'Punk', 'Casual'],
    'garments': ['Plaid shirt', None, 'Blazer', 'Maxi dress', 'Leather jacket', 'T-shirt'],
    'colors': ['Black', 'Faded', 'Light', 'Earthy', None, None]
}

df = pd.DataFrame(data)

Now that we have a DataFrame that demonstrates the problem, notice that some values are missing (marked with None). To replace them, define a dictionary that maps each column name to the value that should fill its gaps:

# fillna() expects a dict that maps each column name to its fill value.
# A nested dict such as {'style': {None: 'Unknown'}} is not applied as a
# None-to-value mapping, so the missing entries would stay unfilled.
replacement_dict = {
    'style': 'Unknown',
    'garments': 'Other',
    'colors': 'Various',
}

Finally, call fillna() with the dictionary to replace the missing values:

df_filled = df.fillna(replacement_dict)
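The steps above can be checked end to end with a short, self-contained sketch. It assumes the dictionary passed to fillna() maps each column name directly to its fill value; the name fill_values and the before/after counters are illustrative and not part of the original walkthrough.

```python
import pandas as pd

# Same sample data as in the walkthrough
data = {
    'style': ['Grunge', 'Bohemian', 'Preppy', None, 'Punk', 'Casual'],
    'garments': ['Plaid shirt', None, 'Blazer', 'Maxi dress', 'Leather jacket', 'T-shirt'],
    'colors': ['Black', 'Faded', 'Light', 'Earthy', None, None],
}
df = pd.DataFrame(data)

# Illustrative name: one fill value per column
fill_values = {'style': 'Unknown', 'garments': 'Other', 'colors': 'Various'}

# fillna() returns a new DataFrame; df itself is left unchanged
df_filled = df.fillna(fill_values)

# Count missing cells before and after to confirm the replacement
missing_before = int(df.isna().sum().sum())
missing_after = int(df_filled.isna().sum().sum())

print(df_filled)
print(f"missing before: {missing_before}, after: {missing_after}")
```

Columns that are absent from the dictionary are left untouched, so the same pattern can be used to fill only a subset of columns.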

Understanding the Pandas library

Pandas is a versatile Python library designed for data manipulation and analysis. It provides flexible, powerful data structures such as Series and DataFrame, which are essential for working efficiently with structured, tabular data.

Pandas offers a rich set of functions, such as fillna(), for handling missing data. Other tasks, such as merging data, pivoting data, and time series analysis, are also straightforward with Pandas.

Functions for handling missing data

In addition to fillna(), Pandas provides several other methods for dealing with missing data, such as:

  • dropna(): Remove rows or columns that contain missing data.
  • isna(): Determine which elements of a DataFrame or Series are missing.
  • notna(): Determine which elements of a DataFrame or Series are not missing.
  • interpolate(): Fill missing values using interpolation (linear by default).

These methods, together with fillna(), provide a comprehensive set of tools for handling missing data in a variety of situations.
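As a rough illustration of the methods listed above, the following sketch applies each one to a small DataFrame. The numeric price column and its values are hypothetical, added only so that interpolate() has numbers to work with.

```python
import pandas as pd

# Small example frame; 'price' is a hypothetical column for illustration
df = pd.DataFrame({
    'style': ['Grunge', None, 'Punk', 'Casual'],
    'price': [10.0, 20.0, None, 40.0],
})

# Boolean masks marking missing and non-missing cells
missing_mask = df.isna()
present_mask = df.notna()

# Keep only the rows that have no missing values
complete_rows = df.dropna()

# Fill the gap in the numeric column by linear interpolation
interpolated_price = df['price'].interpolate()

print(missing_mask)
print(complete_rows)
print(interpolated_price)
```

Here dropna() discards any row with at least one missing cell, while interpolate() estimates the missing price from its neighbours. Which one is appropriate depends on whether the incomplete rows can be dropped or need to be kept.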

In conclusion, this article showed how to use a dictionary to replace missing values in a Pandas DataFrame. The key method we used, fillna(), is a powerful tool in the Pandas library that lets us handle missing data cleanly. With a well-chosen dictionary, we can put suitable values in place of the missing ones, column by column, so that the dataset is complete and meaningful. With a solid understanding of the Pandas library and its built-in functions, we can work with large datasets efficiently and extract valuable insights from our data.
