Solved: Counting the Number of Missing Values in pandas

pandas is a widely used open-source data manipulation library for Python. It provides the data structures and functions needed to work with and analyze large datasets efficiently. One common problem that data scientists and analysts face when using pandas is handling missing values in a dataset. In this article, we will explore how to count the number of missing values in a pandas DataFrame using several techniques, with step-by-step explanations of the code, and take a closer look at the functions involved in solving this problem.

Counting Missing Values in pandas

To begin, we first need to import the pandas library. If you have not installed it yet, run `pip install pandas` in your terminal or command prompt.

import pandas as pd

With pandas imported, let's create a sample DataFrame that contains missing values. We will use it throughout this article to demonstrate the different techniques for counting missing values.

data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}

df = pd.DataFrame(data)

In this example, we have a DataFrame with three columns: Name, Age, and City. Four entries are missing (one in Name, two in Age, and one in City), and we will find and count them in the next section.

Finding and Counting Missing Values Using isnull() and sum()

The first way to count missing values in a pandas DataFrame is to use the isnull() method. It returns a DataFrame with the same shape as the original, but filled with True or False values indicating whether the corresponding entry is missing (that is, it holds None or NaN) or not.

missing_values = df.isnull()
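To make the mask concrete, here is a minimal, self-contained sketch that rebuilds the sample DataFrame from this article and inspects the result of isnull(). It is only an illustration of the step above, not the only way to examine the mask.

```python
import pandas as pd

# Rebuild the sample DataFrame used in this article
data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}
df = pd.DataFrame(data)

# Boolean mask: True where the original entry is missing
missing_values = df.isnull()

# The mask has the same shape as the original DataFrame
print(missing_values.shape == df.shape)  # True

# Show the full mask of True/False values
print(missing_values)
```

In the printed mask, the True cells line up with the None entries in the sample data: row 3 of Name, rows 1 and 4 of Age, and row 2 of City.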

We now have a DataFrame of the same shape in which True marks each missing entry. To count these missing values, we can simply call the sum() method. Because True counts as 1 and False as 0, applying sum() to this DataFrame gives the number of missing values in each column.

count_missing_values = df.isnull().sum()

This gives us a pandas Series holding the number of missing values for each column in our DataFrame.
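As a worked example, the sketch below applies isnull().sum() to the sample data and reads the per-column counts back out. The final filtering step, which keeps only the columns that have at least one missing value, is an optional extra added here for illustration.

```python
import pandas as pd

# Rebuild the sample DataFrame used in this article
data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}
df = pd.DataFrame(data)

# Series indexed by column name, holding the count of missing entries
count_missing_values = df.isnull().sum()

print(count_missing_values['Name'])  # 1
print(count_missing_values['Age'])   # 2
print(count_missing_values['City'])  # 1

# Optional: keep only the columns that contain missing values
columns_with_missing = count_missing_values[count_missing_values > 0]
```

Because every column in the sample has at least one gap, the filtered Series here still contains all three columns.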

Alternative: Using isna() and sum()

Another way to count missing values in a pandas DataFrame is to use the isna() method. The two names are aliases, so isna() behaves exactly like isnull().

count_missing_values = df.isna().sum()

This produces the same result as the previous method: the number of missing values in each column of our DataFrame.
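A quick way to convince yourself that the two methods agree is to compare their outputs directly. The sketch below does this on the sample data using the equals() method; the variable names are illustrative.

```python
import pandas as pd

# Rebuild the sample DataFrame used in this article
data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}
df = pd.DataFrame(data)

# The boolean masks from both methods are identical
masks_match = df.isna().equals(df.isnull())
print(masks_match)  # True

# So the per-column counts are identical as well
counts_match = df.isna().sum().equals(df.isnull().sum())
print(counts_match)  # True
```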

Counting Missing Values in the Entire DataFrame

If we want the total number of missing values in the whole DataFrame, we can chain a second sum() call after the first one. The first sum() adds up each column, and the second adds those per-column counts together.

total_missing_values = df.isnull().sum().sum()

This returns the total number of missing values in the entire DataFrame.
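For the sample data, the chained call can be checked by hand: the columns contain 1, 2, and 1 missing values, so the total should be 4. The sketch below confirms this and, as an added cross-check that is not part of the method described above, compares it with the number of cells minus the number of non-missing cells reported by count().

```python
import pandas as pd

# Rebuild the sample DataFrame used in this article
data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}
df = pd.DataFrame(data)

# First sum(): per-column counts; second sum(): grand total
total_missing_values = df.isnull().sum().sum()
print(total_missing_values)  # 4

# Cross-check: total cells minus non-missing cells
# df.size is rows * columns (15 here); count() returns non-missing entries per column
cross_check = df.size - df.count().sum()
print(cross_check == total_missing_values)  # True
```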

In short, handling missing values in pandas is an important step in the data cleaning and preprocessing stage. By using isnull() or isna() together with sum(), we can efficiently count the missing values in our DataFrame, which makes it easier to deal with missing-data problems in our analysis.
