Isonjululwe: ukufumana inani lamaxabiso angekhoyo kwiipanda

I-Pandas yilayibrari esetyenziswa ngokubanzi evulekileyo yokukhohlisa idatha yePython. Ibonelela ngolwakhiwo lwedatha kunye nemisebenzi efunekayo ukulawula ngokufanelekileyo kunye nokuhlalutya iiseti zedatha ezinkulu. Enye ingxaki eqhelekileyo yedatha izazinzulu kunye nabahlalutyi badibana ngelixa usebenzisa i-pandas ukuphatha amaxabiso alahlekileyo kwidathasethi. Kweli nqaku, siza kuphonononga indlela yokubala inani lamaxabiso alahlekileyo kwi-pandas DataFrame usebenzisa iindlela ezahlukeneyo, iinkcazo zenyathelo ngenyathelo lekhowudi, kwaye sijonge nzulu kwezinye iilayibrari kunye nemisebenzi ebandakanyekayo ekusombululeni le ngxaki.

Ukubala amaxabiso alahlekileyo kwiiPanda

Ukuqala, kufuneka singenise kuqala ithala leencwadi le-pandas. Ukuba awukayifaki, sebenzisa ngokulula umyalelo `ipip yokufaka ipandas` kwitheminali yakho okanye ukukhawuleza komyalelo.

import pandas as pd

Nje ukuba singenise ngaphandle ithala leencwadi le-pandas, masenze isampulu yeDathaFrame enamaxabiso angekhoyo, esiya kuwasebenzisa kulo lonke eli nqaku ukubonisa iindlela ezahlukeneyo zokubala amaxabiso angekhoyo.

data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}

df = pd.DataFrame(data)

Kulo mzekelo, sineDathaFrame enekholamu ezintathu: Igama, Ubudala, kunye neSixeko. Kukho amaxabiso alahlekileyo, esiya kuwafumana kwaye sibale kwicandelo elilandelayo.

Ukufumana nokubala amaxabiso alahlekileyo usebenzisa isnull() kunye ne sum()

Indlela yokuqala yokubala amaxabiso alahlekileyo kwi-pandas DataFrame kukusebenzisa i susa () umsebenzi. Lo msebenzi ubuyisela iSakhelo seDatha esimilo efanayo neyoqobo, kodwa ngeNyaniso okanye Ubuxoki amaxabiso abonisa ukuba ungeno oluhambelanayo alukho (okt, luqulathe Akukho okanye NaN) okanye hayi.

missing_values = df.isnull()

Ngoku sineDatha yeDatha emile ngendlela efanayo, enamaxabiso ayiNyaniso abonisa amangeniso angekhoyo. Ukubala la maxabiso angekhoyo, sinokusebenzisa ngokulula i isibambiso () umsebenzi. Ngokuyisebenzisa phezu kweDathaFrame, sinokufumana inani lamaxabiso alahlekileyo kwikholamu nganye.

count_missing_values = df.isnull().sum()

Oku kuya kusinika i-pandas Series kunye nenani lamaxabiso alahlekileyo kwikholamu nganye kwiDathaFrame yethu.

Enye indlela: Ukusebenzisa isna () kunye nesamba ()

Enye indlela yokubala amaxabiso alahlekileyo kwi-pandas DataFrame kukusebenzisa i isna() umsebenzi. Ligama elithi isnull() kwaye lisebenza ngendlela efanayo.

count_missing_values = df.isna().sum()

Oku kuya kunika umphumo ofanayo njengendlela yangaphambili, ukubala inani lamaxabiso alahlekileyo kwikholamu nganye kwiDathaFrame yethu.

Ukubala amaxabiso alahlekileyo kwiSakhelo seDatha sisonke

Ukuba sifuna ukufumana inani elipheleleyo lamaxabiso angekhoyo kuyo yonke i-DataFrame, singavele sidibanise enye isibambiso () umsebenzi emva kwesibalo sokuqala () umsebenzi.

total_missing_values = df.isnull().sum().sum()

Oku kuya kubuyisela inani lilonke lamaxabiso angekhoyo kuyo yonke i-DataFrame.

Isishwankathelo, ukuphatha amaxabiso alahlekileyo kwi-pandas linyathelo elibalulekileyo ekucoceni idatha kunye nesigaba sokucubungula kwangaphambili. Ngokusebenzisa isnull() okanye isna() imisebenzi, ngokudityaniswa ne sum() umsebenzi, singabala ngokufanelekileyo inani lamaxabiso angekhoyo kwiDataFrame yethu, senza kube lula ukulungisa nokulawula imiba yedatha engekhoyo kuhlalutyo lwethu.

Izithuba ezihambelanayo:

Shiya Comment