Solved: counting the number of missing values in pandas

Pandas is a widely used open-source data manipulation library for Python. It provides the data structures and functions needed to manipulate and analyze large datasets efficiently. One common problem that data scientists and analysts face when working with pandas is handling missing values in the data. In this article, we will look at how to count the missing values in a pandas DataFrame using several techniques, with step-by-step explanations of the code and a closer look at the pandas functions involved.

Counting Missing Values in Pandas

First, we need to import the pandas library. If you haven't installed it yet, run the command `pip install pandas` in your terminal or command prompt.

import pandas as pd

Once the pandas library is imported, let's create a sample DataFrame with missing values, which we will use throughout this article to demonstrate different ways of counting them.

data = {
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA']
}

df = pd.DataFrame(data)

In this example, we have a DataFrame with three columns: Name, Age, and City. Four values are missing (one in Name, two in Age, and one in City), and we will find and count them in the next section.

Finding and Counting Missing Values using isnull() and sum()

The first way to count missing values in a pandas DataFrame is to use the isnull() function. This function returns a DataFrame of the same shape as the original, but filled with True or False values indicating whether the corresponding entry is missing (i.e., is None or NaN) or not.

missing_values = df.isnull()
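To make the boolean mask concrete, here is a minimal sketch that rebuilds the sample DataFrame from this article and inspects the result of isnull(). The variable name `mask` is only an illustrative choice.

```python
import pandas as pd

# Sample DataFrame from this article, with a few missing entries
df = pd.DataFrame({
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA'],
})

# Boolean DataFrame: True where the original entry is None or NaN
mask = df.isnull()

print(mask.shape == df.shape)   # the mask has the same shape as df
print(mask['Age'].tolist())     # [False, True, False, False, True]
```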

Now we have a DataFrame of the same shape, in which True marks the missing values. To count them, we can simply use the sum() function. Because sum() treats True as 1 and False as 0, applying it to this boolean DataFrame gives the number of missing values in each column.

count_missing_values = df.isnull().sum()

This gives us a pandas Series containing the number of missing values for each column in our DataFrame.
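As a quick check, the following sketch applies the per-column count to the sample DataFrame from this article. Filtering the result down to columns that actually contain missing values is an optional extra step, and the names `counts` and `only_missing` are illustrative.

```python
import pandas as pd

# Sample DataFrame from this article
df = pd.DataFrame({
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA'],
})

# Series indexed by column name, holding the missing-value count per column
counts = df.isnull().sum()
print(counts.to_dict())   # {'Name': 1, 'Age': 2, 'City': 1}

# Optional: keep only the columns that have at least one missing value
only_missing = counts[counts > 0]
```

On a wide DataFrame, the filtered Series is often easier to read than the full list of columns.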

Alternative method: Using isna() and sum()

Another way to count missing values in a pandas DataFrame is to use the isna() function. The two names are aliases (the pandas documentation describes isnull() as an alias of isna()), so they work in exactly the same way.

count_missing_values = df.isna().sum()

This gives the same result as the previous method: the number of missing values in each column of our DataFrame.
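The equivalence of the two functions can be checked directly. This is a small sketch on the sample DataFrame from this article, using the illustrative names `same_mask` and `same_counts`.

```python
import pandas as pd

# Sample DataFrame from this article
df = pd.DataFrame({
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA'],
})

# The boolean masks produced by isna() and isnull() are identical
same_mask = df.isna().equals(df.isnull())

# ...and so are the per-column counts derived from them
same_counts = df.isna().sum().equals(df.isnull().sum())

print(same_mask, same_counts)   # True True
```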

Counting Missing Values in the Entire DataFrame

If we want the total number of missing values in the entire DataFrame, we can simply chain another sum() call after the first one. The first sum() produces the per-column counts, and the second adds those counts together.

total_missing_values = df.isnull().sum().sum()

This returns the total number of missing values in the entire DataFrame.
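For the sample DataFrame in this article, the chained call can be cross-checked against a second calculation. The sketch below compares it with the total number of cells minus the non-missing cells reported by count(). This cross-check is an addition for illustration, not part of the method described above.

```python
import pandas as pd

# Sample DataFrame from this article
df = pd.DataFrame({
    'Name': ['Anna', 'Ben', 'Carla', None, 'Eva'],
    'Age': [25, None, 30, 35, None],
    'City': ['NY', 'LA', None, 'SF', 'LA'],
})

# Chained sums: per-column counts first, then their total
total_chained = int(df.isnull().sum().sum())

# Cross-check: all cells (df.size) minus non-missing cells (df.count())
total_from_count = int(df.size - df.count().sum())

print(total_chained, total_from_count)   # 4 4
```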

In summary, handling missing values in pandas is an essential step in the data cleaning and pre-processing phase. By using the isnull() or isna() function together with sum(), we can count the missing values in our DataFrame efficiently, which makes it easier to deal with them in our analysis.
