Isonjululwe: indlela yokuguqula igama libe kwinombolo kwi-python pandas

Kwihlabathi lanamhlanje, ukusetyenziswa kwedatha kunye nohlalutyo kuye kwaba yinxalenye ebalulekileyo kumashishini ahlukeneyo. Omnye umsebenzi onjalo osoloko usenzeka kukuguqula amagama abe ngamanani kwiiseti zedatha. Eli nqaku liza kuxubusha indlela ilayibrari enamandla yePython, i-pandas, enokusetyenziswa ngayo ukwenza lo msebenzi ngokufanelekileyo. Siza kuphonononga amanyathelo, ikhowudi, kunye neekhonsepthi ezibandakanyekayo ekusombululeni le ngxaki, siqinisekisa ukuba uyayiqonda inkqubo kwaye unokuyisebenzisa ngokulula.

Ukuqala, masiqonde ingxaki esijonge ukuyicombulula. Khawufane ucinge ukuba uneseti yedatha enekholamu equlethe amanani abhalwe ngamagama, anje ngo “nye,” “zimbini,” “ntathu,” njalo njalo. Injongo yethu kukuguqula la manani amagama abe ngoogxa bawo kusetyenziswa iPython kunye neepanda.

Inyathelo 1: Ukungenisa ngaphandle iilayibrari eziyimfuneko
Ukufezekisa lo msebenzi, kufuneka kuqala singenise amathala eencwadi afunekayo. Kule meko, siya kusebenzisa ithala leencwadi le-pandas ukuphatha kunye nokukhohlisa idatha, kunye ne-inflect yokuguqula amagama kumanani.

import pandas as pd
import inflect

ilayibrari ye-pandas

I-pandas yilayibrari yedatha evulelekileyo kunye nohlalutyo olubonelela ngezakhiwo zedatha kunye nemisebenzi efunekayo yokuphatha idatha eyakhiweyo. Yakhelwe phezu kolwimi lwenkqubo yePython kwaye idlala indima ebalulekileyo ekulungisweni kwedatha, ukucoca, kunye nokuhlalutya. Ezinye zezakhiwo zayo eziphambili zedatha ziquka iSeries, DataFrame, kunye ne-Index, ezinceda ekujonganeni neentlobo ezahlukeneyo zedatha kunye nokusebenza.

ithala leencwadi

i-inflect lithala leencwadi lePython elinceda ekubaleni isininzi kunye nezibizo ezikwisinye, ii-ordinals, kunye nokuguqula amanani abe ngamagama okanye amagama abe ngamanani. Kweli nqaku, siza kugxila ekukwazini ukuguqula amagama kumanani. Ukusebenzisa i-inflect, kuya kufuneka uyifake usebenzisa lo myalelo ulandelayo:

!pip install inflect

Inyathelo 2: Ukwenza iPandas DataFrame
Ngoku sele singenise ngaphandle amathala eencwadi afunekayo, masenze iPandas DataFrame enekholamu equlethe amanani njengamagama. Oku kuya kusebenza njengesampulu yethu yedatha ngeenjongo zokuzekelisa.

data = {'Numbers_in_words': ['one', 'two', 'three', 'four', 'five']}
df = pd.DataFrame(data)
print(df)

Inyathelo 3: Ukuguqula amagama abe ngamanani
Okulandelayo, siya kusebenzisa ilayibrari ye-inflect ukuguqula amanani emagameni ukuya kumaqabane awo apheleleyo. Siza kudala umsebenzi othi 'convert_word_to_number' othatha igama njengegalelo kwaye ubuyisele inani elihambelanayo.

def convert_word_to_number(word):
    p = inflect.engine()
    try:
        return p.singular_noun(word)
    except:
        return None

df['Numbers'] = df['Numbers_in_words'].apply(convert_word_to_number)
print(df)

Kule khowudi yasishwankathelo, sichaza umsebenzi osebenzisa i-injini ye-inflect ukuguqula amagama abe ngamanani. Emva koko sisebenzise i-pandas apply() indlela yokufaka lo msebenzi kuyo yonke into yekholamu ethi 'Numbers_in_words' kwiDathaFrame.

Ukushwankathela, sibonile indlela iPython, i-pandas, kunye ne-inflect enokusetyenziswa ngayo ukuguqula amagama kumanani kwidathasethi. IiPanda zisebenza njengesixhobo esibalulekileyo sokuguqula idatha, ngelixa ithala leencwadi le-inflect linceda ekusebenzeni okubandakanya amagama kunye namanani. Ngokulandela la manyathelo, unokuguqula ngokulula amanani egama abe ziinombolo kwiiseti zakho zedatha kwaye uhlalutye ngakumbi kwaye ulawule idatha yakho. Ikhowudi yolonwabo!

Izithuba ezihambelanayo:

Shiya Comment