Ixazululiwe: indlela yokuguqula igama libe inombolo kuma-python pandas

Emhlabeni wanamuhla, ukukhohlisa nokuhlaziya idatha sekuyingxenye ebalulekile yezimboni ezihlukahlukene. Umsebenzi owodwa onjalo ovame ukwenzeka ukuguqula amagama abe izinombolo kumasethi wedatha. Lesi sihloko sizoxoxa ngokuthi umtapo wezincwadi onamandla wePython, ama-panda, ungasetshenziswa kanjani ukwenza lo msebenzi ngempumelelo. Sizohlola izinyathelo, ikhodi, nemiqondo ehilelekile ekuxazululeni le nkinga, siqinisekise ukuthi uyayiqonda inqubo futhi ungayisebenzisa kalula.

Okokuqala, ake siqonde inkinga esihlose ukuyixazulula. Ake sithi unesethi yedatha enekholomu equkethe izinombolo ezibhalwe ngamagama, njengokuthi “enye,” “ezimbili,” “zintathu,” njalo njalo. Umgomo wethu uwukuguqula lezi zinombolo zamagama zibe ozakwabo baphelele sisebenzisa iPython namapanda.

Isinyathelo 1: Ukungenisa imitapo yolwazi edingekayo
Ukuze sifeze lo msebenzi, kufanele siqale singenise imitapo yolwazi edingekayo. Kulokhu, sizobe sisebenzisa umtapo wezincwadi we-pandas ukuphatha kanye nokukhohlisa idatha, futhi siguqule ukuguqula amagama abe izinombolo.

import pandas as pd
import inflect

umtapo we-pandas

I-pandas iyilabhulali yomthombo ovulekile yokukhohlisa nokuhlaziya ehlinzeka ngezakhiwo zedatha nemisebenzi edingekayo ukuze kuphathwe idatha ehlelekile. Yakhelwe phezu kolimi lohlelo lwePython futhi idlala indima ebalulekile ekucubunguleni kusengaphambili idatha, ekuhlanzeni, nasekuhlaziyeni. Ezinye zezinhlaka zayo zedatha eziyinhloko zifaka uchungechunge, i-DataFrame, kanye ne-Index, ezisiza ekubhekaneni nezinhlobo ezihlukahlukene zedatha nokusebenza.

faka umtapo wolwazi

i-inflect iwumtapo wezincwadi wePython osiza ekwenzeni ikhompuyutha ubuningi kanye namabizo ahlukene, ama- ordinals, kanye nokuguqula izinombolo zibe amagama noma amagama zibe izinombolo. Kulesi sihloko, sizogxila ekhonweni layo lokuguqula amagama abe izinombolo. Ukuze usebenzise i-inflect, uzodinga ukuyifaka usebenzisa umyalo olandelayo:

!pip install inflect

Isinyathelo sesi-2: Ukudala i-pandas DataFrame
Manje njengoba sesingenise imitapo yolwazi edingekayo, ake sakhe i-pandas DataFrame enekholomu equkethe izinombolo njengamagama. Lokhu kuzosebenza njengedathasethi yethu yesampula ngezinjongo zemifanekiso.

data = {'Numbers_in_words': ['one', 'two', 'three', 'four', 'five']}
df = pd.DataFrame(data)
print(df)

Isinyathelo sesi-3: Ukuguqula amagama abe izinombolo
Okulandelayo, sizosebenzisa ilabhulali ye-inflect ukuguqula izinombolo emagameni zibe izinombolo zazo eziphelele. Sizodala umsebenzi obizwa ngokuthi 'convert_word_to_number' othatha igama njengokufaka bese ubuyisela inombolo ehambisanayo.

def convert_word_to_number(word):
    p = inflect.engine()
    try:
        return p.singular_noun(word)
    except:
        return None

df['Numbers'] = df['Numbers_in_words'].apply(convert_word_to_number)
print(df)

Kula mazwibela ekhodi, sichaza umsebenzi osebenzisa injini ye-inflect ukuguqula amagama abe izinombolo. Sibe sesisebenzisa indlela ye-pandas apply() ukuze sisebenzise lo msebenzi kuyo yonke into yekholomu ethi 'Numbers_in_words' ku-DataFrame.

Ukufingqa, sibonile ukuthi iPython, i-pandas, ne-inflect ingasetshenziswa kanjani ukuguqula amagama abe izinombolo kudathasethi. I-Pandas isebenza njengethuluzi elibalulekile lokukhohlisa idatha, kuyilapho umtapo wezincwadi we-inflect usiza ekusebenzeni okubandakanya amagama nezinombolo. Ngokulandela lezi zinyathelo, ungakwazi ukuguqula kalula izinombolo zamagama zibe izinombolo kudathasethi yakho futhi uthuthuke uhlaziye futhi ulawule idatha yakho. Jabulela ukubhala ngekhodi!

Okuthunyelwe okuhlobene:

Shiya amazwana