Solved: how to convert words to numbers in Python pandas

Data manipulation and analysis are now a core part of many industries. One task that comes up often is converting numbers written as words into numeric values in a dataset. This article shows how pandas, a powerful Python library, can be used to do this correctly. We will walk through the steps, the code, and the ideas involved so that you understand the approach and can apply it easily.

First, let's understand the problem we are trying to solve. Imagine you have a dataset with a column of numbers written as words, such as "one," "two," "three," and so on. Our goal is to convert these number words into the corresponding numeric values using Python and pandas.

Step 1: Importing the required libraries
To do this, we first need to import the required libraries. In this case we use pandas to hold and manipulate the data, and inflect to help map number words to numbers.

import pandas as pd
import inflect

pandas library

pandas is an open-source data manipulation and analysis library that provides the data structures and functions needed to work with structured data. It is built on top of the Python programming language and plays an important role in processing, cleaning, and analyzing data. Its main data structures include Series, DataFrame, and Index, which help in handling different kinds of data and operations.
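To make the three structures concrete, here is a minimal sketch. The column names and values are illustrative assumptions chosen to match the example used later in this article.

```python
import pandas as pd

# A Series is a one-dimensional labeled array.
words = pd.Series(["one", "two", "three"], name="Numbers_in_words")

# A DataFrame is a table whose columns share one Index.
df = pd.DataFrame({"Numbers_in_words": words, "Numbers": [1, 2, 3]})

# The Index holds the row labels (0, 1, 2 by default).
print(type(words).__name__)   # Series
print(type(df).__name__)      # DataFrame
print(list(df.index))         # [0, 1, 2]
```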

inflect library

inflect is a Python library that generates plurals, singular nouns, ordinals, and indefinite articles, and converts numbers to words. It has no built-in function for the reverse direction, so going from words to numbers means building a lookup from its number-to-word output. To use inflect, you need to install it with the following command:

!pip install inflect

Step 2: Creating a pandas DataFrame
Now that we have imported the required libraries, let's create a pandas DataFrame with a column containing numbers as words. This will serve as our sample dataset for illustration.

data = {'Numbers_in_words': ['one', 'two', 'three', 'four', 'five']}
df = pd.DataFrame(data)
print(df)

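Step 3: Converting words to numbers
Next, we use the inflect library to convert the number words into their integer equivalents. Because inflect only converts numbers to words, we first build a reverse lookup from it, then define a function named 'convert_word_to_number' that takes a word as input and returns the corresponding number.

# inflect only converts numbers to words, so build a reverse lookup once.
p = inflect.engine()
WORD_TO_NUMBER = {p.number_to_words(n): n for n in range(101)}

def convert_word_to_number(word):
    # Normalize case and whitespace; unknown words return None.
    return WORD_TO_NUMBER.get(str(word).strip().lower())

df['Numbers'] = df['Numbers_in_words'].apply(convert_word_to_number)
print(df)

In this snippet, we use the inflect engine to generate the word form of every number from 0 to 100 and store the reverse mapping in a dictionary. The function looks a word up in that dictionary and returns None when the word is not found. We then use the pandas apply() method to run this function on each element of the 'Numbers_in_words' column of the DataFrame.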
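A column-wide alternative is to look each word up in a dictionary with Series.map. The sketch below assumes a small hand-written dictionary. The name WORD_TO_NUMBER and the sample rows, including the messy " Two" and the unmapped "eleven", are hypothetical and not part of the original example.

```python
import pandas as pd

# Hypothetical lookup table; extend it to cover the words in your data.
WORD_TO_NUMBER = {"one": 1, "two": 2, "three": 3, "four": 4, "five": 5}

df = pd.DataFrame({"Numbers_in_words": ["one", " Two", "three", "eleven"]})

# Normalize the text, then look each word up; unknown words become missing.
cleaned = df["Numbers_in_words"].str.strip().str.lower()
df["Numbers"] = cleaned.map(WORD_TO_NUMBER).astype("Int64")

print(df)
```

Casting to the nullable "Int64" dtype keeps the converted values as integers even when some rows could not be mapped, which makes the unmatched rows easy to find with isna().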

To summarize, we have seen how to use Python, pandas, and inflect to convert words to numbers in a dataset. pandas serves as the main tool for handling the data, while inflect helps with operations involving words and numbers. By following these steps, you can easily convert number words into numeric values in your datasets and carry on analyzing and working with your data. Happy coding!
