[Python] Pandas _ Apply, Map

✅ Data Load

# Load the data and print the number of rows and columns
import pandas as pd

DataUrl = 'https://raw.githubusercontent.com/Datamanim/pandas/main/BankChurnersUp.csv'
df = pd.read_csv(DataUrl)

Ans = df.shape
Ans

(10127, 19)

✅ Mapping

# Use map to recode the Income_Category values as follows and store the result in a newIncome column
# Unknown : N
# Less than $40K : a
# $40K - $60K : b
# $60K - $80K : c
# $80K - $120K : d
# $120K + : e

dic = {
    'Unknown': 'N',
    'Less than $40K': 'a',
    '$40K - $60K': 'b',
    '$60K - $80K': 'c',
    '$80K - $120K': 'd',
    '$120K +': 'e',
}

# Look up each income label in dic, one element at a time
df['newIncome'] = df.Income_Category.map(lambda x: dic[x])

Ans = df['newIncome']
Ans.head(4)

0    c
1    a
2    d
3    a
Name: newIncome, dtype: object
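As a side note, Series.map also accepts the dictionary itself, without a lambda. The sketch below uses a small hypothetical Series (not the real dataset) that includes one label missing from the dictionary, to show how the two forms differ on unseen values.

```python
import pandas as pd

# Hypothetical sample standing in for df.Income_Category;
# '$200K +' is deliberately absent from the dictionary
income = pd.Series(['$60K - $80K', 'Less than $40K', 'Unknown', '$200K +'])

dic = {
    'Unknown': 'N',
    'Less than $40K': 'a',
    '$40K - $60K': 'b',
    '$60K - $80K': 'c',
    '$80K - $120K': 'd',
    '$120K +': 'e',
}

# Passing the dict directly: labels not found in the dict become NaN
mapped = income.map(dic)
print(mapped.tolist())

# The lambda form does a plain dict lookup, so the unseen label raises KeyError
try:
    income.map(lambda x: dic[x])
    raised = False
except KeyError:
    raised = True
print(raised)
```

Which behavior is preferable depends on the task: NaN lets the pipeline continue and can be counted afterwards, while KeyError stops immediately at the first unexpected label.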

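✔️ A lambda is a way to write a small anonymous function as a single expression.

✔️ Python's built-in map takes a function and an iterable as arguments and applies the function to each element. In Python 3 it returns a lazy iterator, so wrap it in list() to get a list. The pandas Series.map used above also works element by element, but it returns a new Series.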
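To make the two notes above concrete, here is a minimal sketch comparing the built-in map with Series.map. The function name square and the sample values are illustrative only.

```python
import pandas as pd

# A lambda is an anonymous one-expression function;
# this is equivalent to: def square(x): return x * x
square = lambda x: x * x

# Built-in map: returns a lazy iterator in Python 3, so list() is needed to see the values
it = map(square, [1, 2, 3])
as_list = list(it)
print(as_list)  # [1, 4, 9]

# Series.map: applies the function to each element and returns a new Series
# that keeps the original index
s = pd.Series([1, 2, 3], index=['a', 'b', 'c'])
result = s.map(square)
print(type(result).__name__)  # Series
print(result.tolist())        # [1, 4, 9]
```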

✅ Apply

# Use apply to recode the Income_Category values as follows and store the result in a newIncome column
# Unknown : N
# Less than $40K : a
# $40K - $60K : b
# $60K - $80K : c
# $80K - $120K : d
# $120K + : e

def changeCategory(x):
    # The comparison is case-sensitive: the label in the data is 'Unknown'
    if x == 'Unknown':
        return 'N'
    elif x == 'Less than $40K':
        return 'a'
    elif x == '$40K - $60K':
        return 'b'
    elif x == '$60K - $80K':
        return 'c'
    elif x == '$80K - $120K':
        return 'd'
    elif x == '$120K +':
        return 'e'

df['newIncome'] = df.Income_Category.apply(changeCategory)

Ans = df['newIncome']
Ans.head(4)

0    c
1    a
2    d
3    a
Name: newIncome, dtype: object
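One thing worth knowing about an if/elif chain like changeCategory: a value that matches no branch falls through, the function returns None, and the result silently contains a missing value. The sketch below illustrates this with a hypothetical three-row Series and two hypothetical helper functions, one without a fallback and one with an explicit default.

```python
import pandas as pd

# Hypothetical sample; the last label has unexpected casing
income = pd.Series(['Unknown', '$120K +', 'unknown'])

def change_category_strict(x):
    if x == 'Unknown':
        return 'N'
    elif x == '$120K +':
        return 'e'
    # No else branch: any other value falls through and returns None

def change_category_with_default(x, default='?'):
    # dict.get returns the default for labels that are not listed
    codes = {'Unknown': 'N', '$120K +': 'e'}
    return codes.get(x, default)

strict = income.apply(change_category_strict)
safe = income.apply(change_category_with_default)

print(strict.isna().sum())  # 1 (the unmatched label became a missing value)
print(safe.tolist())        # ['N', 'e', '?']
```

Checking isna().sum() after an apply like this is a cheap way to catch labels the function did not anticipate.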

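✅ Mapping - Frequency Counts 1

# Use the Customer_Age values to define an age-band column, AgeState
# (0~9 : 0, 10~19 : 10, 20~29 : 20, ...), then print the frequency of each band

# Floor-divide by 10 and multiply back to get the lower bound of the decade
df['AgeState'] = df.Customer_Age.map(lambda x: x // 10 * 10)

Ans = df['AgeState'].value_counts().sort_index()
Ans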

AgeState
20     195
30    1841
40    4561
50    2998
60     530
70       2
Name: count, dtype: int64
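Because the banding is pure arithmetic, the same result can also be computed on the whole column at once, without map. A minimal sketch on hypothetical ages (not the real Customer_Age column):

```python
import pandas as pd

# Hypothetical ages standing in for df.Customer_Age
ages = pd.Series([26, 34, 45, 45, 59, 73])

# Element-wise version, as in the post
by_map = ages.map(lambda x: x // 10 * 10)

# Vectorized version: the arithmetic operators work on the whole Series
by_vector = ages // 10 * 10

print(by_vector.tolist())                    # [20, 30, 40, 40, 50, 70]
print(by_map.tolist() == by_vector.tolist()) # True

# Frequency of each band, sorted by band
counts = by_vector.value_counts().sort_index()
print(counts.to_dict())                      # {20: 1, 30: 1, 40: 2, 50: 1, 70: 1}
```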

✅ Mapping - Frequency Counts 2

# Define a newEduLevel column that is 1 when the Education_Level value contains the word Graduate
# and 0 otherwise, then print the frequency of each value

df['newEduLevel'] = df.Education_Level.map(lambda x: 1 if 'Graduate' in x else 0)

df['newEduLevel']

0        0
1        1
2        1
3        0
4        0
        ..
10122    1
10123    0
10124    0
10125    1
10126    1
Name: newEduLevel, Length: 10127, dtype: int64

# value_counts must be called with parentheses; without them Ans holds the bound method, not the counts
Ans = df['newEduLevel'].value_counts()

Ans
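Two details in this step are easy to trip over: the substring test can be written with the .str accessor, and value_counts only produces frequencies when it is actually called. A small sketch on hypothetical education labels (not the real column):

```python
import pandas as pd

# Hypothetical labels standing in for df.Education_Level
edu = pd.Series(['High School', 'Graduate', 'Post-Graduate', 'Unknown'])

# Vectorized substring test; regex=False treats 'Graduate' as a literal string
flag = edu.str.contains('Graduate', regex=False).astype(int)
print(flag.tolist())  # [0, 1, 1, 0]

# Without parentheses this is only a reference to the method
method = flag.value_counts
print(callable(method))  # True

# With parentheses the method runs and returns the frequencies as a Series
counts = flag.value_counts()
print(counts.to_dict())
```

Printing the bare method shows a "bound method ... value_counts of ..." line followed by the Series itself, which is a useful hint that the parentheses are missing.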

✅ Mapping - Frequency Counts 3

# Define a newLimit column that is 1 when Credit_Limit is 4500 or more and 0 otherwise,
# then print the frequency of each newLimit value

df['newLimit'] = df.Credit_Limit.map(lambda x: 1 if x >= 4500 else 0)

df['newLimit']

0        1
1        1
2        0
3        0
4        1
        ..
10122    0
10123    0
10124    1
10125    1
10126    1
Name: newLimit, Length: 10127, dtype: int64

# value_counts must be called with parentheses; without them Ans holds the bound method, not the counts
Ans = df['newLimit'].value_counts()

Ans
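A threshold flag like this can also be derived from a comparison on the whole column. The sketch below uses hypothetical credit limits, including one exactly at the 4500 boundary, to confirm that ">=" counts the boundary value as 1.

```python
import pandas as pd

# Hypothetical limits standing in for df.Credit_Limit; 4500.0 sits on the boundary
credit = pd.Series([12691.0, 8256.0, 3418.0, 4500.0])

# The comparison yields a boolean Series; astype(int) turns True/False into 1/0
new_limit = (credit >= 4500).astype(int)
print(new_limit.tolist())  # [1, 1, 0, 1]

# Same result as the element-wise map with a lambda
by_map = credit.map(lambda x: 1 if x >= 4500 else 0)
print(by_map.tolist() == new_limit.tolist())  # True
```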

✅ Apply - Frequency Counts 1

# Define a newState column that is 1 when Marital_Status is Married and Card_Category is Platinum,
# and 0 otherwise, then print the frequency of each newState value

def check(x):
    # x is one row of the DataFrame because apply is called with axis=1
    if x.Marital_Status == "Married" and x.Card_Category == "Platinum":
        return 1
    else:
        return 0

df['newState'] = df.apply(check, axis=1)

Ans = df['newState'].value_counts()
Ans

newState
0    10120
1        7
Name: count, dtype: int64
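With axis=1, apply calls the function once per row, which is flexible but slow on large frames. When the rule is a combination of column comparisons, a boolean mask built with & gives the same flags. A minimal sketch on a hypothetical four-row frame:

```python
import pandas as pd

# Hypothetical rows standing in for the real DataFrame
people = pd.DataFrame({
    'Marital_Status': ['Married', 'Single', 'Married', 'Married'],
    'Card_Category': ['Platinum', 'Platinum', 'Blue', 'Platinum'],
})

def check(row):
    # Called once per row because of axis=1
    if row.Marital_Status == 'Married' and row.Card_Category == 'Platinum':
        return 1
    return 0

row_wise = people.apply(check, axis=1)

# Column-wise version: each comparison needs its own parentheses,
# and & (not the keyword `and`) combines the two boolean Series
mask = (people['Marital_Status'] == 'Married') & (people['Card_Category'] == 'Platinum')
vectorized = mask.astype(int)

print(vectorized.tolist())                       # [1, 0, 0, 1]
print(row_wise.tolist() == vectorized.tolist())  # True
```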

✅ Apply - Frequency Counts 2

# Replace the Gender values, M with male and F with female, and store the result back in the Gender column,
# then print the frequency of each value

def ChangeGender(x):
    # Any value other than 'M' is treated as female
    if x == 'M':
        return 'male'
    else:
        return 'female'

df['Gender'] = df.Gender.apply(ChangeGender)

Ans = df['Gender'].value_counts()
Ans

Gender
female    5358
male      4769
Name: count, dtype: int64
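Because ChangeGender sends every value other than 'M' to 'female', an unexpected code is relabelled without any warning. A dictionary passed to map lists both codes explicitly, so anything else shows up as a missing value instead. The sketch below uses a hypothetical Series with one unexpected lowercase code.

```python
import pandas as pd

# Hypothetical codes standing in for df.Gender; the last one is unexpected
gender = pd.Series(['M', 'F', 'F', 'M', 'm'])

def change_gender(x):
    return 'male' if x == 'M' else 'female'

# apply with if/else: the unexpected 'm' is silently labelled female
by_apply = gender.apply(change_gender)
print(by_apply.tolist())  # ['male', 'female', 'female', 'male', 'female']

# map with a dict: codes not listed in the dict become NaN
by_map = gender.map({'M': 'male', 'F': 'female'})
print(by_map.isna().sum())  # 1
```

The same applies when the cell is run twice on the real column: on the second run the values are already 'male' and 'female', so the if/else version would turn every row into 'female'.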

