python🐍🐼pandas 초보자 가이드

판다는 무엇입니까?

Python pandas는 데이터 분석에 널리 사용되는 오픈 소스 라이브러리입니다.
Pandas 라이브러리는 ML 및 데이터 과학에서 데이터를 읽고 조작하는 데 사용됩니다.

pip install pandas

시스템에 pandas를 설치하는 pip 명령.

데이터프레임이란?

pandas DataFrame은 2차원 데이터 배열 또는 행과 열이 있는 테이블입니다.

팬더에서 데이터 프레임 만들기:

import pandas as pd
car_dataset = {
'cars': ['Tata', 'Maruti', 'Tesla'], 'Model': ['Nano', 'i10', '11x3'], 'Range: [300, 315, 400]
}
car_df = pd.DataFrame(car_dataset)
print(car_df)

데이터 프레임의 기본 열 작업
대괄호를 사용하여 데이터 프레임 열에 쉽게 액세스하고 값을 할당하거나 업데이트할 수도 있습니다.
다음은 데이터 프레임 열에서 수행할 수 있는 몇 가지 기본 작업입니다.

#Accessing Single Column
print(car_df[['cars']])
# you can also use single square brackets to access single column
#Accessing Multiple Column
print(car_df [[ 'Model', 'Range']])
# Add New Column
car_df['new_column_name'] = [1, 2, 3] # new column value
# Delete Column
car_df.drop(columns=['new_col_name'], inplace=True)
# rename column
#Syntax: df.renamel columns={"oldName":"NewName"}, inplace=True)
car_df.rename(columns={ 'Model' : 'model'}, inplace=True)

CSV 파일 읽기:

빅 데이터 세트를 저장하는 간단한 방법은 CSV 파일(쉼표로 구분된 파일)을 사용하는 것입니다.
CSV 파일은 기계 학습 또는 데이터 과학에서 작업하는 동안 사용할 일반적인 파일 유형입니다.

import pandas as pd
df = pd.read_csv('Housing.csv') print(df)
# print(df.to_string())
# use to_string() to print the entire DataFrame.

데이터 살펴보기:

데이터의 높은 수준의 개요를 이해하기 위해 pandas는 여러 기능을 제공하며 그 중 일부는 다음과 같습니다.

import pandas as pd
 Read CSV File
df = pd.read_csv('Housing.csv')
#head of the data
print(df.head(10)) print first 19 rows of dataframe
#tall of the data
print(df.tail(10)) print last 10 rows of dataframe
#shape = To know the dimensions of the data print(df.shape)
#(545, 19) 11's means 545 rows and 13 columns
#Features
print(df.columns) # it return the columns name
#Index("price", "area", "bedrooms bathrooms, stories", "matnroad"
#guestroom", "basement, hotwaterheating', 'airconditioning,
#parking prefarea", furnishingstatus ], dtype="object")
#info
print(df.info())
prints info about the null values and the data types of each cols.

Pandas를 사용한 통계 분석:
Pandas는 데이터에서 더 깊이 파고들고 더 유용한 통찰력을 찾는 데 도움이 되는 몇 가지 기능을 제공하며 유용한 기능 중 일부는 다음과 같습니다.

# describe : returns statistical measures such as min and max values, mean, standard deviation and more.
df.describe()
# unique : return all the unique values in column.
df['columnName'].unique()
#value_count : returns the frequency of the values df['columnName'].value_counts()
# correlation : find the correlation among the features respectively.
df.corr()

Pandas에는 평균, 중앙값 및 모드 등과 같은 다른 통계적 척도를 찾는 기능도 있습니다.

Reference

이 문제에 관하여(python🐍🐼pandas 초보자 가이드), 우리는 이곳에서 더 많은 자료를 발견하고 링크를 클릭하여 보았다 https://dev.to/quitsen/beginner-guide-on-pythonpandas-4a9

텍스트를 자유롭게 공유하거나 복사할 수 있습니다.하지만 이 문서의 URL은 참조 URL로 남겨 두십시오.

우수한 개발자 콘텐츠 발견에 전념 (Collection and Share based on the CC Protocol.)

좋은 웹페이지 즐겨찾기

개발자 우수 사이트 수집

개발자가 알아야 할 필수 사이트 100선 추천 우리는 당신을 위해 100개의 자주 사용하는 개발자 학습 사이트를 정리했습니다