Posts in the 'Machine Learning' category

Machine Learning

CCE: Categorical Cross-Entropy 2024.04.09
A Guide to Classification Metrics in Machine Learning 2023.12.21

CCE: Categorical Cross-Entropy

지제로사 2024. 4. 9. 15:38

https://medium.com/@anishnama20/understanding-cost-functions-in-machine-learning-types-and-applications-cd7d8cc4b47d

Understanding Cost Functions in Machine Learning: Types and Applications

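When to use CCE

Multi-class classification: ideal for problems where each instance belongs to exactly one of several possible categories.
Probabilistic outputs: useful when the model outputs a probability for each class.
Softmax activation in the output layer: commonly paired with a softmax activation function in the final layer of a neural network.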
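The softmax and CCE pairing described above can be sketched in NumPy as follows. This is a minimal illustration, not a framework implementation: the function names `softmax` and `categorical_cross_entropy` and the logit values are assumptions made up for the example.

```python
import numpy as np

def softmax(logits):
    # Subtracting the row-wise max does not change the result
    # but avoids overflow in np.exp for large logits.
    shifted = logits - np.max(logits, axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=1, keepdims=True)

def categorical_cross_entropy(y_actual, y_pred):
    # Mean over samples of -sum over classes of y_actual * log(y_pred).
    return -np.mean(np.sum(y_actual * np.log(y_pred), axis=1))

# Hypothetical raw scores (logits) for 2 samples and 3 classes.
logits = np.array([[2.0, 1.0, 0.1], [0.5, 2.5, 0.3]])
y_actual = np.array([[1, 0, 0], [0, 1, 0]])

probs = softmax(logits)
loss = categorical_cross_entropy(y_actual, probs)
print('Row sums of probabilities:', probs.sum(axis=1))
print('Categorical Cross-Entropy:', loss)
```

Each row of `probs` sums to 1, which is what lets CCE treat the output as a probability distribution over classes.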

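When not to use CCE

Binary classification: for two-class problems, binary cross-entropy is the better fit.
Non-probabilistic outputs: not suitable when the model outputs are not probabilities.
Regression problems: inappropriate when the output is a continuous value, since regression is not a categorical task.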
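On the binary case: a two-class problem written as CCE over two one-hot columns gives the same value as BCE on a single probability, so BCE is simply the more compact form. A small sketch with made-up probabilities and labels:

```python
import numpy as np

# Hypothetical predicted probabilities of the positive class and true labels.
p_pos = np.array([0.2, 0.8, 0.6, 0.3])
labels = np.array([0, 1, 1, 0])

# Binary cross-entropy on a single probability per sample.
bce = -np.mean(labels * np.log(p_pos) + (1 - labels) * np.log(1 - p_pos))

# The same problem as two-class CCE: columns are [negative, positive].
y_pred_2col = np.column_stack([1 - p_pos, p_pos])
y_actual_2col = np.column_stack([1 - labels, labels])
cce = -np.mean(np.sum(y_actual_2col * np.log(y_pred_2col), axis=1))

print('BCE:', bce)
print('Two-class CCE:', cce)
```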

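Advantages

Performance on multi-class problems: handles multiple classes efficiently and gives good results on complex classification tasks.
Stability with softmax: works well with the softmax function, which normalizes the outputs into probabilities, and so supports stable training.
Interpretable loss value: the loss measures how closely the predicted probability distribution matches the true distribution.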

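Disadvantages

Sensitive to imbalanced data: performance can degrade when the training data is heavily skewed toward particular classes.
Numerical stability issues: the logarithm in CCE can cause numerical instability, especially for probabilities close to 0.
Unsuitable for non-categorical outputs: not appropriate for tasks whose outputs are not discrete categories, such as regression.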
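The numerical stability issue can be seen directly: if the predicted probability of the true class is exactly 0, log(0) makes the loss infinite. One common workaround, sketched here as an assumption rather than something the source prescribes, is to clip probabilities before taking the log. The name `safe_cce` and the value of `eps` are illustrative.

```python
import numpy as np

def safe_cce(y_actual, y_pred, eps=1e-12):
    # Clip probabilities away from 0 and 1 so np.log never receives 0.
    # eps is an assumed value chosen for illustration.
    clipped = np.clip(y_pred, eps, 1.0 - eps)
    return -np.mean(np.sum(y_actual * np.log(clipped), axis=1))

y_actual = np.array([[0, 0, 1], [0, 1, 0]])
# The first sample assigns probability 0 to its true class.
y_pred = np.array([[0.5, 0.5, 0.0], [0.2, 0.7, 0.1]])

# Without clipping, log(0) = -inf propagates into the loss.
with np.errstate(divide='ignore'):
    naive = -np.mean(np.sum(y_actual * np.log(y_pred), axis=1))

safe = safe_cce(y_actual, y_pred)
print('Naive CCE:', naive)
print('Clipped CCE:', safe)
```

The clipped loss is still large, which is appropriate for a confident wrong prediction, but it stays finite.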

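In summary, categorical cross-entropy is a powerful and widely used loss function for multi-class classification tasks.
It is especially effective when combined with a softmax output layer.
However, it is less effective with imbalanced data, binary classification tasks, and non-probabilistic model outputs.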
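For the data imbalance case, one commonly used mitigation is to weight each class in the loss. The source does not describe this, so treat the sketch below as an assumption: `weighted_cce` is a hypothetical helper, and inverse-frequency weighting is just one possible choice of weights.

```python
import numpy as np

def weighted_cce(y_actual, y_pred, class_weights):
    # Scale each sample's loss by the weight of its true class.
    per_sample = -np.sum(class_weights * y_actual * np.log(y_pred), axis=1)
    return np.mean(per_sample)

# Hypothetical imbalanced data: three samples of class 0, one of class 1.
y_actual = np.array([[1, 0], [1, 0], [1, 0], [0, 1]])
# A model that always favors the majority class.
y_pred = np.tile([0.9, 0.1], (4, 1))

# Inverse-frequency weights: n_samples / (n_classes * samples_per_class).
counts = y_actual.sum(axis=0)
inverse_freq = len(y_actual) / (len(counts) * counts)

plain = weighted_cce(y_actual, y_pred, np.ones(2))
weighted = weighted_cce(y_actual, y_pred, inverse_freq)
print('Class weights:', inverse_freq)
print('Unweighted CCE:', plain)
print('Weighted CCE:', weighted)
```

With uniform weights the function reduces to plain CCE. With inverse-frequency weights, the missed minority sample contributes more, so a model that ignores the minority class is penalized more heavily.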

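BCE = LogLoss = -1/n * sum_i(y_actual_i * log(y_pred_i) + (1 - y_actual_i) * log(1 - y_pred_i))
CCE = -1/n * sum_i(sum_c(y_actual_ic * log(y_pred_ic)))

y_actual : true label (0 or 1 for BCE, one-hot vector for CCE)
y_pred : predicted probability
n : number of samples (i indexes samples, c indexes classes)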

BCE

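import numpy as np

# Predicted probabilities of the positive class and the true 0/1 labels
y_pred = np.array([0.2, 0.8, 0.6, 0.3])
y_actual = np.array([0, 1, 1, 0])
n = len(y_pred)

# Average the per-sample log loss over all n samples
bce = -1/n * np.sum(y_actual * np.log(y_pred) + (1 - y_actual) * np.log(1 - y_pred))
print('Binary Cross-Entropy:', bce)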

CCE

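import numpy as np

# Each row holds the predicted class probabilities for one sample
y_pred = np.array([[0.1, 0.3, 0.6], [0.2, 0.7, 0.1], [0.9, 0.05, 0.05]])
# One-hot encoded true labels
y_actual = np.array([[0, 0, 1], [0, 1, 0], [1, 0, 0]])
n = len(y_pred)

# Sum over classes and samples, then average over the n samples
cce = -1/n * np.sum(y_actual * np.log(y_pred))
print('Categorical Cross-Entropy:', cce)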
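When the true labels are stored as integer class indices instead of one-hot vectors, the same loss can be computed by picking out the predicted probability of the true class for each sample. A small sketch reusing the probabilities above; the `labels` array is the integer form of the same one-hot targets.

```python
import numpy as np

y_pred = np.array([[0.1, 0.3, 0.6], [0.2, 0.7, 0.1], [0.9, 0.05, 0.05]])
# Integer class index of the true class for each sample.
labels = np.array([2, 1, 0])

# Select the predicted probability of the true class in each row.
picked = y_pred[np.arange(len(labels)), labels]
cce_from_indices = -np.mean(np.log(picked))

# Same result via the one-hot formulation.
y_actual = np.eye(3)[labels]
cce_one_hot = -np.mean(np.sum(y_actual * np.log(y_pred), axis=1))

print('CCE from integer labels:', cce_from_indices)
print('CCE from one-hot labels:', cce_one_hot)
```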

A Guide to Classification Metrics in Machine Learning

지제로사 2023. 12. 21. 15:18

https://www.evidentlyai.com/classification-metrics

● A guide to evaluating the quality of classification models
An introduction to various machine learning metrics for binary and multi-class problems

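● What the guide covers
1. How to calculate the key classification metrics, including accuracy, precision, recall, F1 score, and ROC AUC
2. The pros and cons of each metric, how each behaves in corner cases, and when some metrics are more suitable than others
3. Practical tips for setting up and using classification metrics in ML monitoring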
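As a rough illustration of the metrics named above, the sketch below computes accuracy, precision, recall, F1 score, and ROC AUC for a binary problem using plain NumPy. The labels, scores, and the 0.5 threshold are made-up values, and the pairwise ROC AUC calculation is one simple way to compute it, not necessarily the approach the linked guide takes.

```python
import numpy as np

# Hypothetical true labels and predicted scores for a binary problem.
y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])
y_score = np.array([0.9, 0.4, 0.7, 0.3, 0.2, 0.6, 0.8, 0.1])

# Turn scores into hard predictions with an assumed threshold of 0.5.
threshold = 0.5
y_hat = (y_score >= threshold).astype(int)

# Confusion matrix counts.
tp = int(np.sum((y_hat == 1) & (y_true == 1)))
fp = int(np.sum((y_hat == 1) & (y_true == 0)))
fn = int(np.sum((y_hat == 0) & (y_true == 1)))
tn = int(np.sum((y_hat == 0) & (y_true == 0)))

accuracy = (tp + tn) / len(y_true)
precision = tp / (tp + fp)
recall = tp / (tp + fn)
f1 = 2 * precision * recall / (precision + recall)

# ROC AUC as the fraction of (positive, negative) pairs in which the
# positive sample gets the higher score; ties count as half.
pos = y_score[y_true == 1]
neg = y_score[y_true == 0]
diff = pos[:, None] - neg[None, :]
roc_auc = (np.sum(diff > 0) + 0.5 * np.sum(diff == 0)) / diff.size

print('TP, FP, FN, TN:', tp, fp, fn, tn)
print('Accuracy:', accuracy)
print('Precision:', precision)
print('Recall:', recall)
print('F1:', f1)
print('ROC AUC:', roc_auc)
```

Accuracy, precision, recall, and F1 depend on the chosen threshold, while ROC AUC is computed from the raw scores and does not.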

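● What sets the guide apart
1. Explains the intuition behind the metrics. Focuses on simple explanations that anyone can follow, tied back to the formulas.
2. Illustrated guide. Provides a visualization for each metric.
3. Uses real-world examples. Relies on relevant business cases that can come up at work rather than abstract scenarios.
4. No need to read the guide from start to finish. Each part can be read on its own.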

● Reference

[1] Confusion Matrix

https://www.evidentlyai.com/classification-metrics/confusion-matrix

How to interpret a confusion matrix for a machine learning model

How to use the confusion matrix in machine learning? This illustrated guide breaks down each matrix component and explains how to read it.

[2] Accuracy, Precision, Recall

https://www.evidentlyai.com/classification-metrics/accuracy-precision-recall

Accuracy vs. precision vs. recall in machine learning: what's the difference?

Confused about accuracy, precision, and recall in machine learning? This illustrated guide breaks down each metric and provides examples to explain the differences.

[3] Multi-class Precision and Recall
https://www.evidentlyai.com/classification-metrics/multi-class-metrics#binary-vs-multi-class-classification

Accuracy, precision, and recall in multi-class classification

How to use accuracy, precision, and recall in multi-class classification? This illustrated guide breaks down how to apply each metric for multi-class machine learning problems.

[4] Classification Threshold

https://www.evidentlyai.com/classification-metrics/classification-threshold

How to use classification threshold to balance precision and recall

Is 0.5 a universal classification threshold in probabilistic machine learning problems? This illustrated guide explains the precision-recall trade-off and how to approach threshold choice.

[5] ROC AUC Score

https://www.evidentlyai.com/classification-metrics/explain-roc-curve

How to explain the ROC AUC score and ROC curve?

How to interpret the ROC curve and ROC AUC scores? This illustrated guide breaks down the concepts and explains how to use them to evaluate classifier quality.
