[Python 머신러닝] 06-5 NMF(Non-Negative Matrix Factorization)

2024-05-17 최대 1 분 소요

차원 축소

NMF(Non-Negative Matrix Factorization)

NMF 개요

NMF(Non-Negative Matrix Factorization)

NMF는 원본 행렬 내의 모든 원소 값이 모두 양수(0 이상)라는 게 보장되면 다음과 같이 좀 더 간단하게 두 개의 기반 양수 행렬로 분해될 수 있는 기법을 칭한다.

스크린샷 2024-05-14 173137

행렬 분해(Matrix Factorization)

행렬 분해(Matrix Factorization)는 일반적으로 SVD와 같은 행렬 분해 기법을 통칭하는 것이다.
이처럼 행렬 분해들 하게 되면 W 행렬과 H 행렬은 일반적으로 길고 가는 행렬 W(즉, 원본 행렬의 행 크기와 같고 열 크기보다 작은 행렬)와 작고 넓은 행렬 H(원본 행렬의 행 크기보다 작고 열 크기와 같은 행렬)로 분해된다.
이렇게 분해된 행렬은 Latent Factor(잠재 요소)를 특성으로 가지게 된다.
분해 행렬 W는 원본 행에 대해서 이 잠재 요소의 값이 얼마나 되는지에 대응하며, 분해 행렬 H는 이 잠재 요소가 원본 열(즉, 원본 속성)로 어떻게 구성됐는지를 나타내는 행렬이다.

<실습>

from sklearn.decomposition import NMF
from sklearn.datasets import load_iris
import matplotlib.pyplot as plt
%matplotlib inline

iris = load_iris()
iris_ftrs = iris.data
nmf = NMF(n_components=2)

nmf.fit(iris_ftrs)
iris_nmf = nmf.transform(iris_ftrs)

plt.scatter(x=iris_nmf[:,0], y= iris_nmf[:,1], c= iris.target)
plt.xlabel('NMF Component 1')
plt.ylabel('NMF Component 2')

Text(0,0.5,'NMF Component 2')

output_18_1

Twitter Facebook LinkedIn

Sieun Kim

[Python 머신러닝] 06-5 NMF(Non-Negative Matrix Factorization)

차원 축소

NMF(Non-Negative Matrix Factorization)

NMF 개요

NMF(Non-Negative Matrix Factorization)

행렬 분해(Matrix Factorization)

<실습>

공유하기

참고

[CS] 1.1 디자인 패턴

[BIGDATA] 3-2 쿼리 엔진

[BIGDATA] 3-1 대규모 분산 처리의 프레임워크

[SQL] 08-3 GUI 응용 프로그램