KFServing (KServe)

KFServing, Transformer, Predictor, Explainer

๋ณธ ๋ฌธ์„œ๋Š” KServe 0.8 ๋ฒ„์ „ ๊ธฐ์ค€์œผ๋กœ ์ž‘์„ฑํ•˜์˜€๋‹ค.

KServe๋Š” Kubernetes์— ML Model์„ Deployํ•˜๊ณ  Serving ๊ธฐ๋Šฅ์„ ์ œ๊ณตํ•˜๋Š” Model Inference Platform์ด๋‹ค.

Control Plane

InferenceService CR ์ž‘์„ฑํ•˜๊ณ  Kubernetes API Server์— ๋“ฑ๋กํ•˜๋ฉด, Transformer, Predictor, Explainer ๋“ฑ์„ ์ƒ์„ฑํ•˜์—ฌ Inference Service ๋ฅผ ๊ตฌ์ถ•ํ•  ์ˆ˜ ์žˆ๋‹ค. Knative Serverless๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํŠธ๋ž˜ํ”ฝ์ด ์—†์„ ๋•Œ๋Š” scale-to-zero ๋™์ž‘ํ•œ๋‹ค.

Inferenceํ•  ๋ฐ์ดํ„ฐ์…‹ ์œ„์น˜๋ฅผ ์ •์˜ํ•œ๋‹ค. Predictor์— ML Framework spec์„ ์ •์˜ํ•œ ํ›„ endpoint spec์„ ์ƒ์„ฑํ•œ๋‹ค. ์ƒ์„ฑํ•œ endpoint spec์„ InferenceService Metadata spec์— ์ž‘์„ฑํ•ด InferenceService๋ฅผ ์ƒ์„ฑํ•œ๋‹ค.

์ถœ์ฒ˜: https://kserve.github.io/website/0.8/modelserving/control_plane/

Control Plane Components

Data Plane

KF Serving ๊ตฌ์„ฑ์š”์†Œ๋Š” Endpoint, Transformer, Predictor, Explainer ๊ฐ€ ์žˆ์œผ๋ฉฐ Endpoint ๋งˆ๋‹ค Explainer ๋ฅผ ๊ตฌ์„ฑํ•˜๊ณ  ํ•„์š”์— ๋”ฐ๋ผ Transformer, Explainer ๋ฅผ ์ถ”๊ฐ€ํ•  ์ˆ˜ ์žˆ๋‹ค.

Endpoint

InferenceService๋Š” Default Endpoint์™€ Canary 2๊ฐœ๋ฅผ ์ œ๊ณตํ•˜๋ฉฐ, Rollout ์ •์ฑ…์„ ์ •์˜ํ•˜์—ฌ ํŠธ๋ž˜ํ”ฝ ๋น„์œจ์„ ์กฐ์ ˆํ•  ์ˆ˜ ์žˆ๋‹ค.

Transformer

์‚ฌ์šฉ์ž๊ฐ€ Predictor๋‚˜ Explainer ์ˆ˜ํ–‰ ์ „ ํ›„์— ๋ฐ์ดํ„ฐ ์ „์ฒ˜๋ฆฌ, ํ›„์ฒ˜๋ฆฌ ํ•  ์ˆ˜ ์žˆ๋‹ค.

Predictor

ML Model Server๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ์˜ˆ์ธกํ•˜๊ฑฐ๋‚˜ ๋ถ„๋ฅ˜ํ•˜๋Š” ์—ญํ• ์„ ํ•œ๋‹ค.

Explainer

XAI๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ์˜ˆ์ธกํ•˜๊ฑฐ๋‚˜ ๋ถ„๋ฅ˜ํ•œ ๊ฒฐ๊ณผ์— ๋Œ€ํ•ด ํŒ๋‹จ ์ด์œ ๋ฅผ ์ œ์‹œํ•˜๋Š” ์—ญํ• ์„ ํ•œ๋‹ค.

์ถœ์ฒ˜: https://kserve.github.io/website/modelserving/data_plane

API v1

API
Method
Path
Payload

Readiness

GET

/v1/models/

Response:{"name": , "ready": true/false}

Predict

POST

/v1/models/:predict

Request:{"instances": []} Response:{"predictions": []}

Explain

POST

/v1/models/:explain

Request:{"instances": []} Response:{"predictions": [], "explainations": []}

์ฐธ๊ณ ์ž๋ฃŒ

https://kserve.github.io/website/ https://www.kubeflow.org/docs/components/kfserving/ https://devocean.sk.com/blog/techBoardDetail.do?ID=163739

KFServing ์—์„œ ์ œ๊ณตํ•˜๋Š” Endpoint, Transformer, Explainer, Predictor ์™ธ์— ๋” ๊ตฌ์„ฑ์š”์†Œ๋ฅผ ์ถ”๊ฐ€ํ•  ์˜ˆ์ •์ด๋ฉฐ, Outlier Detection ๋„ ๊ทธ ์ค‘ ํ•˜๋‚˜์ด๋‹ค.

Last updated

Was this helpful?