Computer Vision / ML2025

Furniture Valuation CV Pipeline

Production computer vision pipeline for image-based asset evaluation: quality classification, resale prediction, scam detection, and deployed API serving 300+ listings per minute.

Role: ML / AI Engineer

PythonComputer VisionFastAPIDockerPandasscikit-learn

The problem

A resale marketplace needed an automated system to evaluate furniture from images and listing data at scale. The business challenges were specific and commercially real:

Is this item worth processing? — classify whether furniture is valuable or undesirable based on visual features
What's it worth? — predict resale value from image + listing data
Is this a scam? — detect duplicate images that indicate fraudulent listings
Can it scale? — process hundreds of listings per minute from bulk CSV uploads via an API

This was not a notebook experiment. It was a production ML system with real operational requirements.

My role

I worked as the ML/AI engineer responsible for the computer vision pipeline, model architecture, and deployment infrastructure. The work covered training, inference, API design, and production scaling.

What I built

Quality classification

Built a visual classification system that determines whether furniture is valuable (worth resale processing) or undesirable (damaged, low-quality, or not worth the logistics). This required:

Feature extraction from listing images
Training on labeled quality datasets
Balancing precision and recall for business-appropriate thresholds
Handling varied image quality, angles, and lighting conditions

Resale value prediction

Developed a pricing model that combines visual features with listing metadata to estimate resale value. The model accounts for:

Condition signals visible in images
Listing data (brand, dimensions, original price where available)
Market positioning for different furniture categories
Confidence scoring to flag uncertain predictions

Duplicate-image scam detection

Built a duplicate/near-duplicate image detection system to catch fraudulent listings that reuse photos from other sellers. This involved:

Image fingerprinting and perceptual hashing
Similarity scoring across the listing database
Threshold tuning to catch scams without false-flagging legitimate relisted items

Make/model/trim detection

Worked on fine-grained attribute prediction — identifying furniture make, model, and style characteristics from images. This is a harder classification problem than binary quality assessment because it requires:

Visual feature discrimination between similar-looking products
Structured attribute output rather than single-label classification
Business-relevant labeling aligned with resale categories

Bulk processing and API deployment

The system was deployed as a containerized API capable of handling production throughput:

FastAPI inference service — handles individual and batch prediction requests
CSV bulk processing — ingests spreadsheets of listings, processes images, and returns enriched output
Dockerized deployment — containerized for consistent production serving
Scaling architecture — designed to handle 300+ listings per minute
Backend integration — clean API contracts for downstream systems to consume predictions

Architecture

The pipeline follows a multi-stage approach:

Ingestion — listings arrive via API (single or CSV bulk), images are fetched and preprocessed
Classification — quality assessment determines if the item is worth processing further
Prediction — resale value estimation combines visual and metadata features
Fraud check — duplicate-image detection flags suspicious listings
Attribute extraction — make/model/style attributes are predicted for qualified items
Output — enriched listing data with predictions, confidence scores, and flags is returned via API

Each stage can be independently updated — new models can be swapped in without changing the API contract.

What this project proved

This is one of the strongest examples of ML engineering growth in my portfolio:

From app development into ML-powered products — this wasn't wrapping an API; it was building the actual predictive system
From prototype to production — Docker deployment, backend integration, bulk processing, and scaling to hundreds of listings per minute
From generic ML into business-specific modeling — furniture valuation is business logic wrapped in ML: determining processing worthiness, estimating value, detecting fraud
From one prediction task into multi-task systems — quality classification, price prediction, duplicate detection, and attribute extraction in one pipeline
From model work into system integration — training, inference serving, API design, and operational handoff

Outcome

Delivered a production-ready computer vision pipeline that automated what would otherwise require human evaluators to manually assess hundreds of listings. The system handles classification, pricing, fraud detection, and attribute extraction at scale — demonstrating that my ML work goes beyond prototypes into commercially deployed predictive systems.