MobileNet Image Classifier (Keras, `.keras` format)

A minimal, production-leaning image classification template using TensorFlow/Keras with a MobileNet backbone and a standard softmax head.
Out of the box it supports folder-based datasets (e.g. cats vs dogs) and saves models in the native .keras format.

✨ Features

MobileNet (ImageNet weights) + GAP + small MLP + softmax head
Folder-based datasets via image_dataset_from_directory
Preprocessing aligned with MobileNet (preprocess_input)
Reproducible training (seeded, CSV logs)
Callbacks: ModelCheckpoint (best), EarlyStopping, CSVLogger
Clean separation: data.py | model.py | train.py | evaluate.py
Native Keras saving/loading (.keras)
Ready to extend (TensorBoard/plot_model/finetune)
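
The architecture in the first bullet can be sketched as follows. This is a minimal illustration rather than the actual contents of src/model.py: the signature of `create_model`, the `weights` parameter, the hidden width (128), and the dropout rate (0.2) are all assumptions, and the model expects inputs that have already been passed through preprocess_input.

```python
from tensorflow import keras


def create_model(num_classes, input_shape=(224, 224, 3), freeze_base=True,
                 lr=1e-3, weights="imagenet"):
    """Build a MobileNet backbone with a GAP + small MLP + softmax head."""
    # Backbone without the ImageNet classification head.
    base = keras.applications.MobileNet(
        input_shape=input_shape, include_top=False, weights=weights
    )
    base.trainable = not freeze_base  # frozen backbone for the warm-up phase

    inputs = keras.Input(shape=input_shape)
    x = base(inputs)
    x = keras.layers.GlobalAveragePooling2D()(x)        # GAP
    x = keras.layers.Dense(128, activation="relu")(x)   # width is an assumption
    x = keras.layers.Dropout(0.2)(x)                    # rate is an assumption
    outputs = keras.layers.Dense(num_classes, activation="softmax")(x)

    model = keras.Model(inputs, outputs)
    model.compile(
        optimizer=keras.optimizers.Adam(learning_rate=lr),
        loss="sparse_categorical_crossentropy",  # matches integer labels
        metrics=["accuracy"],
    )
    return model
```

With a frozen backbone only the two Dense layers in the head are trained, which keeps the first epochs cheap and stable.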

📦 Project Structure

├─ src/
│  ├─ data.py        # dataset building + preprocessing
│  ├─ model.py       # create_model(...): MobileNet + classifier head
│  ├─ train.py       # training entrypoint (fit, callbacks, save)
│  └─ evaluate.py    # evaluation entrypoint (load .keras, evaluate)
├─ data/
│  ├─ train/
│  │  ├─ class_a/ ... images ...
│  │  └─ class_b/ ...
│  └─ val/
│     ├─ class_a/ ...
│     └─ class_b/ ...
├─ models/           # saved final models (*.keras)
├─ checkpoints/      # best checkpoint from ModelCheckpoint
├─ logs/             # CSV logs (and optional TensorBoard)
├─ README.md
└─ requirements.txt

You can have more than two classes; the class list is inferred from the folder names inside train/ and val/.

🗂️ Data Layout

Place your images like this:

data/
  train/
    cats/  img001.jpg ...
    dogs/  img101.jpg ...
  val/
    cats/  img201.jpg ...
    dogs/  img301.jpg ...

Class names are taken from subfolder names.
The validation set should not overlap the training set.
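
Both rules can be checked before training. The sketch below uses only the standard library and is not part of the repository: the helper names (`list_classes`, `file_hashes`, `check_layout`) are hypothetical, and treating "overlap" as byte-identical files is an assumption. Sorting the subfolder names mirrors the alphanumeric order in which image_dataset_from_directory assigns inferred labels.

```python
import hashlib
from pathlib import Path

# File extensions treated as images in this sketch.
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".bmp", ".gif"}


def list_classes(split_dir):
    """Return the sorted subfolder names of a split directory."""
    return sorted(p.name for p in Path(split_dir).iterdir() if p.is_dir())


def file_hashes(split_dir):
    """Map SHA-256 digest -> path for every image file under split_dir."""
    hashes = {}
    for path in sorted(Path(split_dir).rglob("*")):
        if path.is_file() and path.suffix.lower() in IMAGE_EXTS:
            hashes[hashlib.sha256(path.read_bytes()).hexdigest()] = path
    return hashes


def check_layout(train_dir, val_dir):
    """Return (class_names, overlapping_pairs) for a train/val layout."""
    train_classes = list_classes(train_dir)
    val_classes = list_classes(val_dir)
    if train_classes != val_classes:
        raise ValueError(f"class mismatch: {train_classes} vs {val_classes}")
    train_hashes = file_hashes(train_dir)
    val_hashes = file_hashes(val_dir)
    # Files with identical content that appear in both splits.
    shared = sorted(train_hashes.keys() & val_hashes.keys())
    overlap = [(train_hashes[h], val_hashes[h]) for h in shared]
    return train_classes, overlap
```

Calling `check_layout("data/train", "data/val")` returns the class list and any byte-identical files that appear in both splits; an empty overlap list means no exact duplicates were found.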

🛠️ Setup

# Python 3.10+ recommended
python -m venv .venv
source .venv/bin/activate        # Windows: .venv\Scripts\activate

pip install -r requirements.txt

Optional (for model diagram): To use plot_model, also install:

pip install pydot graphviz

plot_model also needs the Graphviz system package:

Ubuntu/Debian: sudo apt install graphviz

macOS (Homebrew): brew install graphviz

Windows: install Graphviz and add its bin to PATH.

🚀 Quickstart

python -m src.train \
  --train_dir data/train \
  --val_dir data/val \
  --epochs 15 \
  --batch 32 \
  --lr 1e-3 \
  --freeze_base \
  --out checkpoints/best.keras \
  --save_final models/final.keras

--freeze_base (optional) starts training with the backbone frozen for a stable warm-up. You can fine-tune later by unfreezing the backbone and lowering the learning rate (see Finetuning Tips below).

✅ Evaluate

python -m src.evaluate \
  --model models/final.keras \
  --val_dir data/val \
  --batch 32

This prints metrics (loss/accuracy) and a few sample predictions.
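
The flags used in the Quickstart and Evaluate commands could be parsed along these lines. This is a sketch with argparse, not the repository's actual parsers: the flag names come from the commands above, while the function names, the defaults, and which flags are required are assumptions.

```python
import argparse


def build_train_parser():
    """Parser for the flags shown in the Quickstart command."""
    p = argparse.ArgumentParser(prog="src.train")
    p.add_argument("--train_dir", required=True, help="training image root")
    p.add_argument("--val_dir", required=True, help="validation image root")
    p.add_argument("--epochs", type=int, default=15)
    p.add_argument("--batch", type=int, default=32)
    p.add_argument("--lr", type=float, default=1e-3)
    # Boolean switch: present -> True, absent -> False.
    p.add_argument("--freeze_base", action="store_true")
    p.add_argument("--out", default="checkpoints/best.keras")
    p.add_argument("--save_final", default="models/final.keras")
    return p


def build_eval_parser():
    """Parser for the flags shown in the Evaluate command."""
    p = argparse.ArgumentParser(prog="src.evaluate")
    p.add_argument("--model", required=True, help="path to a .keras file")
    p.add_argument("--val_dir", required=True)
    p.add_argument("--batch", type=int, default=32)
    return p
```

Using `action="store_true"` for --freeze_base makes the warm-up opt-in, which matches the way the flag is described as optional.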

⚙️ Training Notes

Input size: 224×224×3 (change it in data.py if needed).

Labels: uses sparse integer labels (label_mode="int") by default.

Loss: sparse_categorical_crossentropy (switch to categorical if you prefer one-hot).

Preprocessing: MobileNet preprocess_input (scales to [-1, 1]).

Checkpoints: best model by val_accuracy → checkpoints/best.keras

Final model: always saved to models/final.keras

Logs: logs/train.csv for curves (you can plot later; add TensorBoard if you like).
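
Two of the notes above are easy to verify by hand. The plain-Python sketch below shows the arithmetic behind the [-1, 1] scaling and why the sparse and one-hot losses give the same value for the same prediction. The function names are hypothetical and the code only illustrates the formulas; the project itself relies on the Keras implementations.

```python
import math


def scale_pixel(value):
    """Map an 8-bit pixel value in [0, 255] to the range [-1, 1]."""
    return value / 127.5 - 1.0


def sparse_cross_entropy(probs, label):
    """Loss for an integer label: -log of the true class probability."""
    return -math.log(probs[label])


def categorical_cross_entropy(probs, one_hot):
    """Loss for a one-hot label: -sum(t * log(p)) over the classes."""
    return -sum(t * math.log(p) for t, p in zip(one_hot, probs) if t > 0)


probs = [0.8, 0.2]  # softmax output for one image (made-up numbers)
loss_sparse = sparse_cross_entropy(probs, 0)           # label as integer 0
loss_onehot = categorical_cross_entropy(probs, [1, 0])  # same label, one-hot
```

Both losses reduce to the negative log of the probability assigned to the true class, so switching label formats changes the data pipeline but not the objective.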

📈 TensorBoard (optional)

Add a TensorBoard callback in src/train.py:

keras.callbacks.TensorBoard(log_dir="logs/tb", histogram_freq=1)

Then launch TensorBoard:

tensorboard --logdir logs/tb --port 6006
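
One way the TensorBoard callback might sit alongside the callbacks listed under Features is sketched below. The helper `build_callbacks` is hypothetical, and several details are assumptions rather than the repository's settings: the patience value, monitoring val_accuracy for EarlyStopping, and restore_best_weights.

```python
from pathlib import Path

from tensorflow import keras


def build_callbacks(checkpoint_path="checkpoints/best.keras",
                    csv_path="logs/train.csv",
                    tb_dir="logs/tb",
                    patience=5):
    """Return checkpoint, early-stopping, CSV and TensorBoard callbacks."""
    # Make sure the output folders exist before training starts.
    for path in (checkpoint_path, csv_path):
        Path(path).parent.mkdir(parents=True, exist_ok=True)
    return [
        # Keep only the best model according to validation accuracy.
        keras.callbacks.ModelCheckpoint(
            checkpoint_path, monitor="val_accuracy", mode="max",
            save_best_only=True,
        ),
        # Stop when validation accuracy has not improved for `patience` epochs.
        keras.callbacks.EarlyStopping(
            monitor="val_accuracy", mode="max", patience=patience,
            restore_best_weights=True,
        ),
        keras.callbacks.CSVLogger(csv_path),
        keras.callbacks.TensorBoard(log_dir=tb_dir, histogram_freq=1),
    ]
```

The list would then be passed to training as `model.fit(..., callbacks=build_callbacks())`.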

🧪 Finetuning Tips

Warm-up: train with --freeze_base for a few epochs (faster convergence).

Unfreeze: modify model.py or add a CLI flag to set base.trainable=True.

Lower LR: e.g. 1e-4 for fine-tuning.

EarlyStopping: already configured; adjust patience as needed.
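
The unfreeze and lower-LR steps above might look like this in code. It is a sketch under stated assumptions: `unfreeze_for_finetuning` is a hypothetical helper, it unfreezes every layer rather than only part of the backbone, and it assumes the sparse loss described in Training Notes. Recompiling after changing `trainable` is needed for the change to take effect.

```python
from tensorflow import keras


def unfreeze_for_finetuning(model, lr=1e-4):
    """Make all layers trainable and recompile with a lower learning rate."""
    # Setting trainable on the model propagates to nested layers,
    # including a backbone that was frozen during warm-up.
    model.trainable = True
    model.compile(
        optimizer=keras.optimizers.Adam(learning_rate=lr),
        loss="sparse_categorical_crossentropy",
        metrics=["accuracy"],
    )
    return model
```

A typical flow would be to load checkpoints/best.keras after the warm-up run, call the helper, and continue training for a few more epochs.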

🧩 Inference Snippet

import numpy as np
from tensorflow import keras
from tensorflow.keras.utils import load_img, img_to_array
from tensorflow.keras.applications.mobilenet import preprocess_input

model = keras.models.load_model("models/final.keras")
class_names = ["cats", "dogs"]  # or load from training dataset metadata

img = load_img("path/to/image.jpg", target_size=(224, 224))
x = img_to_array(img)[None, ...].astype("float32")
x = preprocess_input(x)
pred = model.predict(x)
print(class_names[np.argmax(pred[0])], float(np.max(pred[0])))

🧰 Reproducibility

Seeds are set where appropriate; exact determinism may still vary by platform/BLAS/GPU.

For paper-grade reproducibility, pin package versions and record OS/driver details.
