Commits · nateholland.bsky.social/PoseDetection · Tangled

nateholland.bsky.social / PoseDetection

0

This repository has no description

0

Commits

Author

Commit

Message

Date

virtualintern +1

af9479b8

feat: use camera input aspect ratio for object detection claude/experiment/rect-models-handover

Adds a 4:3 rectangular detection path on iOS that mirrors the Android
v4.11.0 letterbox preprocessing — instead of feeding square frames to
Vision and letting it center-crop away the sides, the detector now
letterboxes the source frame into the model's native aspect ratio (e.g.
512×384, 640×480, or 960×736) and decodes the model output back to
original-image coordinates. The model's input dimensions are inferred
from a `_<W>x<H>` filename suffix on the bundled `.mlmodelc`, so a new
rect model can be dropped in with no code changes.

iOS detector
- ImageDetector.ios.kt + FrameProcessor.analyseBufferForAll: handles
both Vision-pipeline output (VNRecognizedObjectObservation, used by
classic yolo11 CoreML pipelines) and raw multiarray output
(VNCoreMLFeatureValueObservation with shape [1, 300, 6], used by
ultralytics' yolo26 end2end CoreML export). Coordinates from the
end2end output are in pixel space of the model input and are
normalized by the model dimensions before mapping to the oriented
source frame.
- CustomObjectModel.ios.kt: parses the input width/height from the
model's filename (`yolo26n_v11_rect_512x384` → 512×384). Models
without the suffix get (0, 0) and skip letterboxing — preserves
prior Vision-default behavior for square models.
- Sample app picks up `imageCropAndScaleOption = ScaleFit` as a
belt-and-suspenders so Vision doesn't double-crop a frame whose
aspect already matches the model.

Sample app + experiment harness
- iOSApp.swift parses `-test_model`, `-test_duration_sec`,
`-start_at_wall_ms`, `-finish_on_stop` launch args and threads them
through MainViewControllerWithAutoSpec → LocalExperimentAutoSpec
CompositionLocal. Enables unattended back-to-back model captures via
`xcrun devicectl device process launch`.
- ExperimentLogger.ios.kt writes per-frame detection JSON to
NSDocumentDirectory/experiment_logs/ in the same schema
tools/compare_logs.py consumes; pull via `pymobiledevice3 apps afc`.
- ExperimentAuto.ios.kt logs progress via NSLog and exits the app on
finish (so back-to-back captures cold-start cleanly).
- App.kt: replace System.currentTimeMillis() with
Clock.System.now().toEpochMilliseconds() so commonMain compiles for
iOS.

Bundled rect CoreML models for the sample app
- yolo26n_v11_rect_512x384.mlpackage (val mAP50 = 0.800)
- yolo26n_v11_rect_640x480.mlpackage (val mAP50 = 0.840)
- yolo26n_v11_rect_960x736.mlpackage (val mAP50 = 0.870)

Library version bump: 4.11.1 → 4.12.0.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2mo ago

nate

825b4c30

fix: sample app bug

2mo ago

nate

47ac2c48

chore: version bump

2mo ago

76bf1693

Merge remote-tracking branch 'origin/master' into claude/feature/android-improvements claude/feature/android-improvements

2mo ago

virtualintern +1

c33ec7d6

perf: replace MediaMetadataRetriever with MediaCodec sequential decoder

2mo ago

virtualintern +1

b42adef6

perf: add NNAPI delegate fallback for TFLite object detection

2mo ago

virtualintern +1

83f417b0

perf: parallelize pose + object detection and reduce throttle frequency

2mo ago

virtualintern +1

cfab42ed

perf: replace software video encoding with CameraX VideoCapture

2mo ago

virtualintern +1

8f4a2add

perf: replace MediaMetadataRetriever with MediaCodec sequential decoder claude/feature/camerax-video-capture

2mo ago

virtualintern +1

18fc9c34

perf: add NNAPI delegate fallback for TFLite object detection

2mo ago

virtualintern +1

32e7cc50

perf: parallelize pose + object detection and reduce throttle frequency

2mo ago

virtualintern +1

7a88017f

perf: replace software video encoding with CameraX VideoCapture

2mo ago

nate

3efaf6ae

feat: auto detect models and allow user to switch between them in sample app

3mo ago

nate

ccf4dc85

fix: io object detection in non natural orientations

4mo ago

nate

6a535639

fix: use new retrieval of models

4mo ago

nate

6d819964

refactor: consistency in buffer input

4mo ago

nate

10283ead

fix: model recreation

4mo ago

nate

cff2b173

fix: get metadata from json

4mo ago

nate

b258757e

refactor: base models

4mo ago

nate

9a78faf3

refactor: update versions

4mo ago

nate

1add0317

feat: sample app change detect type

4mo ago

nate

ca92e5a2

refactor: allow posibiliy of gpu

4mo ago

nate

0913646a

fix: fix sample app UI

4mo ago

nate

1b841478

feat: toggle for preview window size

4mo ago

nate

18134bd8

fix: broken focus area

4mo ago

nate

2deb8b9f

feat: switch between ultra wide and regular camera

4mo ago

nate

9c154d04

fix: do not recreate model constantly

4mo ago

nate

28c66ab9

feat: get labels from metadata

4mo ago

nate

9ff107e5

fix: pose detection in frame analysis

4mo ago

nate

beb1e6fc

fix: pose detection broken

4mo ago

nate

976a1e26

fix: improve efficiency

4mo ago

nate

1b023a93

fix: coordinate mapping

4mo ago

nate

ef32d30b

feat: tensor interpretation for yolo models

4mo ago

nate

6ecc1e5d

refactor: single reference to models in sample app

4mo ago

nate

d085ff15

chore: update version in README

5mo ago

nate

5215473a

Merge branch 'labels'

5mo ago

nate

47014b33

chore: android use wide angle camera

5mo ago

nate

149dbc99

feat: ios use wide angle camera

5mo ago

nate

f5a7d093

fix: ios camera flickering on toggle

5mo ago

nate

346c9137

fix: ios camera hanging

5mo ago

nate

d3180ac5

fix: android labels not using colour correctly

5mo ago

nate

6ad83306

fix: draw correct colour labels

5mo ago

nate

482b27c7

feat: draw text labels in drawobjects

5mo ago

nate

0cf98462

feat: ios use wide angle camera

5mo ago

nate

a83e0301

fix: ios camera flickering on toggle

5mo ago

nate

0c9bd93f

fix: ios camera hanging

5mo ago

nate

72c44a90

fix: android labels not using colour correctly

5mo ago

nate

2efc5d15

fix: draw correct colour labels

5mo ago

nate

4a7b98b0

feat: draw text labels in drawobjects

5mo ago

nate

9fafb791

fix: handle ios model parsing error

7mo ago

nate

5ca48483

chore:bump version

8mo ago

d0f51073

fix: Camera frozen when coming back from background

8mo ago

a82cc15c

fix: ios camera rotation is correct, can be vertical/horizontal

8mo ago

a630202f

feat: safe area on posedetection app sample

8mo ago

e9b2a69a

chore: update version

9mo ago

nate

09413434

feat: function for requesting camera data

9mo ago

nate

cdf92862

chore: checking

9mo ago

nate

59292795

fix: retain buffer

9mo ago

9f18d265

chore: update version

9mo ago

654bade6

fix: handle device rotation

9mo ago

feat: use camera input aspect ratio for object detection claude/experiment/rect-models-handover

Adds a 4:3 rectangular detection path on iOS that mirrors the Android
v4.11.0 letterbox preprocessing — instead of feeding square frames to
Vision and letting it center-crop away the sides, the detector now
letterboxes the source frame into the model's native aspect ratio (e.g.
512×384, 640×480, or 960×736) and decodes the model output back to
original-image coordinates. The model's input dimensions are inferred
from a `_<W>x<H>` filename suffix on the bundled `.mlmodelc`, so a new
rect model can be dropped in with no code changes.

iOS detector
- ImageDetector.ios.kt + FrameProcessor.analyseBufferForAll: handles
both Vision-pipeline output (VNRecognizedObjectObservation, used by
classic yolo11 CoreML pipelines) and raw multiarray output
(VNCoreMLFeatureValueObservation with shape [1, 300, 6], used by
ultralytics' yolo26 end2end CoreML export). Coordinates from the
end2end output are in pixel space of the model input and are
normalized by the model dimensions before mapping to the oriented
source frame.
- CustomObjectModel.ios.kt: parses the input width/height from the
model's filename (`yolo26n_v11_rect_512x384` → 512×384). Models
without the suffix get (0, 0) and skip letterboxing — preserves
prior Vision-default behavior for square models.
- Sample app picks up `imageCropAndScaleOption = ScaleFit` as a
belt-and-suspenders so Vision doesn't double-crop a frame whose
aspect already matches the model.

Sample app + experiment harness
- iOSApp.swift parses `-test_model`, `-test_duration_sec`,
`-start_at_wall_ms`, `-finish_on_stop` launch args and threads them
through MainViewControllerWithAutoSpec → LocalExperimentAutoSpec
CompositionLocal. Enables unattended back-to-back model captures via
`xcrun devicectl device process launch`.
- ExperimentLogger.ios.kt writes per-frame detection JSON to
NSDocumentDirectory/experiment_logs/ in the same schema
tools/compare_logs.py consumes; pull via `pymobiledevice3 apps afc`.
- ExperimentAuto.ios.kt logs progress via NSLog and exits the app on
finish (so back-to-back captures cold-start cleanly).
- App.kt: replace System.currentTimeMillis() with
Clock.System.now().toEpochMilliseconds() so commonMain compiles for
iOS.

Bundled rect CoreML models for the sample app
- yolo26n_v11_rect_512x384.mlpackage (val mAP50 = 0.800)
- yolo26n_v11_rect_640x480.mlpackage (val mAP50 = 0.840)
- yolo26n_v11_rect_960x736.mlpackage (val mAP50 = 0.870)

Library version bump: 4.11.1 → 4.12.0.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

af9479b8

virtualintern +1

2mo

fix: sample app bug

825b4c30

nate

2mo

chore: version bump

47ac2c48

nate

2mo

Merge remote-tracking branch 'origin/master' into claude/feature/android-improvements claude/feature/android-improvements

76bf1693

virtualintern

2mo

perf: replace MediaMetadataRetriever with MediaCodec sequential decoder

c33ec7d6

virtualintern +1

2mo

perf: add NNAPI delegate fallback for TFLite object detection

b42adef6

virtualintern +1

2mo

perf: parallelize pose + object detection and reduce throttle frequency

83f417b0

virtualintern +1

2mo

perf: replace software video encoding with CameraX VideoCapture

cfab42ed

virtualintern +1

2mo

perf: replace MediaMetadataRetriever with MediaCodec sequential decoder claude/feature/camerax-video-capture

8f4a2add

virtualintern +1

2mo

perf: add NNAPI delegate fallback for TFLite object detection

18fc9c34

virtualintern +1

2mo

perf: parallelize pose + object detection and reduce throttle frequency

32e7cc50

virtualintern +1

2mo

perf: replace software video encoding with CameraX VideoCapture

7a88017f

virtualintern +1

2mo

feat: auto detect models and allow user to switch between them in sample app

3efaf6ae

nate

3mo

fix: io object detection in non natural orientations

ccf4dc85

nate

4mo

fix: use new retrieval of models

6a535639

nate

4mo

refactor: consistency in buffer input

6d819964

nate

4mo

fix: model recreation

10283ead

nate

4mo

fix: get metadata from json

cff2b173

nate

4mo

refactor: base models

b258757e

nate

4mo

refactor: update versions

9a78faf3

nate

4mo

feat: sample app change detect type

1add0317

nate

4mo

refactor: allow posibiliy of gpu

ca92e5a2

nate

4mo

fix: fix sample app UI

0913646a

nate

4mo

feat: toggle for preview window size

1b841478

nate

4mo

fix: broken focus area

18134bd8

nate

4mo

feat: switch between ultra wide and regular camera

2deb8b9f

nate

4mo

fix: do not recreate model constantly

9c154d04

nate

4mo

feat: get labels from metadata

28c66ab9

nate

4mo

fix: pose detection in frame analysis

9ff107e5

nate

4mo

fix: pose detection broken

beb1e6fc

nate

4mo

fix: improve efficiency

976a1e26

nate

4mo

fix: coordinate mapping

1b023a93

nate

4mo

feat: tensor interpretation for yolo models

ef32d30b

nate

4mo

refactor: single reference to models in sample app

6ecc1e5d

nate

4mo

chore: update version in README

d085ff15

nate

5mo

Merge branch 'labels'

5215473a

nate

5mo

chore: android use wide angle camera

47014b33

nate

5mo

feat: ios use wide angle camera

149dbc99

nate

5mo

fix: ios camera flickering on toggle

f5a7d093

nate

5mo

fix: ios camera hanging

346c9137

nate

5mo

fix: android labels not using colour correctly

d3180ac5

nate

5mo

fix: draw correct colour labels

6ad83306

nate

5mo

feat: draw text labels in drawobjects

482b27c7

nate

5mo

feat: ios use wide angle camera

0cf98462

nate

5mo

fix: ios camera flickering on toggle

a83e0301

nate

5mo

fix: ios camera hanging

0c9bd93f

nate

5mo

fix: android labels not using colour correctly

72c44a90

nate

5mo

fix: draw correct colour labels

2efc5d15

nate

5mo

feat: draw text labels in drawobjects

4a7b98b0

nate

5mo

fix: handle ios model parsing error

9fafb791

nate

7mo

chore:bump version

5ca48483

nate

8mo

fix: Camera frozen when coming back from background

d0f51073

Nuria

8mo

fix: ios camera rotation is correct, can be vertical/horizontal

a82cc15c

florian-kima

8mo

feat: safe area on posedetection app sample

a630202f

florian-kima

8mo

chore: update version

e9b2a69a

nathan holland

9mo

feat: function for requesting camera data

09413434

nate

9mo

chore: checking

cdf92862

nate

9mo

fix: retain buffer

59292795

nate

9mo

chore: update version

9f18d265

nathan holland

9mo

fix: handle device rotation

654bade6

nathan holland

9mo

Next