Commits · nateholland.bsky.social/PoseDetection · Tangled

nateholland.bsky.social / PoseDetection


Commits

Author

Commit

Message

Date

virtualintern +1

fde9edd8

chore: gate signAllPublications() on signingEnabled property

2mo ago

virtualintern +1

91bffd93

feat: iOS MLKit pose + split EXIF + skeleton extensions (v4.15.0)

2mo ago

virtualintern +1

e9acbfb8

chore: bump pose-detection library to 4.14.0 (ref: release-4.14.0)

2mo ago

virtualintern +1

44b8db92

chore(sample): remove experiment mode UI from the sample app

2mo ago

virtualintern +1

a1f305ae

feat(sample): log skeleton events in experiment dumps

2mo ago

virtualintern +1

4a9b0175

feat: PoseFocusMode.CROP_FOLLOW — dynamic follow-the-player crop

2mo ago

virtualintern +1

da0d404c

feat(android): higher pose input res in CROP + conf filter + smoothing

2mo ago

virtualintern +1

b7c63078

feat: PoseFocusMode { MASK, CROP } for focus-area pose input

2mo ago

virtualintern +1

a538c4c3

feat(android): swap MLKit Pose to the Accurate detector

2mo ago

virtualintern +1

ffdd5e9e

chore: bump version to 4.13.0 (ref: release-4.13.0)

2mo ago

virtualintern +1

a405c0c0

perf: parallel pose+object detection + CPU+XNNPACK default on Android

2mo ago

nate

ad700278

Merge branch 'release-4.12.0' into release-4.12.1 (ref: release-4.12.1)

2mo ago

nate

3f6e538f

chore: bump version (ref: release-4.12.0)

2mo ago

virtualintern +1

c08f22d0

release: 4.12.1 — iOS rect-aspect detection (multiarray decode)

2mo ago

nate

0cc3aa52

Merge remote-tracking branch 'refs/remotes/origin/claude/experiment/rect-models-handover' into release-4.12.0

2mo ago

virtualintern +1

af9479b8

feat: use camera input aspect ratio for object detection (ref: claude/experiment/rect-models-handover)

Adds a 4:3 rectangular detection path on iOS that mirrors the Android
v4.11.0 letterbox preprocessing — instead of feeding square frames to
Vision and letting it center-crop away the sides, the detector now
letterboxes the source frame into the model's native aspect ratio (e.g.
512×384, 640×480, or 960×736) and decodes the model output back to
original-image coordinates. The model's input dimensions are inferred
from a `_<W>x<H>` filename suffix on the bundled `.mlmodelc`, so a new
rect model can be dropped in with no code changes.
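
The letterbox round trip described above can be sketched as follows. This is an illustrative Python sketch, not the repository's Kotlin code: the names `Letterbox`, `letterbox`, and `to_source` are hypothetical, and centered padding is an assumption (the commit message does not say where the padding goes).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Letterbox:
    scale: float  # uniform scale applied to the source frame
    pad_x: float  # padding in model pixels on the left (and right)
    pad_y: float  # padding in model pixels on the top (and bottom)


def letterbox(src_w: int, src_h: int, dst_w: int, dst_h: int) -> Letterbox:
    """Fit a src_w x src_h frame inside dst_w x dst_h without cropping."""
    # The smaller ratio keeps the whole frame visible.
    scale = min(dst_w / src_w, dst_h / src_h)
    # Assumption: leftover space is split evenly on both sides.
    pad_x = (dst_w - src_w * scale) / 2
    pad_y = (dst_h - src_h * scale) / 2
    return Letterbox(scale, pad_x, pad_y)


def to_source(box, lb: Letterbox):
    """Map an (x1, y1, x2, y2) box from model-input pixels to source pixels."""
    x1, y1, x2, y2 = box
    return (
        (x1 - lb.pad_x) / lb.scale,
        (y1 - lb.pad_y) / lb.scale,
        (x2 - lb.pad_x) / lb.scale,
        (y2 - lb.pad_y) / lb.scale,
    )
```

For a 16:9 frame fed to a 640×480 model, `letterbox(1920, 1080, 640, 480)` pads only vertically; a 4:3 frame needs no padding at all, which is the case the rect models target.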

iOS detector
- ImageDetector.ios.kt + FrameProcessor.analyseBufferForAll: handles
both Vision-pipeline output (VNRecognizedObjectObservation, used by
classic yolo11 CoreML pipelines) and raw multiarray output
(VNCoreMLFeatureValueObservation with shape [1, 300, 6], used by
ultralytics' yolo26 end2end CoreML export). Coordinates from the
end2end output are in pixel space of the model input and are
normalized by the model dimensions before mapping to the oriented
source frame.
- CustomObjectModel.ios.kt: parses the input width/height from the
model's filename (`yolo26n_v11_rect_512x384` → 512×384). Models
without the suffix get (0, 0) and skip letterboxing — preserves
prior Vision-default behavior for square models.
- Sample app sets `imageCropAndScaleOption = ScaleFit` as a
belt-and-suspenders measure so Vision doesn't double-crop a frame whose
aspect already matches the model.
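
A minimal sketch of the two detector steps listed above: reading the input size from the filename suffix, then normalizing raw end2end rows. It is written in Python for illustration and is not the code in CustomObjectModel.ios.kt or ImageDetector.ios.kt. The function names are hypothetical. The row layout `(x1, y1, x2, y2, score, class_id)` and the `min_score` threshold are assumptions; the commit message only states the `[1, 300, 6]` shape and that coordinates are in model-input pixels.

```python
import re
from typing import Dict, List, Sequence, Tuple

# Matches a trailing "_<W>x<H>", optionally followed by a CoreML extension.
_SIZE_SUFFIX = re.compile(r"_(\d+)x(\d+)(?:\.(?:mlmodelc|mlpackage))?$")


def input_size_from_name(model_name: str) -> Tuple[int, int]:
    """Return (width, height) from the suffix, or (0, 0) when it is absent."""
    match = _SIZE_SUFFIX.search(model_name)
    if match is None:
        return (0, 0)  # caller skips letterboxing for square models
    return (int(match.group(1)), int(match.group(2)))


def decode_rows(
    rows: Sequence[Sequence[float]],
    model_w: int,
    model_h: int,
    min_score: float = 0.25,  # assumed threshold, not taken from the commit
) -> List[Dict]:
    """Normalize pixel-space rows to the 0..1 range of the model input."""
    detections = []
    for x1, y1, x2, y2, score, class_id in rows:
        if score < min_score:
            continue  # drop low-confidence and padding rows
        detections.append({
            "box": (x1 / model_w, y1 / model_h, x2 / model_w, y2 / model_h),
            "score": score,
            "label": int(class_id),
        })
    return detections
```

Mapping the normalized boxes onto the oriented source frame is a separate step and is not shown here.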

Sample app + experiment harness
- iOSApp.swift parses `-test_model`, `-test_duration_sec`,
`-start_at_wall_ms`, `-finish_on_stop` launch args and threads them
through MainViewControllerWithAutoSpec → LocalExperimentAutoSpec
CompositionLocal. Enables unattended back-to-back model captures via
`xcrun devicectl device process launch`.
- ExperimentLogger.ios.kt writes per-frame detection JSON to
NSDocumentDirectory/experiment_logs/ in the same schema
tools/compare_logs.py consumes; pull via `pymobiledevice3 apps afc`.
- ExperimentAuto.ios.kt logs progress via NSLog and exits the app on
finish (so back-to-back captures cold-start cleanly).
- App.kt: replace System.currentTimeMillis() with
Clock.System.now().toEpochMilliseconds() so commonMain compiles for
iOS.
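
The launch-argument handling can be pictured with a small parser. This is a hedged Python sketch, not the Swift code in iOSApp.swift. The commit message names the four flags but not their grammar, so the sketch assumes `-key value` pairs and treats a flag with no following value as a boolean. `parse_launch_args` is a hypothetical name.

```python
from typing import Dict, List, Union

# Flag names taken from the commit message; everything else is ignored.
KNOWN_FLAGS = ("test_model", "test_duration_sec", "start_at_wall_ms", "finish_on_stop")


def parse_launch_args(argv: List[str]) -> Dict[str, Union[str, bool]]:
    """Collect known single-dash flags from a process argument list."""
    spec: Dict[str, Union[str, bool]] = {}
    i = 0
    while i < len(argv):
        arg = argv[i]
        key = arg[1:] if arg.startswith("-") else None
        if key in KNOWN_FLAGS:
            # Assumption: a value follows unless the next token is another flag.
            has_value = i + 1 < len(argv) and not argv[i + 1].startswith("-")
            spec[key] = argv[i + 1] if has_value else True
            i += 2 if has_value else 1
        else:
            i += 1  # skip the executable path and unrelated arguments
    return spec
```

Values are kept as strings; converting `test_duration_sec` or `start_at_wall_ms` to numbers would happen where the spec is consumed.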

Bundled rect CoreML models for the sample app
- yolo26n_v11_rect_512x384.mlpackage (val mAP50 = 0.800)
- yolo26n_v11_rect_640x480.mlpackage (val mAP50 = 0.840)
- yolo26n_v11_rect_960x736.mlpackage (val mAP50 = 0.870)

Library version bump: 4.11.1 → 4.12.0.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2mo ago

virtualintern +1

dc0d7b01

release: v4.11.0 - letterbox preprocessing, camera 4:3 pin, GPU delegate logging (ref: release-4.11.0)

2mo ago

virtualintern +1

fd3b5646

release: v4.10.0 - timestamp alignment, staggered detection, video improvements (ref: release-4.10.0)

2mo ago

nate

825b4c30

fix: sample app bug

2mo ago

nate

47ac2c48

chore: version bump

2mo ago

virtualintern

76bf1693

Merge remote-tracking branch 'origin/master' into claude/feature/android-improvements (ref: claude/feature/android-improvements)

2mo ago

virtualintern +1

c33ec7d6

perf: replace MediaMetadataRetriever with MediaCodec sequential decoder

2mo ago

virtualintern +1

b42adef6

perf: add NNAPI delegate fallback for TFLite object detection

2mo ago

virtualintern +1

83f417b0

perf: parallelize pose + object detection and reduce throttle frequency

2mo ago

virtualintern +1

cfab42ed

perf: replace software video encoding with CameraX VideoCapture

2mo ago

virtualintern +1

8f4a2add

perf: replace MediaMetadataRetriever with MediaCodec sequential decoder (ref: claude/feature/camerax-video-capture)

2mo ago

virtualintern +1

18fc9c34

perf: add NNAPI delegate fallback for TFLite object detection

2mo ago

virtualintern +1

32e7cc50

perf: parallelize pose + object detection and reduce throttle frequency

2mo ago

virtualintern +1

7a88017f

perf: replace software video encoding with CameraX VideoCapture

2mo ago

nate

3efaf6ae

feat: auto detect models and allow user to switch between them in sample app

3mo ago

nate

ccf4dc85

fix: io object detection in non natural orientations

4mo ago

nate

6a535639

fix: use new retrieval of models

4mo ago

nate

6d819964

refactor: consistency in buffer input

4mo ago

nate

10283ead

fix: model recreation

4mo ago

nate

cff2b173

fix: get metadata from json

4mo ago

nate

b258757e

refactor: base models

4mo ago

nate

9a78faf3

refactor: update versions

4mo ago

nate

1add0317

feat: sample app change detect type

4mo ago

nate

ca92e5a2

refactor: allow possibility of gpu

4mo ago

nate

0913646a

fix: fix sample app UI

4mo ago

nate

1b841478

feat: toggle for preview window size

4mo ago

nate

18134bd8

fix: broken focus area

4mo ago

nate

2deb8b9f

feat: switch between ultra wide and regular camera

4mo ago

nate

9c154d04

fix: do not recreate model constantly

4mo ago

nate

28c66ab9

feat: get labels from metadata

4mo ago

nate

9ff107e5

fix: pose detection in frame analysis

4mo ago

nate

beb1e6fc

fix: pose detection broken

4mo ago

nate

976a1e26

fix: improve efficiency

4mo ago

nate

1b023a93

fix: coordinate mapping

4mo ago

nate

ef32d30b

feat: tensor interpretation for yolo models

4mo ago

nate

6ecc1e5d

refactor: single reference to models in sample app

4mo ago

nate

d085ff15

chore: update version in README

5mo ago

nate

5215473a

Merge branch 'labels'

5mo ago

nate

47014b33

chore: android use wide angle camera

5mo ago

nate

149dbc99

feat: ios use wide angle camera

5mo ago

nate

f5a7d093

fix: ios camera flickering on toggle

5mo ago

nate

346c9137

fix: ios camera hanging

5mo ago

nate

d3180ac5

fix: android labels not using colour correctly

5mo ago

nate

6ad83306

fix: draw correct colour labels

5mo ago

nate

482b27c7

feat: draw text labels in drawobjects

5mo ago
