lip dubbing ai music video -> read lips -> detect phonemes -> generate audio using jamify [wip] modal stack uv for dependency management, see utils/pyproject.toml