Exclude fragmented Sub Codecs from DASH UTF-8 checks

Chardet was detecting these segments as a mixture of mostly cp1252 and MacRoman encodings, when they should be left as-is during parsing. The actual text inside may still need to go through `try_ensure_utf8` when parsed, but the box as a whole should not.
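
To illustrate why the whole box should be left alone, here is a minimal sketch, assuming a simple ISO BMFF-style box layout (4-byte big-endian size, 4-byte type, then payload). `build_box` and `naive_reencode` are hypothetical helpers written for this example. `naive_reencode` is not the project's `try_ensure_utf8`; it only decodes with a guessed code page and re-encodes as UTF-8.

```python
import struct


def build_box(box_type: bytes, payload: bytes) -> bytes:
    """Build a minimal box: 4-byte big-endian size, 4-byte type, payload."""
    return struct.pack(">I", 8 + len(payload)) + box_type + payload


def naive_reencode(data: bytes, guessed_encoding: str = "cp1252") -> bytes:
    """Hypothetical detect-and-convert step: decode as a legacy code page,
    then re-encode as UTF-8. Assumed behaviour for illustration only."""
    return data.decode(guessed_encoding, errors="replace").encode("utf-8")


# Binary payload containing bytes >= 0x80, as is common in MP4 fragments.
box = build_box(b"mdat", bytes([0x00, 0xE9, 0xFF, 0x80]))
converted = naive_reencode(box)

# The size field still holds the original length, but every high byte
# has expanded into a multi-byte UTF-8 sequence.
declared_size = struct.unpack(">I", converted[:4])[0]
print(declared_size, len(converted))
```

The declared box size no longer matches the actual byte length after conversion, so a parser reading the converted data would misplace the box boundaries.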
2023-12-02 17:44:47 +00:00 · e87de50940
parent 0be62541ba
commit e87de50940
1 changed file with 4 additions and 1 deletion
--- a/devine/core/manifests/dash.py
+++ b/devine/core/manifests/dash.py
@@ -473,7 +473,10 @@ class DASH:
                for segment_file in sorted(save_dir.iterdir()):
                    segment_data = segment_file.read_bytes()
                    # TODO: fix encoding after decryption?
-                    if not drm and isinstance(track, Subtitle):
+                    if (
+                        not drm and isinstance(track, Subtitle) and
+                        track.codec not in (Subtitle.Codec.fVTT, Subtitle.Codec.fTTML)
+                    ):
                        segment_data = try_ensure_utf8(segment_data)
                    f.write(segment_data)
                    segment_file.unlink()
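
The condition added in this hunk can be sketched as a standalone predicate. This is a minimal sketch, not the project's code: the `Codec` enum is a hypothetical stand-in for `Subtitle.Codec` (only the member names `fVTT` and `fTTML` come from the diff), and `should_ensure_utf8` is a hypothetical helper name.

```python
from enum import Enum


class Codec(Enum):
    # Hypothetical stand-in for Subtitle.Codec. Only fVTT and fTTML are
    # taken from the diff; the other members are assumed for illustration.
    SubRip = "SRT"
    WebVTT = "VTT"
    fVTT = "fVTT"
    fTTML = "fTTML"


# Fragmented codecs arrive as binary boxes and must not be re-encoded.
FRAGMENTED_CODECS = (Codec.fVTT, Codec.fTTML)


def should_ensure_utf8(has_drm: bool, is_subtitle: bool, codec: Codec) -> bool:
    """Return True only for unencrypted, non-fragmented subtitle segments."""
    return not has_drm and is_subtitle and codec not in FRAGMENTED_CODECS


print(should_ensure_utf8(False, True, Codec.WebVTT))
print(should_ensure_utf8(False, True, Codec.fVTT))
```

Keeping the excluded codecs in a single tuple makes it easier to extend the list if another boxed subtitle format needs the same treatment.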