Previously, it would show the download as fully complete after the first 1024-byte chunk was downloaded, as the progress bar's total value was set to the number of URLs. This is because it assumed there would be multiple URLs to download at once, advancing the progress bar each time one of those downloads completed.
This changes it so that if there's only one URL to download, the total number of chunks to download is calculated instead, which corrects the progress bar advances.
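A minimal sketch of the idea, assuming a requests-style streamed response and rich's progress bar; the names and chunk size are illustrative, not devine's actual code:

```python
import math

import requests
from rich.progress import Progress

CHUNK_SIZE = 1024  # illustrative; matches the chunk size mentioned above

def download_one(url: str, path: str) -> None:
    response = requests.get(url, stream=True)
    content_length = int(response.headers.get("Content-Length") or 0)
    # size the bar by chunk count, not by URL count (which would be 1 here)
    total = math.ceil(content_length / CHUNK_SIZE) if content_length else None
    with Progress() as progress:
        task = progress.add_task("Downloading", total=total)
        with open(path, "wb") as f:
            for chunk in response.iter_content(CHUNK_SIZE):
                f.write(chunk)
                progress.advance(task)
```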
It normally auto-detects the format from the file extension. The supported formats are "MP4" and "WEBM". The input files to shaka-packager are currently always ".mp4", so this isn't particularly an issue.
However, I want to add this as a precaution in case that ever changes. It isn't a problem if the input file is another format, like WEBM, as this only controls the output format (the format devine wants), not the input format.
To possibly support download resuming in the future, the file names used by the decrypt, repack, and change-range functions were simplified; once output has finished, the original input file is deleted and its path is re-used.
The file names were changed to just append `_repack`, `_decrypted`, `_full_range`, etc. to the filename rather than using a double extension (`.repack.mp4`, `.decrypted.mp4`, `.range0.mp4`).
This is all so that the code checking whether a file was already downloaded can be simpler. Instead of having to check whether four different possible file names for a completed download exist, it only has to check one, as sketched below.
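For illustration, a hypothetical sketch of the suffixed naming and the resulting one-path check (`stage_path` is made up for this example, not devine's API):

```python
from pathlib import Path

def stage_path(save_path: Path, stage: str) -> Path:
    # hypothetical helper: "video.mp4" -> "video_decrypted.mp4",
    # rather than the old double-extension style "video.decrypted.mp4"
    return save_path.with_stem(f"{save_path.stem}_{stage}")

save_path = Path("video.mp4")
# since every stage deletes its input and re-uses the original path,
# a completed download is just one exists() check:
if save_path.exists():
    print("already downloaded, skipping")
```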
This issue is common with Now TV, where for some reason subtitles parse into "two" languages, "en" and "eng". This results in one empty caption list and one non-empty caption list, with the empty one tending to come first.
This causes a multitude of snowballing problems further down the codebase. For example, converting to SRT then produces a "MULTI-LANGUAGE SRT" header, which most programs (like mkvmerge) do not recognize, causing a mux failure.
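As an illustration of the general shape of the fix, a sketch that drops empty language entries, assuming pycaption's `CaptionSet` API; devine's actual handling may differ:

```python
from pycaption import CaptionSet

def drop_empty_languages(caption_set: CaptionSet) -> CaptionSet:
    # keep only languages that actually contain captions, so a bogus
    # empty "en" list doesn't sit alongside the real "eng" one
    captions = {
        lang: caption_set.get_captions(lang)
        for lang in caption_set.get_languages()
        if caption_set.get_captions(lang)
    }
    return CaptionSet(captions)
```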
The browser to imitate can be set in the config:
For example,
```yaml
curl_impersonate:
  browser: chrome110
```
It will default to using chrome110 if no value is set in the config.
A list of available browsers can be found here: https://github.com/yifeikong/curl_cffi#sessions
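Under the hood this maps onto curl_cffi's requests-compatible API, roughly like this (the URL is just an example):

```python
from curl_cffi import requests

# equivalent of "browser: chrome110" in the config above
session = requests.Session(impersonate="chrome110")
response = session.get("https://example.com")
print(response.status_code)
```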
This was originally done to prevent *all* aria2c logs unless on the last attempt, at which point, if every attempt had failed, aria2c would be allowed to log the error.
However, that's bad practice, as aria2c may produce warnings or errors on, say, the 3rd attempt, and that 3rd attempt may have otherwise succeeded with those warnings or errors. Suppressing them also generally shouldn't be necessary.
The original values would cause some services to block the download. It is therefore better to default to safer values. The new values match the defaults used by aria2c, as listed in its docs.
For downloads by devine, there's generally no reason to retrieve this information when the file will be decrypted, repacked, remuxed, and so on anyway. Requesting the timestamp just means more requests being made, potentially slowing down the download.
This was added by another team member a long time ago, seemingly to prevent splitting on DASH/HLS segment files, as they would already be quite small.
However, just because they are small doesn't make splitting a problem; a download is only split if the segment file size fits the default split size of 20M at least twice. I.e., a 45M segment will split into two downloads, while a 25M segment won't split at all. You might think 25M would split by 20M into two downloads, but the split size must fit fully for a split to happen: two downloads requires at least 40M, three requires 60M, four requires 80M, and so on (see the sketch below).
A 40M or bigger segment file does, in my opinion, deserve to be split, as it may genuinely reap speed benefits.
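A back-of-the-envelope sketch of that rule, assuming a split only happens when every piece is at least min-split-size big:

```python
def piece_count(file_size_mb: int, min_split_mb: int = 20) -> int:
    # a piece is only made if it can be at least min-split-size big
    return max(1, file_size_mb // min_split_mb)

for size_mb in (25, 40, 45, 60, 80):
    print(f"{size_mb}M -> {piece_count(size_mb)} download(s)")
# 25M -> 1, 40M -> 2, 45M -> 2, 60M -> 3, 80M -> 4
```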
If the server returns a Content-Length header with a value of 0, then the code shortly after it would end up looping over streamed response chunks of 0-length, which would go on forever.
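A sketch of the guard, using a plain requests-style download; the names and URL are illustrative, not devine's actual loop:

```python
import requests

response = requests.get("https://example.com/file.mp4", stream=True)  # example URL
content_length = int(response.headers.get("Content-Length") or 0)
if content_length <= 0:
    # nothing to stream; without this bail-out, a loop that keeps
    # reading 0-length chunks until a total is reached never finishes
    raise ValueError("Server reported a Content-Length of 0")
with open("file.mp4", "wb") as f:
    for chunk in response.iter_content(1024):
        f.write(chunk)
```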
This is seen in some manifests/services for whatever reason. I can't find documentation for this value anywhere; it seems unused in official specifications as of right now. However, in some services/places it appears to be unofficially used as a PAL version of the BT.601 transfer, which makes sense.
Devine's code (and other services) wouldn't care about the difference here, so currently it is just implemented as a remap from 5 to 6. In the future it may be changed and actually defined as two separate BT_601 Transfer enum entries.
This also bypasses the warning log about the audio likely being part of an invariant playlist. While that may be true, it is too specific a warning when there could be multiple other reasons for it.
Ok, so there are a few reasons this was done.
1) Design-wise, it isn't valid to have --proxy (or a proxy set via config or otherwise) set a proxy, then unpredictably have it bypassed or disabled. If I specify `--proxy 127.0.0.1:8080`, I would expect it to use that proxy for all communication indefinitely, not switch in and out depending on the track or service.
2) Following from reason 1, it's also a security problem. The only reason I implemented it in the first place was so I could download faster on my home connection. This means I would authenticate and call APIs through a proxy, then suddenly download manifests, segments, etc. over my home connection. A competent service could see that as an indicator of foul play and flag you.
3) Maintaining this setup across the codebase is extremely annoying, especially because of how proxies are set up and used by Requests Sessions. There's no way to tell a Requests Session to temporarily disable the proxy and turn it back on later without getting the proxy from the session (in an annoying way), storing it, removing it, making the calls, and then, assuming you're still in the same function, adding it back. If you're not in the same function, well, time for some spaghetti code. The dance looks roughly like the sketch below.
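To illustrate reason 3, the stash/clear/restore dance a Requests Session forces on you (the proxy address is hypothetical):

```python
import requests

session = requests.Session()
session.proxies.update({"all": "http://127.0.0.1:8080"})  # hypothetical proxy

# stash, clear, call, restore -- and hope you're still in the same function
saved_proxies = dict(session.proxies)
session.proxies.clear()
try:
    session.get("https://example.com/segment.mp4")  # goes over the home connection
finally:
    session.proxies.update(saved_proxies)
```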
---
tl;dr: -1 UX/design/expectations with the CLI, -1 security, -1 code maintenance, but only +1 for potentially increased download speeds in certain scenarios.
Services needing this done should apply it themselves, e.g. via OnMultiplex. A convenience function to do it is now available as `Subtitle.remove_multi_lang_srt_header()`, so you can do e.g., `subtitle.OnMultiplex = remove_multi_lang_srt_header` and the subtitle will pass through this function just before muxing.
On Windows it seems to default to some encoding other than UTF-8 (possibly UTF-16 or CP-1252), and since the chapter file is saved as UTF-8, this breaks characters outside the typical ASCII range, like ø, æ, and so on.
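The fix is to pin the encoding explicitly whenever the chapter file is read or written, e.g. (the path and content here are hypothetical):

```python
from pathlib import Path

chapters_path = Path("chapters.txt")  # hypothetical path

# Windows' locale-dependent default (e.g. cp1252) mangles ø, æ, etc.,
# so the encoding is pinned explicitly on both write and read:
chapters_path.write_text("CHAPTER01NAME=Prøve\n", encoding="utf-8")
text = chapters_path.read_text(encoding="utf-8")
```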
The default is still SubRip (SRT), but you can now change the output format to almost any of the available Codec options. There is not yet an option to leave each subtitle format as-is, i.e., if there's both an SRT and a WebVTT subtitle, leaving them both untouched.
Like always, you can configure a default in your config file, e.g.,
```yaml
dl:
  sub_format: vtt
```
Note though that SSA, SSAv4, fTTML, and fVTT are not yet supported. There are no plans to support fTTML or fVTT.
Chardet was detecting a mixture of mostly CP-1252 and MacRoman encodings, where the data should just be left as-is when parsing. The actual text within it may want to go through `try_ensure_utf8` when parsed, but not the entire box.
* Add option for automatic subtitle character encoding normalization
The rationale behind this function is that some services use ISO-8859-1
(latin1) or Windows-1252 (CP-1252) instead of UTF-8 encoding, whether
intentionally or accidentally. Some services even stream subtitles with
malformed/mixed encoding (each segment has a different encoding).
* Remove Subtitle parameter `auto_fix_encoding`
Just always attempt to fix encoding. If the subtitle is neither UTF-8 nor CP-1252, then it should realistically error out instead of producing garbage subtitle data anyway.
* Move Subtitle encoding fixing code out of if drm tree
* Use chardet as a last-ditch effort for fixing subs, or return the original data
* Move Subtitle.fix_encoding method to utilities as try_ensure_utf8 (sketched below)
* Add Shivelight as a contributor
---------
Co-authored-by: rlaphoenix <rlaphoenix@pm.me>
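For reference, a sketch of the strategy those bullet points describe; the actual `try_ensure_utf8` in devine's utilities may differ in detail:

```python
import chardet

def try_ensure_utf8(data: bytes) -> bytes:
    """Return the data re-encoded as UTF-8, if a safe decode is possible."""
    try:
        data.decode("utf-8")
        return data  # already valid UTF-8
    except UnicodeDecodeError:
        pass
    try:
        # CP-1252 is the most common non-UTF-8 offender for subtitles
        return data.decode("cp1252").encode("utf-8")
    except UnicodeDecodeError:
        pass
    # last-ditch effort: let chardet guess, otherwise return the data as-is
    detected = chardet.detect(data)
    if detected["encoding"]:
        try:
            return data.decode(detected["encoding"]).encode("utf-8")
        except (UnicodeDecodeError, LookupError):
            pass
    return data
```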
Note: There are some breaking changes here. If you manually worked with the Enum names here, then some of them have changed to better reflect their code points' usage.
Generally speaking it should not affect service code.