shaka-packager

Commit Graph

Author	SHA1	Message	Date
sr90	bb104fef5d	feat: get start number from muxer and specify initial sequence number (#879 ) Set the start number in representation to the segment index that is sent by muxer. With this enhancement, you can now specify the initial sequence number to be used on the generated segments when calling the packager. With the old implementation, it was always starting with "1". --------- Co-authored-by: Cosmin Stejerean <cstejerean@meta.com>	2024-05-02 13:25:49 -07:00
Torbjörn Einarson	4b5e80d02c	feat: teletext formatting (#1384 ) This PR adds parsing of teletext styling, and rendering of the styling in output TTML and WebVTT subtitle tracks. Beyond unit tests, I've used the sample https://drive.google.com/file/d/19ZYsoeUfH85gEilQkaAdLbPhC4CxhDEh/view?usp=sharing which has rather advanced subtitling with two separate rows at the same time, where one is left aligned and another is right aligned. This necessitates two parallel cues to be rendered. It also has some colored text. Solve #1335. ## parse teletext styling and formatting Extend the teletext parser to parse the teletext styling and formatting. This includes translating rows into regions, calculating alignment from start and stop position of the text, and extracting text and background colors. The colors are limited to full lines. Both lines and regions are propagated in the TextSample structures. This is because the number of lines may differ from different sources. For teletext, there are 24 rows, but they are essentially always used with double height, so the number of output lines is 12 from 0 to 11. There are also corresponding regions are denoted "ttx_R", where R is an integer row number. A renderer can use either the line number or the region ID to render the text. ## ttml generation for teletext to EBU-TT-D Add support to render teletext input in EBU-TT-D (IMSC-1) format. This includes appropriate regions ttx_0 to ttx_11 signalled in the TextSamples, alignment and text and background colors. The general TTML output has been changed to always include metadata, layout, and styling nodes, even if they are empty. EBU-TT-D is detected by the presence of "ttx_?" regions in the samples. If detected, extra TTML elements will be added and the EBU-TT-D linePadding used as well. Appropriate styles for background and text colors are generated depending on the color and backgroundColor attributes in the text fragments. ## adapt WebVTT output to teletext TextSample. Teletext input generates both a region with prefix ttx_ and a floating point line number (e.g. 9.5) in the range 0 to 11.5 (due to input 0-23 as double lines). The output is adopted to drop such regions and convert the line number to an integer since the standard only used floats for percent values but not for plain line numbers.	2024-04-29 10:33:03 -07:00
Anthony Lu	56440413aa	fix: use a better estimate of frame rate for cases with very short first sample durations (#838 ) Use the second sample in mp4 and webm formats. #835 had issues with merging due to golden file conflicts. Because we cannot make dependent pull requests, this is a replica of #835. --------- Signed-off-by: Cosmin Stejerean <cstejerean@meta.com> Co-authored-by: Cosmin Stejerean <cstejerean@meta.com>	2024-02-28 15:53:06 -08:00
Cosmin Stejerean	615720e7dd	fix: AudioSampleEntry size caluations due to bad merge (#1354 ) from ALAC pull request	2024-02-27 08:57:48 -08:00
wjywbs	b68ec87f6a	feat: Add support for ALAC codec (#1299 ) Co-authored-by: Cosmin Stejerean <cstejerean@meta.com>	2024-02-26 13:39:30 -08:00
Daniel Cantarín	89376d3c4d	feat: Allow LIVE UDP WebVTT input (#1349 ) An updated version of PR #1027 That previous PR was done using 2021 code, and there were many changes in the codebase from there, so a rebase was needed and also some minor tweak here and there. But it's the same code, just reimplemented on a newer codebase. If you want to take a look at this in action, after building shaka packager with this PR's code included, try this commands in 3 different simultaneous bash sessions: 1. Video UDP input: `ffmpeg -f lavfi -re -i "testsrc=s=320x240:r=30,format=yuv420p" -c:v h264 -sc_threshold 0 -g 30 -keyint_min 30 -r 30 -a53cc 1 -b:v 150k -preset ultrafast -r 30 -f mpegts "udp://127.0.0.1:10000?pkt_size=1316"` 2. WebVTT UDP input: `for sec in $(seq 0 9999) ; do printf "%02d:%02d.000 --> %02d:%02d.000\ntest second ${sec}\n\n" "$(( ${sec} / 60 ))" "$(( ${sec} % 60 ))" "$(( (${sec} + 1) / 60 ))" "$(( (${sec} + 1) % 60 ))" ; sleep 1 ; done > /dev/udp/127.0.0.1/12345` 3. shaka packager command line: `timeout 60 path/to/build/packager/packager 'in=udp://127.0.0.1:10000?timeout=8000000,stream_selector=0,init_segment=240_init.m4s,segment_template=240_$Number%09d$.m4s,bandwidth=150000' 'in=udp://127.0.0.1:12345?timeout=8000000,stream_selector=0,input_format=webvtt,format=webvtt+mp4,init_segment=text_init.m4s,segment_template=text_$Number%09d$.m4s,language=eng,dash_roles=subtitle' --mpd_output ./manifest.mpd --segment_duration 3.2 --suggested_presentation_delay 3.2 --min_buffer_time 3.2 --minimum_update_period 3.2 --time_shift_buffer_depth 60 --preserved_segments_outside_live_window 1 --default_language=eng --dump_stream_info 2>&1` Note the added `input_format=webvtt` to the shaka packager command's second selector. That's new from this PR. If you don't use that, shaka's format autodetection will not detect the webvtt format from the input, as explained in https://github.com/shaka-project/shaka-packager/issues/685#issuecomment-1029407191. Try the command without it if you want to. Fixes #685 Fixes #1017 --------- Co-authored-by: Daniel Cantarín <canta@canta.com.ar>	2024-02-23 16:02:19 -08:00
Cosmin Stejerean	71c175d4b8	feat: Add input support for EBU Teletext in MPEG-TS (#1344 ) Replaces #1181 * Add support for EBU Teletext input following Level 1.5 of the core specification ETSI EN 300 706 V1.2.1 (2003-04). * Add support for webvtt in MP4 segments output. Closes #272 --------- Co-authored-by: Marcus Spangenberg <marcus.spangenberg@eyevinn.se>	2024-02-23 15:31:48 -08:00
sr90	4aa4b4b9aa	feat: Add support for single file TS for HLS (#934 ) This is based on comments at https://github.com/google/shaka-packager/pull/891. The muxer is deciding whether to write to a single file or a segment file based on the configuration. Example: ``` ../packager 'in=TOS.ts,stream=video,output=tos_video.ts,playlist_name=tos_video.m3u8' \ 'in=TOS.ts,stream=audio,output=tos_audio.ts,playlist_name=tos_audio.m3u8' \ --hls_master_playlist_output tos.m3u8 ``` Tested the content using Exoplayer. --------- Co-authored-by: Cosmin Stejerean <cstejerean@meta.com>	2024-02-23 15:28:11 -08:00
Roy-Funderburk	07f780dae1	feat: This patch adds support for DTS:X Profile 2 audio in MP4 files. (#1303 ) feat: Added audio specific configuration udts box to AudioSampleEntry for MP4 input/output. DASH tags for DTS audio as specified in ETSI TS 103 491 and ETSI TS 102 114. Closes #1301 --------- Co-authored-by: Cosmin Stejerean <cstejerean@meta.com>	2024-02-14 23:03:03 -08:00
modernletter	c09eb831b8	feat: Parse MPEG-TS PMT ES language and maximum bitrate descriptors (#369 ) (#1311 ) Part of https://github.com/shaka-project/shaka-packager/issues/369 This adds read support for some MPEG-TS PMT elementary stream descriptors: - ISO639 Language Descriptor providing language code and audio type - Maximum Bitrate Descriptor providing peak stream bandwidth Those metadata are propagated to StreamInfo structures: - StreamInfo.language field - AudioStreamMetadata.max_bitrate field for audio streams - audio type is currently not propagated - corresponding field has to be added to AudioStreamMetadata Test vector file containing those descriptors is provided.	2024-02-08 11:58:26 -08:00
SteveR-PMP	2ba67bc24c	feat: default text zero bias (#1330 ) A positive value, in milliseconds. It is the threshold used to determine if we should assume that the text stream actually starts at time zero. If the first sample comes before default_text_zero_bias_ms, then the start will be padded as the stream is assumed to start at zero. If the first sample comes after default_text_zero_bias_ms then the start of the stream will not be padded as we cannot assume the start time of the stream.	2024-02-08 10:39:50 -08:00
Cosmin Stejerean	9b9adf38ff	test: fix fake clock for muxer for integration tests (#1322 ) The fix in #1289 was not complete and left the fake clock as null which didn't have any effect. This was revealed by integration tests showing mismatches in the timestamps in MP4.	2024-02-08 09:49:15 -08:00
Joey Parrish	3e71302ba4	feat!: Rewrite build system and third-party dependencies (#1310 ) This work was done over ~80 individual commits in the `cmake` branch, which are now being merged back into `main`. As a roll-up commit, it is too big to be reviewable, but each change was reviewed individually in context of the `cmake` branch. After this, the `cmake` branch will be renamed `cmake-porting-history` and preserved. --------- Co-authored-by: Geoff Jukes <geoffjukes@users.noreply.github.com> Co-authored-by: Bartek Zdanowski <bartek.zdanowski@gmail.com> Co-authored-by: Carlos Bentzen <cadubentzen@gmail.com> Co-authored-by: Dennis E. Mungai <2356871+Brainiarc7@users.noreply.github.com> Co-authored-by: Cosmin Stejerean <cstejerean@gmail.com> Co-authored-by: Carlos Bentzen <carlos.bentzen@bitmovin.com> Co-authored-by: Cosmin Stejerean <cstejerean@meta.com> Co-authored-by: Cosmin Stejerean <cosmin@offbytwo.com>	2023-12-01 09:32:19 -08:00
Caitlin O'Callaghan	f264befe86	feat: Write colr atom to muxed mp4 (#1261 ) This PR is an extension of the full AV1 codec string feature: [PR 1205](https://github.com/shaka-project/shaka-packager/pull/1205) and relates to [Issue 1007](https://github.com/shaka-project/shaka-packager/issues/1007) and [Issue 1202](https://github.com/shaka-project/shaka-packager/issues/1202). As per the AV1 spec, the codec string may contain optional color values. These color values are critical for detecting HDR video streams - see [Issue 1007](https://github.com/shaka-project/shaka-packager/issues/1007). Color information is extracted from the input mp4's `colr` atom and used to generate the full AV1 codec string. This PR preserves the color information by writing the `colr` atom to the muxed mp4. References: - [AV1 Codec ISO Media File Format Binding](https://aomediacodec.github.io/av1-isobmff/#codecsparam) - [AV1 Bitstream & Decoding Process Specification - Section 6.4.2 Color config semantics (page 117)](https://aomediacodec.github.io/av1-spec/av1-spec.pdf) - [QuickTime File Format Specification](https://developer.apple.com/library/archive/documentation/QuickTime/QTFF/QTFFChap3/qtff3.html#//apple_ref/doc/uid/TP40000939-CH205-125526)	2023-08-29 18:46:19 -07:00
Prakash Duggaraju	dcf32258ff	fix: Fix handling of non-interleaved multi track FMP4 files (#1214 ) Do not assume that each fragment contains all tracks. Use track id instead of index to pick the correct timestamp. Fixes #1213	2023-08-21 16:34:32 -07:00
Caitlin O'Callaghan	cc9a691aef	feat: Generate the entire AV1 codec string when the colr atom is present (#1205 ) As per the AV1 spec, the codec string may contain optional color values. This extracts the missing color information from the mp4 `colr` atom, if present, and generates the full AV1 codec string. Closes #1007	2023-08-04 09:00:59 -07:00
sr90	520926c27a	fix(MP4): Add compatible brand dby1 for Dolby content. (#1211 ) This PR adds dby1 compatible brand to dolby content as per https://professional.dolby.com/siteassets/content-creation/dolby-vision-for-content-creators/dolby_vision_bitstreams_within_the_iso_base_media_file_format_dec2017.pdf	2023-07-18 19:50:33 -07:00
Marcus Spangenberg	494769ca86	fix: TTML generator timestamp millisecond formatting (#1179 ) Fix bug where milliseconds were formatted with two digits instead of three, resulting in incorrect timestamps in TTML cues. Fixes #1180	2023-07-05 14:28:57 -07:00
Bartek Zdanowski	b221aa9caf	fix: Parse one frame mpeg-ts video (#1015 ) Closes #1013 Co-authored-by: Joey Parrish <joeyparrish@users.noreply.github.com>	2022-10-27 20:22:17 -07:00
Bartek Zdanowski	ab8ab12d09	fix: PTS diverge DTS when DTS close to 2pow33 and PTS more than 0 (#1050 ) Fixes #1049 Co-authored-by: Joey Parrish <joeyparrish@users.noreply.github.com>	2022-10-27 14:21:03 -07:00
Vishal Shah	b9d477b969	fix: webvtt single cue do not fail on EOS (#1061 ) While Parsing cue body check for the block size. If it's the last block do not error if it doesn't have a newline. Fixes #1018	2022-06-02 09:27:47 -07:00
Joey Parrish	f577e2a0cf	chore: Update URLs after moving projects (#1042 ) Since a project URL is encoded into outputs, this means also updating the golden output files. Closes #1043	2022-03-07 11:56:34 -08:00
Vishal Shah	e1b0c7c454	Fix WEBVTT Region parse 100 precent (#1006 )	2021-11-15 21:28:15 -08:00
Joey Parrish	efbca399c0	fix: Add missing limits header In many places, we used std::numeric_limits without including the proper header. This would build on some Linux distributions, but not others. This adds the missing includes, fixing the build on Fedora, among other distros. Change-Id: I63e9e37e5973fe23bbdf9868552db51062b1dae4	2021-10-13 12:25:34 -07:00
Caitlin O'Callaghan	c87c5bcdef	Fix for gap size warning in Low Latency mode (#985 ) ## The issue - With LL-DASH mode enabled, the gap size warning was hit and printed to the console every time a new segment was registered to the manifest. - This occurred because the first chunk's size and duration were being stored for each segment, rather than the full segment size and duration. Note, only the first chunk's metrics are known at first because in low latency mode, the segment is registered to the manifest before it is finished being processed and written. - Because of this, the gap size check was comparing the end time of the first chunk in the previous segment to the beginning time of the current segment, causing the check to fail every time. ## The Fix - Update a low latency segment's duration and size once the segment file has been fully written. - The full segment size and duration will be used to update the bandwidth estimator and the segment info list. - Updating the segment info list to hold the full duration is necessary for satisfying [the gap size check found in Represenation.cc](https://github.com/google/shaka-packager/blob/master/packager/mpd/base/representation.cc#L391). - NOTE: bandwidth estimation is currently only used in HLS	2021-09-03 09:57:43 -07:00
Caitlin O'Callaghan	cd018a71c3	Low latency DASH support (#979 ) # LL-DASH Support These changes add support for LL-DASH streaming. NOTE: LL-HLS support is still in progress, but it's coming. :) ## Testing `./chunking_unittest --gtest_filter="ChunkingHandlerTest.LowLatencyDash"` `./media_event_unittest --gtest_filter="MpdNotifyMuxerListenerTest.LowLatencyDash"` `./mpd_unittest --gtest_filter="PeriodTest.LowLatencyDashMpdGetXml"` `./mpd_unittest --gtest_filter="SimpleMpdNotifierTest.NotifyAvailabilityTimeOffset"` `./mpd_unittest --gtest_filter="SimpleMpdNotifierTest.NotifySegmentDuration"` `./mpd_unittest --gtest_filter="LowLatencySegmentTest.LowLatencySegmentTemplate"` Note, packager_test must be run from the main project directory `./out/Release/packager_test --gtest_filter="PackagerTest.LowLatencyDashEnabledAndUtcTimingNotSet"` `./out/Release/packager_test --gtest_filter="PackagerTest.LowLatencyDashEnabledAndUtcTimingNotSet"`	2021-08-25 08:38:05 -07:00
Joey Parrish	cfbe5c08c2	cleanup: Convert all time parameters to signed This converts all time parameters to signed, finishing a cleanup that was started in 2018 in `b4256bf0`. This changes the type of: - timestamps - PTS specifically - timestamp offsets - timescales - durations This excludes: - MP4 box definitions - DTS specifically This is meant to address signed/unsigned conversion issues on arm64 that caused some test cases to fail. Change-Id: Ic752a20cbc6e31fea6bc0894d1771833171e7cbe	2021-08-05 18:24:15 +00:00
nvincen	f018c9a9bf	Added MPEG-H support (mha1, mhm1) Implemented according to `Audio Amendment to Guidelines for Implementation: DASH-IF Interoperability Points, Version 4.3` (https://dashif.org/docs/Audio%20Amendment%20to%20DASH%20IOP%204.3.pdf). Closes #930.	2021-06-29 23:10:53 -07:00
Mattias Wadman	62f37eb3b7	Ignore matroska projection metadata Warn instead of fail parsing. Closes #932.	2021-05-07 10:13:02 -07:00
KongQun Yang	2e521c8413	Remove another use of regex library It is not working correctly in gcc 4.8 or earlier, which is still popular (bundled by default in CentOS 7). Issue #865, #929. Change-Id: I136446a70831bd0237cd29646dd349fe7558176b	2021-05-05 18:01:27 +00:00
Vishal Shah	d9124d6aaa	[WEBVTT] Fix missing text alignment tags from output Legacy players, e.g. older versions of ExoPlayer, do not handle default webvtt text alignment correctly. Need to specify `align:center` explicitly cues without text alignment for backwards compatibility. Fixes #925.	2021-05-04 22:57:43 -07:00
KongQun Yang	4528bdb330	Remove the use of regex library It is not working correctly in gcc 4.8 or earlier, which is still popular (e.g. bundled by default in CentOS 7). Fixes #865, #929. Change-Id: I55a42428dbd2a12fc2c3b1e6a49fdd662a295dca	2021-05-04 02:09:08 +00:00
Daniel Cantarín	f6c02e629d	Generate object type properly for MPEG-1 audio Fix #905.	2021-04-04 22:47:31 -07:00
Jacob Trimble	c1f64e5350	Fix transparency case in DVB-SUB. This fixes some math errors in the color conversions and handles the case of Y=0. Fixes #903 Change-Id: I796246e4d62a3161b44916f97e9e98f9203ad338	2021-03-29 16:34:39 +00:00
Daniel Cantarín	dd935f6dc3	TTML: change "imagetype" attribute to camel case Fixes #908	2021-03-09 10:10:32 -08:00
Sergio Garcia Murillo	f9908362f8	Prevent seg fault if webm fragment is not initialized or last frame is EOS Fixes #900	2021-03-07 15:01:08 -08:00
Sergio Garcia Murillo	b8ce44aba0	Prevent seg fault if mp4 fragment is not initialized Related to #900.	2021-03-02 23:53:58 -08:00
Jacob Trimble	00af192626	Cleanup HttpFile and related PR. This implements many of the comments made on the PR and cleans up those files. Closes #149 Change-Id: Ice73fe3c04a6f595da6986a4c070e50cb20f9435	2021-03-02 17:43:47 +00:00
Jacob Trimble	a0f3f2cd3a	Add cc_index to stream descriptor. This also allows setting the language of different text streams from the same input. Multiple streams can use the same input stream using different cc_index values and can each use a different language. This also will try to pull the language from the input if not specified. Change-Id: I7078710b509b7d77dad8cb4299a82f954af7e9e7	2021-02-17 18:33:53 +00:00
Jacob Trimble	78be14c092	Add DVB-sub parser Note that this only supports a single page within the DVB-sub stream. Multiple pages will be merged together. A follow-up will allow selecting a specific page. This only supports outputting using TTML or MP4+TTML; you cannot have DVB-sub output nor can you output it in WebVTT. Since DVB-sub uses images, it is hard to impossible to do this with WebVTT. This also only supports interlaced images, not progressive images nor text. Closes #832 Change-Id: Id6dbb6393c7b9a05722e61c6bd255bef5e69a7d8	2021-02-17 18:32:03 +00:00
Jacob Trimble	95089593fc	Don't re-open WebVTT file to determine size. Change-Id: Id92226adce813b7d0c4c741e47e36dbf8f208797	2021-02-08 20:31:13 +00:00
JPeMu	36ef7ec945	[MPEG-TS] Fix PCR reserved bits not being set correctly Fixes #893.	2021-02-03 12:09:07 -08:00
Ole Andre Birkedal	aa17521268	HTTP PUT output support (#737 ) Issue #149 Co-authored-by: Andreas Motl <andreas.motl@elmyra.de> Co-authored-by: Rintaro Kuroiwa <rkuroiwa@google.com> Co-authored-by: Ole Andre Birkedal <o.birkedal@sportradar.com>	2021-02-02 10:51:50 -08:00
Jacob Trimble	5bcda6b88b	Use TsStreamType for MP2T parser. This also changes some of the logs to error so the user can see why the parsing failed. Change-Id: Ib8b7a5076462bccc718e17ef9e0a57d172d1f7b4	2021-02-01 20:13:13 +00:00
Jacob Trimble	2eb32ee177	Propagate Flush errors in MP2T parser. Issue #832 Change-Id: I59f31ff491437b81ffc22ab5760ad0c059e9933e	2021-01-20 18:27:31 +00:00
Jacob Trimble	89d407f9ae	Add subtitle composition to DVB-sub parser. Issue #832 Change-Id: Iababe884619e1e48f1abe0806e8b863c95a3c1ef	2021-01-20 18:26:28 +00:00
Jacob Trimble	32c5393fba	Add helpers for DVB-sub colors. Issue #832 Change-Id: I6350306c7d9a6450d82994bbd9a9a239986bc3fa	2021-01-20 18:25:43 +00:00
Vishal Shah	8e3e8d3e8e	[WEBVTT] Support both center and middle text alignments Fixes #882.	2021-01-19 11:45:20 -08:00
KongQun Yang	10daa39901	[MP4] Allow not to generate 'sidx' box for single-segment too I.e. the flag --generate_sidx_in_media_segments, --nogenerate_sidx_in_media_segments work for both single-segment and multi-segment mode with this change. Related to #862. Change-Id: Icd27fd00e8e036ba0c4709b48650372429cc0351	2020-12-11 19:08:37 +00:00
KongQun Yang	516430bde1	[MP4] Truncate segment references in 'sidx' if necessary The reference count in 'sidx' box is a uint16 field, which allows at most 0xFFFF entries, i.e. at most 0xFFFF subsegments, which is roughly 18 hours for one second segments. Do not fail packaging when it happens. Instead, generate a warning and truncate the number of references to 0xFFFF instead. Note that the actual number of mp4 fragments in the mp4 file can still be more than 0xFFFF. The stream will not play to the end in DASH, but it will play successfully in HLS. Workarounds #862. Change-Id: Ib3930418d1528df1f9ea64cda0d0ebaa78d26abb	2020-12-11 19:07:56 +00:00

1 2 3 4 5 ...

431 Commits