[FFmpeg-devel,RFC] avformat: introduce AVStreamGroup

Message ID	20230906143832.54604-1-jamrial@gmail.com
State	New
Headers	show Delivered-To: ffmpegpatchwork2@gmail.com Received-SPF: pass (google.com: domain of ffmpeg-devel-bounces@ffmpeg.org designates 79.124.17.100 as permitted sender) client-ip=79.124.17.100; From: James Almer <jamrial@gmail.com> To: ffmpeg-devel@ffmpeg.org Date: Wed, 6 Sep 2023 11:38:32 -0300 Message-ID: <20230906143832.54604-1-jamrial@gmail.com> MIME-Version: 1.0 Subject: [FFmpeg-devel] [PATCH] [RFC]avformat: introduce AVStreamGroup Precedence: list Reply-To: FFmpeg development discussions and patches <ffmpeg-devel@ffmpeg.org> Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: ffmpeg-devel-bounces@ffmpeg.org Sender: "ffmpeg-devel" <ffmpeg-devel-bounces@ffmpeg.org>
Series	[FFmpeg-devel,RFC] avformat: introduce AVStreamGroup \| expand [FFmpeg-devel,RFC] avformat: introduce AVStreamGroup

James Almer Sept. 6, 2023, 2:38 p.m. UTC

Signed-off-by: James Almer <jamrial@gmail.com>
---
This is an initial proof of concept for AVStream groups, something that's
needed for quite a few existing and upcoming formats that lavf has no way to
currently export. Said formats define a single video or audio stream composed
by merging several individualy multiplexed streams within a media file.
This is the case of HEIF, a format defining a tiled image where each tile is a
separate image (either hevc, av1, etc) all of which need to be decoded
individualy and then stitched together for presentation using container level
information; IAMF, a new audio format where several individual streams (mono or
stereo) need to be decoded individually and then combined to form audio streams
with different channel layouts; and MPEG-TS programs, currently exported as
AVProgram, which this new general purpose API would replace.
There may be others too, like something ISOBMFF specific and not HEIF related,
I'm told.

A new struct, AVStreamGroup, would cover all these cases by grouping the
relevant streams and propagating format specific metadata for the purpose of
combining them into what's expected for presentation (Something a filter for
example would have to take care of).

Missing from this first version is something like a type field, which could be
an enum listing the different currently known formats the user would then use
to interpret the attached metadata, defined perhaps in codecpar.extradata

I'd like to hear opinions and suggestions to improve and properly handle this.

 libavformat/avformat.c |  26 ++++++++
 libavformat/avformat.h | 143 ++++++++++++++++++++++++++++++++++++++++-
 libavformat/dump.c     |  65 +++++++++++++++++--
 libavformat/internal.h |  28 ++++++++
 libavformat/options.c  |  77 ++++++++++++++++++++++
 5 files changed, 332 insertions(+), 7 deletions(-)

Tomas Härdin Sept. 6, 2023, 5:53 p.m. UTC | #1

ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
> Signed-off-by: James Almer <jamrial@gmail.com>
> ---
> This is an initial proof of concept for AVStream groups, something
> that's
> needed for quite a few existing and upcoming formats that lavf has no
> way to
> currently export. Said formats define a single video or audio stream
> composed
> by merging several individualy multiplexed streams within a media
> file.
> This is the case of HEIF, a format defining a tiled image where each
> tile is a
> separate image (either hevc, av1, etc) all of which need to be
> decoded
> individualy and then stitched together for presentation using
> container level
> information; 

I remember this blocking HEIF as a GSoC project. Honestly the way that
format is designed is immensely horrible.

> MPEG-TS programs, currently exported as
> AVProgram, which this new general purpose API would replace.

I can foresee this being a nuisance for users accustomed to AVProgram.
Also this feature borders on NLE territory. Not necessarily a bad
thing, but FFmpeg is overall poorly architectured for NLE stuff. I
believe I raised this issue back when lavfi was proposed, it being
wholly unsuitable for NLE work.


> +typedef struct AVStreamGroup {
> +    /**
> +     * A class for @ref avoptions. Set on stream creation.
> +     */
> +    const AVClass *av_class;
> +
> +    /**
> +     * Group index in AVFormatContext.
> +     */
> +    int index;
> +
> +    /**
> +     * Format-specific group ID.
> +     * decoding: set by libavformat
> +     * encoding: set by the user, replaced by libavformat if left
> unset
> +     */
> +    int id;
> +
> +    /**
> +     * Codec parameters associated with this stream group. Allocated
> and freed
> +     * by libavformat in avformat_new_stream_group() and
> avformat_free_context()
> +     * respectively.
> +     *
> +     * - demuxing: filled by libavformat on stream group creation or
> in
> +     *             avformat_find_stream_info()
> +     * - muxing: filled by the caller before avformat_write_header()
> +     */
> +    AVCodecParameters *codecpar;
> +
> +    void *priv_data;
> +
> +    /**
> +     * Number of elements in AVStreamGroup.stream_index.
> +     *
> +     * Set by av_stream_group_add_stream() and
> av_stream_group_new_stream(), must not
> +     * be modified by any other code.
> +     */
> +    int nb_stream_indexes;
> +
> +    /**
> +     * A list of indexes of streams in the group. New entries are
> created with
> +     * av_stream_group_add_stream() and
> av_stream_group_new_stream().
> +     *
> +     * - demuxing: entries are created by libavformat in
> avformat_open_input().
> +     *             If AVFMTCTX_NOHEADER is set in ctx_flags, then
> new entries may also
> +     *             appear in av_read_frame().
> +     * - muxing: entries are created by the user before
> avformat_write_header().
> +     *
> +     * Freed by libavformat in avformat_free_context().
> +     */
> +    int *stream_index;
> +} AVStreamGroup;

I see no provisions for attaching metadata, for example HEIF stitching.
Putting it in coderpar seems wrong, since it is container-level
metadata. We could just have an HEIF specific struct as container
metadata.

/Tomas

James Almer Sept. 6, 2023, 7:16 p.m. UTC | #2

On 9/6/2023 2:53 PM, Tomas Härdin wrote:
> ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
>> Signed-off-by: James Almer <jamrial@gmail.com>
>> ---
>> This is an initial proof of concept for AVStream groups, something
>> that's
>> needed for quite a few existing and upcoming formats that lavf has no
>> way to
>> currently export. Said formats define a single video or audio stream
>> composed
>> by merging several individualy multiplexed streams within a media
>> file.
>> This is the case of HEIF, a format defining a tiled image where each
>> tile is a
>> separate image (either hevc, av1, etc) all of which need to be
>> decoded
>> individualy and then stitched together for presentation using
>> container level
>> information;
> 
> I remember this blocking HEIF as a GSoC project. Honestly the way that
> format is designed is immensely horrible.
> 
>> MPEG-TS programs, currently exported as
>> AVProgram, which this new general purpose API would replace.
> 
> I can foresee this being a nuisance for users accustomed to AVProgram.
> Also this feature borders on NLE territory. Not necessarily a bad
> thing, but FFmpeg is overall poorly architectured for NLE stuff. I
> believe I raised this issue back when lavfi was proposed, it being
> wholly unsuitable for NLE work.
> 
> 
>> +typedef struct AVStreamGroup {
>> +    /**
>> +     * A class for @ref avoptions. Set on stream creation.
>> +     */
>> +    const AVClass *av_class;
>> +
>> +    /**
>> +     * Group index in AVFormatContext.
>> +     */
>> +    int index;
>> +
>> +    /**
>> +     * Format-specific group ID.
>> +     * decoding: set by libavformat
>> +     * encoding: set by the user, replaced by libavformat if left
>> unset
>> +     */
>> +    int id;
>> +
>> +    /**
>> +     * Codec parameters associated with this stream group. Allocated
>> and freed
>> +     * by libavformat in avformat_new_stream_group() and
>> avformat_free_context()
>> +     * respectively.
>> +     *
>> +     * - demuxing: filled by libavformat on stream group creation or
>> in
>> +     *             avformat_find_stream_info()
>> +     * - muxing: filled by the caller before avformat_write_header()
>> +     */
>> +    AVCodecParameters *codecpar;
>> +
>> +    void *priv_data;
>> +
>> +    /**
>> +     * Number of elements in AVStreamGroup.stream_index.
>> +     *
>> +     * Set by av_stream_group_add_stream() and
>> av_stream_group_new_stream(), must not
>> +     * be modified by any other code.
>> +     */
>> +    int nb_stream_indexes;
>> +
>> +    /**
>> +     * A list of indexes of streams in the group. New entries are
>> created with
>> +     * av_stream_group_add_stream() and
>> av_stream_group_new_stream().
>> +     *
>> +     * - demuxing: entries are created by libavformat in
>> avformat_open_input().
>> +     *             If AVFMTCTX_NOHEADER is set in ctx_flags, then
>> new entries may also
>> +     *             appear in av_read_frame().
>> +     * - muxing: entries are created by the user before
>> avformat_write_header().
>> +     *
>> +     * Freed by libavformat in avformat_free_context().
>> +     */
>> +    int *stream_index;
>> +} AVStreamGroup;
> 
> I see no provisions for attaching metadata, for example HEIF stitching.
> Putting it in coderpar seems wrong, since it is container-level
> metadata. We could just have an HEIF specific struct as container
> metadata.

The doxy for AVCodecParameters says "This struct describes the 
properties of an encoded stream.", so It's not about container level props.

Although codecpar will be used to export the merged/stitched stream 
props like dimensions and channel layout, maybe you're right about the 
metadata because there would be a clash between actual HEVC/Opus/AAC/AV1 
extradata and the HEIF/IAMF/etc specific info if both use 
codecpar.extradata, even if one will be in AVStream and the other in 
AVStreamGroup.

Maybe in side_data (Once my other set is pushed)? Defining new types for 
each kind of metadata.

Tomas Härdin Sept. 13, 2023, 9:34 a.m. UTC | #3

ons 2023-09-06 klockan 16:16 -0300 skrev James Almer:
> On 9/6/2023 2:53 PM, Tomas Härdin wrote:
> > ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
> > > Signed-off-by: James Almer <jamrial@gmail.com>
> > > ---
> > > This is an initial proof of concept for AVStream groups,
> > > something
> > > that's
> > > needed for quite a few existing and upcoming formats that lavf
> > > has no
> > > way to
> > > currently export. Said formats define a single video or audio
> > > stream
> > > composed
> > > by merging several individualy multiplexed streams within a media
> > > file.
> > > This is the case of HEIF, a format defining a tiled image where
> > > each
> > > tile is a
> > > separate image (either hevc, av1, etc) all of which need to be
> > > decoded
> > > individualy and then stitched together for presentation using
> > > container level
> > > information;
> > 
> > I remember this blocking HEIF as a GSoC project. Honestly the way
> > that
> > format is designed is immensely horrible.
> > 
> > > MPEG-TS programs, currently exported as
> > > AVProgram, which this new general purpose API would replace.
> > 
> > I can foresee this being a nuisance for users accustomed to
> > AVProgram.
> > Also this feature borders on NLE territory. Not necessarily a bad
> > thing, but FFmpeg is overall poorly architectured for NLE stuff. I
> > believe I raised this issue back when lavfi was proposed, it being
> > wholly unsuitable for NLE work.
> > 
> > 
> > > +typedef struct AVStreamGroup {
> > > +    /**
> > > +     * A class for @ref avoptions. Set on stream creation.
> > > +     */
> > > +    const AVClass *av_class;
> > > +
> > > +    /**
> > > +     * Group index in AVFormatContext.
> > > +     */
> > > +    int index;
> > > +
> > > +    /**
> > > +     * Format-specific group ID.
> > > +     * decoding: set by libavformat
> > > +     * encoding: set by the user, replaced by libavformat if
> > > left
> > > unset
> > > +     */
> > > +    int id;
> > > +
> > > +    /**
> > > +     * Codec parameters associated with this stream group.
> > > Allocated
> > > and freed
> > > +     * by libavformat in avformat_new_stream_group() and
> > > avformat_free_context()
> > > +     * respectively.
> > > +     *
> > > +     * - demuxing: filled by libavformat on stream group
> > > creation or
> > > in
> > > +     *             avformat_find_stream_info()
> > > +     * - muxing: filled by the caller before
> > > avformat_write_header()
> > > +     */
> > > +    AVCodecParameters *codecpar;
> > > +
> > > +    void *priv_data;
> > > +
> > > +    /**
> > > +     * Number of elements in AVStreamGroup.stream_index.
> > > +     *
> > > +     * Set by av_stream_group_add_stream() and
> > > av_stream_group_new_stream(), must not
> > > +     * be modified by any other code.
> > > +     */
> > > +    int nb_stream_indexes;
> > > +
> > > +    /**
> > > +     * A list of indexes of streams in the group. New entries
> > > are
> > > created with
> > > +     * av_stream_group_add_stream() and
> > > av_stream_group_new_stream().
> > > +     *
> > > +     * - demuxing: entries are created by libavformat in
> > > avformat_open_input().
> > > +     *             If AVFMTCTX_NOHEADER is set in ctx_flags,
> > > then
> > > new entries may also
> > > +     *             appear in av_read_frame().
> > > +     * - muxing: entries are created by the user before
> > > avformat_write_header().
> > > +     *
> > > +     * Freed by libavformat in avformat_free_context().
> > > +     */
> > > +    int *stream_index;
> > > +} AVStreamGroup;
> > 
> > I see no provisions for attaching metadata, for example HEIF
> > stitching.
> > Putting it in coderpar seems wrong, since it is container-level
> > metadata. We could just have an HEIF specific struct as container
> > metadata.
> 
> The doxy for AVCodecParameters says "This struct describes the 
> properties of an encoded stream.", so It's not about container level
> props.

It *is* container level props. The underlying codecs have no concept of
this kind of stitching. The closest you're going to get is tiles in
JPEG2000, but I doubt HEIF support JPEG2000.

We might say "well the resulting stream group has resolution so it's
like a codec" but see below.

> Although codecpar will be used to export the merged/stitched stream 
> props like dimensions and channel layout, maybe you're right about
> the 
> metadata because there would be a clash between actual
> HEVC/Opus/AAC/AV1 
> extradata and the HEIF/IAMF/etc specific info if both use 
> codecpar.extradata, even if one will be in AVStream and the other in 
> AVStreamGroup.

Yes, pretty much. But it's more that codecpar is pressed into service
where it probably doesn't belong. It might be more appropriate to call
these "essence parameters". I'm going to stick my neck out further and
say that picture and sound essence should be handled with different
structs, not smushed together into one struct like AVCodecParameters.

/Tomas

Pierre-Anthony Lemieux Sept. 13, 2023, 2:33 p.m. UTC | #4

On Wed, Sep 13, 2023 at 2:35 AM Tomas Härdin <git@haerdin.se> wrote:
>
> ons 2023-09-06 klockan 16:16 -0300 skrev James Almer:
> > On 9/6/2023 2:53 PM, Tomas Härdin wrote:
> > > ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
> > > > Signed-off-by: James Almer <jamrial@gmail.com>
> > > > ---
> > > > This is an initial proof of concept for AVStream groups,
> > > > something
> > > > that's
> > > > needed for quite a few existing and upcoming formats that lavf
> > > > has no
> > > > way to
> > > > currently export. Said formats define a single video or audio
> > > > stream
> > > > composed
> > > > by merging several individualy multiplexed streams within a media
> > > > file.
> > > > This is the case of HEIF, a format defining a tiled image where
> > > > each
> > > > tile is a
> > > > separate image (either hevc, av1, etc) all of which need to be
> > > > decoded
> > > > individualy and then stitched together for presentation using
> > > > container level
> > > > information;
> > >
> > > I remember this blocking HEIF as a GSoC project. Honestly the way
> > > that
> > > format is designed is immensely horrible.
> > >
> > > > MPEG-TS programs, currently exported as
> > > > AVProgram, which this new general purpose API would replace.
> > >
> > > I can foresee this being a nuisance for users accustomed to
> > > AVProgram.
> > > Also this feature borders on NLE territory. Not necessarily a bad
> > > thing, but FFmpeg is overall poorly architectured for NLE stuff. I
> > > believe I raised this issue back when lavfi was proposed, it being
> > > wholly unsuitable for NLE work.
> > >
> > >
> > > > +typedef struct AVStreamGroup {
> > > > +    /**
> > > > +     * A class for @ref avoptions. Set on stream creation.
> > > > +     */
> > > > +    const AVClass *av_class;
> > > > +
> > > > +    /**
> > > > +     * Group index in AVFormatContext.
> > > > +     */
> > > > +    int index;
> > > > +
> > > > +    /**
> > > > +     * Format-specific group ID.
> > > > +     * decoding: set by libavformat
> > > > +     * encoding: set by the user, replaced by libavformat if
> > > > left
> > > > unset
> > > > +     */
> > > > +    int id;
> > > > +
> > > > +    /**
> > > > +     * Codec parameters associated with this stream group.
> > > > Allocated
> > > > and freed
> > > > +     * by libavformat in avformat_new_stream_group() and
> > > > avformat_free_context()
> > > > +     * respectively.
> > > > +     *
> > > > +     * - demuxing: filled by libavformat on stream group
> > > > creation or
> > > > in
> > > > +     *             avformat_find_stream_info()
> > > > +     * - muxing: filled by the caller before
> > > > avformat_write_header()
> > > > +     */
> > > > +    AVCodecParameters *codecpar;
> > > > +
> > > > +    void *priv_data;
> > > > +
> > > > +    /**
> > > > +     * Number of elements in AVStreamGroup.stream_index.
> > > > +     *
> > > > +     * Set by av_stream_group_add_stream() and
> > > > av_stream_group_new_stream(), must not
> > > > +     * be modified by any other code.
> > > > +     */
> > > > +    int nb_stream_indexes;
> > > > +
> > > > +    /**
> > > > +     * A list of indexes of streams in the group. New entries
> > > > are
> > > > created with
> > > > +     * av_stream_group_add_stream() and
> > > > av_stream_group_new_stream().
> > > > +     *
> > > > +     * - demuxing: entries are created by libavformat in
> > > > avformat_open_input().
> > > > +     *             If AVFMTCTX_NOHEADER is set in ctx_flags,
> > > > then
> > > > new entries may also
> > > > +     *             appear in av_read_frame().
> > > > +     * - muxing: entries are created by the user before
> > > > avformat_write_header().
> > > > +     *
> > > > +     * Freed by libavformat in avformat_free_context().
> > > > +     */
> > > > +    int *stream_index;
> > > > +} AVStreamGroup;
> > >
> > > I see no provisions for attaching metadata, for example HEIF
> > > stitching.
> > > Putting it in coderpar seems wrong, since it is container-level
> > > metadata. We could just have an HEIF specific struct as container
> > > metadata.
> >
> > The doxy for AVCodecParameters says "This struct describes the
> > properties of an encoded stream.", so It's not about container level
> > props.
>
> It *is* container level props. The underlying codecs have no concept of
> this kind of stitching. The closest you're going to get is tiles in
> JPEG2000, but I doubt HEIF support JPEG2000.

Just an FYI.

HEIF supports JPEG 2000:

https://www.itu.int/rec/T-REC-T.815/en

One implementation:

https://github.com/strukturag/libheif/pull/874

>
> We might say "well the resulting stream group has resolution so it's
> like a codec" but see below.
>
> > Although codecpar will be used to export the merged/stitched stream
> > props like dimensions and channel layout, maybe you're right about
> > the
> > metadata because there would be a clash between actual
> > HEVC/Opus/AAC/AV1
> > extradata and the HEIF/IAMF/etc specific info if both use
> > codecpar.extradata, even if one will be in AVStream and the other in
> > AVStreamGroup.
>
> Yes, pretty much. But it's more that codecpar is pressed into service
> where it probably doesn't belong. It might be more appropriate to call
> these "essence parameters". I'm going to stick my neck out further and
> say that picture and sound essence should be handled with different
> structs, not smushed together into one struct like AVCodecParameters.
>
> /Tomas
> _______________________________________________
> ffmpeg-devel mailing list
> ffmpeg-devel@ffmpeg.org
> https://ffmpeg.org/mailman/listinfo/ffmpeg-devel
>
> To unsubscribe, visit link above, or email
> ffmpeg-devel-request@ffmpeg.org with subject "unsubscribe".

Tomas Härdin Sept. 13, 2023, 8:41 p.m. UTC | #5

ons 2023-09-13 klockan 07:33 -0700 skrev Pierre-Anthony Lemieux:
> On Wed, Sep 13, 2023 at 2:35 AM Tomas Härdin <git@haerdin.se> wrote:
> > 
> > ons 2023-09-06 klockan 16:16 -0300 skrev James Almer:
> > > On 9/6/2023 2:53 PM, Tomas Härdin wrote:
> > > > ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
> > > > > Signed-off-by: James Almer <jamrial@gmail.com>
> > > > > ---
> > > > > This is an initial proof of concept for AVStream groups,
> > > > > something
> > > > > that's
> > > > > needed for quite a few existing and upcoming formats that
> > > > > lavf
> > > > > has no
> > > > > way to
> > > > > currently export. Said formats define a single video or audio
> > > > > stream
> > > > > composed
> > > > > by merging several individualy multiplexed streams within a
> > > > > media
> > > > > file.
> > > > > This is the case of HEIF, a format defining a tiled image
> > > > > where
> > > > > each
> > > > > tile is a
> > > > > separate image (either hevc, av1, etc) all of which need to
> > > > > be
> > > > > decoded
> > > > > individualy and then stitched together for presentation using
> > > > > container level
> > > > > information;
> > > > 
> > > > I remember this blocking HEIF as a GSoC project. Honestly the
> > > > way
> > > > that
> > > > format is designed is immensely horrible.
> > > > 
> > > > > MPEG-TS programs, currently exported as
> > > > > AVProgram, which this new general purpose API would replace.
> > > > 
> > > > I can foresee this being a nuisance for users accustomed to
> > > > AVProgram.
> > > > Also this feature borders on NLE territory. Not necessarily a
> > > > bad
> > > > thing, but FFmpeg is overall poorly architectured for NLE
> > > > stuff. I
> > > > believe I raised this issue back when lavfi was proposed, it
> > > > being
> > > > wholly unsuitable for NLE work.
> > > > 
> > > > 
> > > > > +typedef struct AVStreamGroup {
> > > > > +    /**
> > > > > +     * A class for @ref avoptions. Set on stream creation.
> > > > > +     */
> > > > > +    const AVClass *av_class;
> > > > > +
> > > > > +    /**
> > > > > +     * Group index in AVFormatContext.
> > > > > +     */
> > > > > +    int index;
> > > > > +
> > > > > +    /**
> > > > > +     * Format-specific group ID.
> > > > > +     * decoding: set by libavformat
> > > > > +     * encoding: set by the user, replaced by libavformat if
> > > > > left
> > > > > unset
> > > > > +     */
> > > > > +    int id;
> > > > > +
> > > > > +    /**
> > > > > +     * Codec parameters associated with this stream group.
> > > > > Allocated
> > > > > and freed
> > > > > +     * by libavformat in avformat_new_stream_group() and
> > > > > avformat_free_context()
> > > > > +     * respectively.
> > > > > +     *
> > > > > +     * - demuxing: filled by libavformat on stream group
> > > > > creation or
> > > > > in
> > > > > +     *             avformat_find_stream_info()
> > > > > +     * - muxing: filled by the caller before
> > > > > avformat_write_header()
> > > > > +     */
> > > > > +    AVCodecParameters *codecpar;
> > > > > +
> > > > > +    void *priv_data;
> > > > > +
> > > > > +    /**
> > > > > +     * Number of elements in AVStreamGroup.stream_index.
> > > > > +     *
> > > > > +     * Set by av_stream_group_add_stream() and
> > > > > av_stream_group_new_stream(), must not
> > > > > +     * be modified by any other code.
> > > > > +     */
> > > > > +    int nb_stream_indexes;
> > > > > +
> > > > > +    /**
> > > > > +     * A list of indexes of streams in the group. New
> > > > > entries
> > > > > are
> > > > > created with
> > > > > +     * av_stream_group_add_stream() and
> > > > > av_stream_group_new_stream().
> > > > > +     *
> > > > > +     * - demuxing: entries are created by libavformat in
> > > > > avformat_open_input().
> > > > > +     *             If AVFMTCTX_NOHEADER is set in ctx_flags,
> > > > > then
> > > > > new entries may also
> > > > > +     *             appear in av_read_frame().
> > > > > +     * - muxing: entries are created by the user before
> > > > > avformat_write_header().
> > > > > +     *
> > > > > +     * Freed by libavformat in avformat_free_context().
> > > > > +     */
> > > > > +    int *stream_index;
> > > > > +} AVStreamGroup;
> > > > 
> > > > I see no provisions for attaching metadata, for example HEIF
> > > > stitching.
> > > > Putting it in coderpar seems wrong, since it is container-level
> > > > metadata. We could just have an HEIF specific struct as
> > > > container
> > > > metadata.
> > > 
> > > The doxy for AVCodecParameters says "This struct describes the
> > > properties of an encoded stream.", so It's not about container
> > > level
> > > props.
> > 
> > It *is* container level props. The underlying codecs have no
> > concept of
> > this kind of stitching. The closest you're going to get is tiles in
> > JPEG2000, but I doubt HEIF support JPEG2000.
> 
> Just an FYI.
> 
> HEIF supports JPEG 2000:
> 
> https://www.itu.int/rec/T-REC-T.815/en
> 
> One implementation:
> 
> https://github.com/strukturag/libheif/pull/874

Cursed

/Tomas

James Almer Sept. 15, 2023, 6:10 p.m. UTC | #6

On 9/13/2023 6:34 AM, Tomas Härdin wrote:
> ons 2023-09-06 klockan 16:16 -0300 skrev James Almer:
>> On 9/6/2023 2:53 PM, Tomas Härdin wrote:
>>> ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
>>>> Signed-off-by: James Almer <jamrial@gmail.com>
>>>> ---
>>>> This is an initial proof of concept for AVStream groups,
>>>> something
>>>> that's
>>>> needed for quite a few existing and upcoming formats that lavf
>>>> has no
>>>> way to
>>>> currently export. Said formats define a single video or audio
>>>> stream
>>>> composed
>>>> by merging several individualy multiplexed streams within a media
>>>> file.
>>>> This is the case of HEIF, a format defining a tiled image where
>>>> each
>>>> tile is a
>>>> separate image (either hevc, av1, etc) all of which need to be
>>>> decoded
>>>> individualy and then stitched together for presentation using
>>>> container level
>>>> information;
>>>
>>> I remember this blocking HEIF as a GSoC project. Honestly the way
>>> that
>>> format is designed is immensely horrible.
>>>
>>>> MPEG-TS programs, currently exported as
>>>> AVProgram, which this new general purpose API would replace.
>>>
>>> I can foresee this being a nuisance for users accustomed to
>>> AVProgram.
>>> Also this feature borders on NLE territory. Not necessarily a bad
>>> thing, but FFmpeg is overall poorly architectured for NLE stuff. I
>>> believe I raised this issue back when lavfi was proposed, it being
>>> wholly unsuitable for NLE work.
>>>
>>>
>>>> +typedef struct AVStreamGroup {
>>>> +    /**
>>>> +     * A class for @ref avoptions. Set on stream creation.
>>>> +     */
>>>> +    const AVClass *av_class;
>>>> +
>>>> +    /**
>>>> +     * Group index in AVFormatContext.
>>>> +     */
>>>> +    int index;
>>>> +
>>>> +    /**
>>>> +     * Format-specific group ID.
>>>> +     * decoding: set by libavformat
>>>> +     * encoding: set by the user, replaced by libavformat if
>>>> left
>>>> unset
>>>> +     */
>>>> +    int id;
>>>> +
>>>> +    /**
>>>> +     * Codec parameters associated with this stream group.
>>>> Allocated
>>>> and freed
>>>> +     * by libavformat in avformat_new_stream_group() and
>>>> avformat_free_context()
>>>> +     * respectively.
>>>> +     *
>>>> +     * - demuxing: filled by libavformat on stream group
>>>> creation or
>>>> in
>>>> +     *             avformat_find_stream_info()
>>>> +     * - muxing: filled by the caller before
>>>> avformat_write_header()
>>>> +     */
>>>> +    AVCodecParameters *codecpar;
>>>> +
>>>> +    void *priv_data;
>>>> +
>>>> +    /**
>>>> +     * Number of elements in AVStreamGroup.stream_index.
>>>> +     *
>>>> +     * Set by av_stream_group_add_stream() and
>>>> av_stream_group_new_stream(), must not
>>>> +     * be modified by any other code.
>>>> +     */
>>>> +    int nb_stream_indexes;
>>>> +
>>>> +    /**
>>>> +     * A list of indexes of streams in the group. New entries
>>>> are
>>>> created with
>>>> +     * av_stream_group_add_stream() and
>>>> av_stream_group_new_stream().
>>>> +     *
>>>> +     * - demuxing: entries are created by libavformat in
>>>> avformat_open_input().
>>>> +     *             If AVFMTCTX_NOHEADER is set in ctx_flags,
>>>> then
>>>> new entries may also
>>>> +     *             appear in av_read_frame().
>>>> +     * - muxing: entries are created by the user before
>>>> avformat_write_header().
>>>> +     *
>>>> +     * Freed by libavformat in avformat_free_context().
>>>> +     */
>>>> +    int *stream_index;
>>>> +} AVStreamGroup;
>>>
>>> I see no provisions for attaching metadata, for example HEIF
>>> stitching.
>>> Putting it in coderpar seems wrong, since it is container-level
>>> metadata. We could just have an HEIF specific struct as container
>>> metadata.
>>
>> The doxy for AVCodecParameters says "This struct describes the
>> properties of an encoded stream.", so It's not about container level
>> props.
> 
> It *is* container level props. The underlying codecs have no concept of
> this kind of stitching. The closest you're going to get is tiles in
> JPEG2000, but I doubt HEIF support JPEG2000.
> 
> We might say "well the resulting stream group has resolution so it's
> like a codec" but see below.
> 
>> Although codecpar will be used to export the merged/stitched stream
>> props like dimensions and channel layout, maybe you're right about
>> the
>> metadata because there would be a clash between actual
>> HEVC/Opus/AAC/AV1
>> extradata and the HEIF/IAMF/etc specific info if both use
>> codecpar.extradata, even if one will be in AVStream and the other in
>> AVStreamGroup.
> 
> Yes, pretty much. But it's more that codecpar is pressed into service
> where it probably doesn't belong. It might be more appropriate to call
> these "essence parameters". I'm going to stick my neck out further and
> say that picture and sound essence should be handled with different
> structs, not smushed together into one struct like AVCodecParameters.

Can you suggest how to approach this then, if not with 
AVCodecParameters? For the resulting merged/stitched stream, we need at 
the very least dimensions, and for audio we need channel layout, and for 
each different kind of group type (HEIF, IAMF, TS programs) we need 
specific parameters, like order of tiles for HEIF, mixing parameters for 
IAMF, and these parameters should be in a form easy for the user (like 
our CLI) to feed to lavfi, where the actual merging/stitching would take 
place with new or existing filters for this specific purpose.

Maybe something like:

----
enum AVStreamGroupParamsType {
     AV_STREAM_GROUP_PARAMS_NONE,
     AV_STREAM_GROUP_PARAMS_TS,
     AV_STREAM_GROUP_PARAMS_HEIF,
     AV_STREAM_GROUP_PARAMS_IAMF,
};

typedef struct AVStreamGroupTSParams {
     // Basically AVProgram
} AVStreamGroupTSParams;

typedef struct AVStreamGroupHEIFParams {
     // dimensions, tile order, etc
} AVStreamGroupHEIFParams;

typedef struct AVStreamGroupIAMFParams {
     // channel layout, mixing params
} AVStreamGroupIAMFParams;

typedef struct AVStreamGroup {
     [...]
     enum AVStreamGroupParamsType type;
     union {
         AVStreamGroupTSParams ts;
         AVStreamGroupHEIFParams heif;
         AVStreamGroupIAMFParams iamf;
     } essence;
     [...]
} AVStreamGroup;
----

Tomas Härdin Sept. 28, 2023, 11:27 a.m. UTC | #7

fre 2023-09-15 klockan 15:10 -0300 skrev James Almer:
> On 9/13/2023 6:34 AM, Tomas Härdin wrote:
> > ons 2023-09-06 klockan 16:16 -0300 skrev James Almer:
> > > On 9/6/2023 2:53 PM, Tomas Härdin wrote:
> > > > ons 2023-09-06 klockan 11:38 -0300 skrev James Almer:
> > > > > Signed-off-by: James Almer <jamrial@gmail.com>
> > > > > ---
> > > > > This is an initial proof of concept for AVStream groups,
> > > > > something
> > > > > that's
> > > > > needed for quite a few existing and upcoming formats that
> > > > > lavf
> > > > > has no
> > > > > way to
> > > > > currently export. Said formats define a single video or audio
> > > > > stream
> > > > > composed
> > > > > by merging several individualy multiplexed streams within a
> > > > > media
> > > > > file.
> > > > > This is the case of HEIF, a format defining a tiled image
> > > > > where
> > > > > each
> > > > > tile is a
> > > > > separate image (either hevc, av1, etc) all of which need to
> > > > > be
> > > > > decoded
> > > > > individualy and then stitched together for presentation using
> > > > > container level
> > > > > information;
> > > > 
> > > > I remember this blocking HEIF as a GSoC project. Honestly the
> > > > way
> > > > that
> > > > format is designed is immensely horrible.
> > > > 
> > > > > MPEG-TS programs, currently exported as
> > > > > AVProgram, which this new general purpose API would replace.
> > > > 
> > > > I can foresee this being a nuisance for users accustomed to
> > > > AVProgram.
> > > > Also this feature borders on NLE territory. Not necessarily a
> > > > bad
> > > > thing, but FFmpeg is overall poorly architectured for NLE
> > > > stuff. I
> > > > believe I raised this issue back when lavfi was proposed, it
> > > > being
> > > > wholly unsuitable for NLE work.
> > > > 
> > > > 
> > > > > +typedef struct AVStreamGroup {
> > > > > +    /**
> > > > > +     * A class for @ref avoptions. Set on stream creation.
> > > > > +     */
> > > > > +    const AVClass *av_class;
> > > > > +
> > > > > +    /**
> > > > > +     * Group index in AVFormatContext.
> > > > > +     */
> > > > > +    int index;
> > > > > +
> > > > > +    /**
> > > > > +     * Format-specific group ID.
> > > > > +     * decoding: set by libavformat
> > > > > +     * encoding: set by the user, replaced by libavformat if
> > > > > left
> > > > > unset
> > > > > +     */
> > > > > +    int id;
> > > > > +
> > > > > +    /**
> > > > > +     * Codec parameters associated with this stream group.
> > > > > Allocated
> > > > > and freed
> > > > > +     * by libavformat in avformat_new_stream_group() and
> > > > > avformat_free_context()
> > > > > +     * respectively.
> > > > > +     *
> > > > > +     * - demuxing: filled by libavformat on stream group
> > > > > creation or
> > > > > in
> > > > > +     *             avformat_find_stream_info()
> > > > > +     * - muxing: filled by the caller before
> > > > > avformat_write_header()
> > > > > +     */
> > > > > +    AVCodecParameters *codecpar;
> > > > > +
> > > > > +    void *priv_data;
> > > > > +
> > > > > +    /**
> > > > > +     * Number of elements in AVStreamGroup.stream_index.
> > > > > +     *
> > > > > +     * Set by av_stream_group_add_stream() and
> > > > > av_stream_group_new_stream(), must not
> > > > > +     * be modified by any other code.
> > > > > +     */
> > > > > +    int nb_stream_indexes;
> > > > > +
> > > > > +    /**
> > > > > +     * A list of indexes of streams in the group. New
> > > > > entries
> > > > > are
> > > > > created with
> > > > > +     * av_stream_group_add_stream() and
> > > > > av_stream_group_new_stream().
> > > > > +     *
> > > > > +     * - demuxing: entries are created by libavformat in
> > > > > avformat_open_input().
> > > > > +     *             If AVFMTCTX_NOHEADER is set in ctx_flags,
> > > > > then
> > > > > new entries may also
> > > > > +     *             appear in av_read_frame().
> > > > > +     * - muxing: entries are created by the user before
> > > > > avformat_write_header().
> > > > > +     *
> > > > > +     * Freed by libavformat in avformat_free_context().
> > > > > +     */
> > > > > +    int *stream_index;
> > > > > +} AVStreamGroup;
> > > > 
> > > > I see no provisions for attaching metadata, for example HEIF
> > > > stitching.
> > > > Putting it in coderpar seems wrong, since it is container-level
> > > > metadata. We could just have an HEIF specific struct as
> > > > container
> > > > metadata.
> > > 
> > > The doxy for AVCodecParameters says "This struct describes the
> > > properties of an encoded stream.", so It's not about container
> > > level
> > > props.
> > 
> > It *is* container level props. The underlying codecs have no
> > concept of
> > this kind of stitching. The closest you're going to get is tiles in
> > JPEG2000, but I doubt HEIF support JPEG2000.
> > 
> > We might say "well the resulting stream group has resolution so
> > it's
> > like a codec" but see below.
> > 
> > > Although codecpar will be used to export the merged/stitched
> > > stream
> > > props like dimensions and channel layout, maybe you're right
> > > about
> > > the
> > > metadata because there would be a clash between actual
> > > HEVC/Opus/AAC/AV1
> > > extradata and the HEIF/IAMF/etc specific info if both use
> > > codecpar.extradata, even if one will be in AVStream and the other
> > > in
> > > AVStreamGroup.
> > 
> > Yes, pretty much. But it's more that codecpar is pressed into
> > service
> > where it probably doesn't belong. It might be more appropriate to
> > call
> > these "essence parameters". I'm going to stick my neck out further
> > and
> > say that picture and sound essence should be handled with different
> > structs, not smushed together into one struct like
> > AVCodecParameters.
> 
> Can you suggest how to approach this then, if not with 
> AVCodecParameters? For the resulting merged/stitched stream, we need
> at 
> the very least dimensions, and for audio we need channel layout, and
> for 
> each different kind of group type (HEIF, IAMF, TS programs) we need 
> specific parameters, like order of tiles for HEIF, mixing parameters
> for 
> IAMF, and these parameters should be in a form easy for the user
> (like 
> our CLI) to feed to lavfi, where the actual merging/stitching would
> take 
> place with new or existing filters for this specific purpose.
> 
> Maybe something like:
> 
> ----
> enum AVStreamGroupParamsType {
>      AV_STREAM_GROUP_PARAMS_NONE,
>      AV_STREAM_GROUP_PARAMS_TS,
>      AV_STREAM_GROUP_PARAMS_HEIF,
>      AV_STREAM_GROUP_PARAMS_IAMF,
> };
> 
> typedef struct AVStreamGroupTSParams {
>      // Basically AVProgram
> } AVStreamGroupTSParams;
> 
> typedef struct AVStreamGroupHEIFParams {
>      // dimensions, tile order, etc
> } AVStreamGroupHEIFParams;
> 
> typedef struct AVStreamGroupIAMFParams {
>      // channel layout, mixing params
> } AVStreamGroupIAMFParams;
> 
> typedef struct AVStreamGroup {
>      [...]
>      enum AVStreamGroupParamsType type;
>      union {
>          AVStreamGroupTSParams ts;
>          AVStreamGroupHEIFParams heif;
>          AVStreamGroupIAMFParams iamf;
>      } essence;
>      [...]
> } AVStreamGroup;

Sorry for not replying to this sooner.

Yes, a typed union like this should work nicely. This way we keep
things related to each type of stream group separate.

/Tomas

Anton Khirnov Oct. 2, 2023, 9:25 a.m. UTC | #8

Quoting Tomas Härdin (2023-09-28 13:27:53)
> Yes, a typed union like this should work nicely. This way we keep
> things related to each type of stream group separate.

I agree that this seems like a better solution than repurposing
AVCodecParameters, but the union members probably need to be pointers to
keep both the stream group and the type-specific structs extensible.

Anton Khirnov Oct. 2, 2023, 9:37 a.m. UTC | #9

Quoting James Almer (2023-09-06 16:38:32)
> Signed-off-by: James Almer <jamrial@gmail.com>
> ---
> This is an initial proof of concept for AVStream groups, something that's
> needed for quite a few existing and upcoming formats that lavf has no way to
> currently export. Said formats define a single video or audio stream composed
> by merging several individualy multiplexed streams within a media file.
> This is the case of HEIF, a format defining a tiled image where each tile is a
> separate image (either hevc, av1, etc) all of which need to be decoded
> individualy and then stitched together for presentation using container level
> information; IAMF, a new audio format where several individual streams (mono or
> stereo) need to be decoded individually and then combined to form audio streams
> with different channel layouts; and MPEG-TS programs, currently exported as
> AVProgram, which this new general purpose API would replace.
> There may be others too, like something ISOBMFF specific and not HEIF related,
> I'm told.
> 
> A new struct, AVStreamGroup, would cover all these cases by grouping the
> relevant streams and propagating format specific metadata for the purpose of
> combining them into what's expected for presentation (Something a filter for
> example would have to take care of).
> 
> Missing from this first version is something like a type field, which could be
> an enum listing the different currently known formats the user would then use
> to interpret the attached metadata, defined perhaps in codecpar.extradata
> 
> I'd like to hear opinions and suggestions to improve and properly handle this.
> 
>  libavformat/avformat.c |  26 ++++++++
>  libavformat/avformat.h | 143 ++++++++++++++++++++++++++++++++++++++++-
>  libavformat/dump.c     |  65 +++++++++++++++++--
>  libavformat/internal.h |  28 ++++++++
>  libavformat/options.c  |  77 ++++++++++++++++++++++
>  5 files changed, 332 insertions(+), 7 deletions(-)
> 

> diff --git a/libavformat/avformat.h b/libavformat/avformat.h
> index 1916aa2dc5..d18eafb933 100644
> --- a/libavformat/avformat.h
> +++ b/libavformat/avformat.h
> @@ -1007,6 +1007,59 @@ typedef struct AVStream {
>      int pts_wrap_bits;
>  } AVStream;
>  
> +typedef struct AVStreamGroup {
> +    /**
> +     * A class for @ref avoptions. Set on stream creation.
                                             ^^^^^^
                                             group

> +     */
> +    const AVClass *av_class;
> +
> +    /**
> +     * Group index in AVFormatContext.
> +     */
> +    int index;

unsigned?

> +
> +    /**
> +     * Format-specific group ID.
> +     * decoding: set by libavformat
> +     * encoding: set by the user, replaced by libavformat if left unset
> +     */
> +    int id;

might want to make this 64bit

> +
> +    /**
> +     * Codec parameters associated with this stream group. Allocated and freed
> +     * by libavformat in avformat_new_stream_group() and avformat_free_context()
> +     * respectively.
> +     *
> +     * - demuxing: filled by libavformat on stream group creation or in
> +     *             avformat_find_stream_info()
> +     * - muxing: filled by the caller before avformat_write_header()
> +     */
> +    AVCodecParameters *codecpar;
> +
> +    void *priv_data;

Do we really need this?

> +
> +    /**
> +     * Number of elements in AVStreamGroup.stream_index.
> +     *
> +     * Set by av_stream_group_add_stream() and av_stream_group_new_stream(), must not
> +     * be modified by any other code.
> +     */
> +    int nb_stream_indexes;
> +
> +    /**
> +     * A list of indexes of streams in the group. New entries are created with
> +     * av_stream_group_add_stream() and av_stream_group_new_stream().
> +     *
> +     * - demuxing: entries are created by libavformat in avformat_open_input().
> +     *             If AVFMTCTX_NOHEADER is set in ctx_flags, then new entries may also
> +     *             appear in av_read_frame().
> +     * - muxing: entries are created by the user before avformat_write_header().
> +     *
> +     * Freed by libavformat in avformat_free_context().
> +     */
> +    int *stream_index;

unsigned for both?

> @@ -1844,6 +1940,51 @@ const AVClass *av_stream_get_class(void);
>   */
>  AVStream *avformat_new_stream(AVFormatContext *s, const AVCodec *c);
>  
> +/**
> + * Add a new stream to a stream group.
> + *
> + * When demuxing, it may be called by the demuxer in read_header(). If the
> + * flag AVFMTCTX_NOHEADER is set in s.ctx_flags, then it may also
> + * be called in read_packet().
> + *
> + * When muxing, may be called by the user before avformat_write_header() after
> + * having allocated a new group with avformat_new_stream_group().
> + *
> + * User is required to call avformat_free_context() to clean up the allocation
> + * by av_stream_group_new_stream().
> + *
> + * This is functionally the same as avformat_new_stream() while also adding the
> + * newly allocated stream to the group belonging to the media file.
> + *
> + * @param stg stream group belonging to a media file.
> + *
> + * @return newly created stream or NULL on error.
> + * @see av_stream_group_add_stream, avformat_new_stream_group.
> + */
> +AVStream *av_stream_group_new_stream(AVStreamGroup *stg);

Is there a big enough advantage to having this as a separate function?

> +
> +/**
> + * Add an already allocated stream to a stream group.
> + *
> + * When demuxing, it may be called by the demuxer in read_header(). If the
> + * flag AVFMTCTX_NOHEADER is set in s.ctx_flags, then it may also
> + * be called in read_packet().
> + *
> + * When muxing, may be called by the user before avformat_write_header() after
> + * having allocated a new group with avformat_new_stream_group() and stream with
> + * avformat_new_stream().
> + *
> + * User is required to call avformat_free_context() to clean up the allocation
> + * by av_stream_group_add_stream().
> + *
> + * @param stg stream group belonging to a media file.
> + * @param st  stream in the media file to add to the group.
> + *
> + * @return 0 on success, or a negative AVERROR otherwise.
> + * @see avformat_new_stream, av_stream_group_new_stream, avformat_new_stream_group.
> + */
> +int av_stream_group_add_stream(AVStreamGroup *stg, const AVStream *st);

It'd be nice to have the streamgroup-related functions consistenly
namespaced.

E.g.
* avformat_stream_group_add()
* avformat_stream_group_add_stream()
* ff_stream_group_free()
etc.

alternatively for the first two:
* avformat_stream_group_create()
* avformat_stream_group_extend()

James Almer Oct. 2, 2023, 12:10 p.m. UTC | #10

On 10/2/2023 6:37 AM, Anton Khirnov wrote:
> Quoting James Almer (2023-09-06 16:38:32)
>> Signed-off-by: James Almer <jamrial@gmail.com>
>> ---
>> This is an initial proof of concept for AVStream groups, something that's
>> needed for quite a few existing and upcoming formats that lavf has no way to
>> currently export. Said formats define a single video or audio stream composed
>> by merging several individualy multiplexed streams within a media file.
>> This is the case of HEIF, a format defining a tiled image where each tile is a
>> separate image (either hevc, av1, etc) all of which need to be decoded
>> individualy and then stitched together for presentation using container level
>> information; IAMF, a new audio format where several individual streams (mono or
>> stereo) need to be decoded individually and then combined to form audio streams
>> with different channel layouts; and MPEG-TS programs, currently exported as
>> AVProgram, which this new general purpose API would replace.
>> There may be others too, like something ISOBMFF specific and not HEIF related,
>> I'm told.
>>
>> A new struct, AVStreamGroup, would cover all these cases by grouping the
>> relevant streams and propagating format specific metadata for the purpose of
>> combining them into what's expected for presentation (Something a filter for
>> example would have to take care of).
>>
>> Missing from this first version is something like a type field, which could be
>> an enum listing the different currently known formats the user would then use
>> to interpret the attached metadata, defined perhaps in codecpar.extradata
>>
>> I'd like to hear opinions and suggestions to improve and properly handle this.
>>
>>   libavformat/avformat.c |  26 ++++++++
>>   libavformat/avformat.h | 143 ++++++++++++++++++++++++++++++++++++++++-
>>   libavformat/dump.c     |  65 +++++++++++++++++--
>>   libavformat/internal.h |  28 ++++++++
>>   libavformat/options.c  |  77 ++++++++++++++++++++++
>>   5 files changed, 332 insertions(+), 7 deletions(-)
>>
> 
>> diff --git a/libavformat/avformat.h b/libavformat/avformat.h
>> index 1916aa2dc5..d18eafb933 100644
>> --- a/libavformat/avformat.h
>> +++ b/libavformat/avformat.h
>> @@ -1007,6 +1007,59 @@ typedef struct AVStream {
>>       int pts_wrap_bits;
>>   } AVStream;
>>   
>> +typedef struct AVStreamGroup {
>> +    /**
>> +     * A class for @ref avoptions. Set on stream creation.
>                                               ^^^^^^
>                                               group
> 
>> +     */
>> +    const AVClass *av_class;
>> +
>> +    /**
>> +     * Group index in AVFormatContext.
>> +     */
>> +    int index;
> 
> unsigned?

Made it int to have it consistent with AVStream, but ok.

> 
>> +
>> +    /**
>> +     * Format-specific group ID.
>> +     * decoding: set by libavformat
>> +     * encoding: set by the user, replaced by libavformat if left unset
>> +     */
>> +    int id;
> 
> might want to make this 64bit

Ok.

> 
>> +
>> +    /**
>> +     * Codec parameters associated with this stream group. Allocated and freed
>> +     * by libavformat in avformat_new_stream_group() and avformat_free_context()
>> +     * respectively.
>> +     *
>> +     * - demuxing: filled by libavformat on stream group creation or in
>> +     *             avformat_find_stream_info()
>> +     * - muxing: filled by the caller before avformat_write_header()
>> +     */
>> +    AVCodecParameters *codecpar;
>> +
>> +    void *priv_data;
> 
> Do we really need this?

It's a single pointer, and some demuxers may actually make use of it, 
like they do for the AVStream's counterpart. A git grep "st->priv_data" 
has a lot of hits.

> 
>> +
>> +    /**
>> +     * Number of elements in AVStreamGroup.stream_index.
>> +     *
>> +     * Set by av_stream_group_add_stream() and av_stream_group_new_stream(), must not
>> +     * be modified by any other code.
>> +     */
>> +    int nb_stream_indexes;
>> +
>> +    /**
>> +     * A list of indexes of streams in the group. New entries are created with
>> +     * av_stream_group_add_stream() and av_stream_group_new_stream().
>> +     *
>> +     * - demuxing: entries are created by libavformat in avformat_open_input().
>> +     *             If AVFMTCTX_NOHEADER is set in ctx_flags, then new entries may also
>> +     *             appear in av_read_frame().
>> +     * - muxing: entries are created by the user before avformat_write_header().
>> +     *
>> +     * Freed by libavformat in avformat_free_context().
>> +     */
>> +    int *stream_index;
> 
> unsigned for both?

Ok.

> 
>> @@ -1844,6 +1940,51 @@ const AVClass *av_stream_get_class(void);
>>    */
>>   AVStream *avformat_new_stream(AVFormatContext *s, const AVCodec *c);
>>   
>> +/**
>> + * Add a new stream to a stream group.
>> + *
>> + * When demuxing, it may be called by the demuxer in read_header(). If the
>> + * flag AVFMTCTX_NOHEADER is set in s.ctx_flags, then it may also
>> + * be called in read_packet().
>> + *
>> + * When muxing, may be called by the user before avformat_write_header() after
>> + * having allocated a new group with avformat_new_stream_group().
>> + *
>> + * User is required to call avformat_free_context() to clean up the allocation
>> + * by av_stream_group_new_stream().
>> + *
>> + * This is functionally the same as avformat_new_stream() while also adding the
>> + * newly allocated stream to the group belonging to the media file.
>> + *
>> + * @param stg stream group belonging to a media file.
>> + *
>> + * @return newly created stream or NULL on error.
>> + * @see av_stream_group_add_stream, avformat_new_stream_group.
>> + */
>> +AVStream *av_stream_group_new_stream(AVStreamGroup *stg);
> 
> Is there a big enough advantage to having this as a separate function?

I figured it would be nice to have it for the sake of convenience. For 
formats like HEIF, new streams should in theory be created once the 
group they will belong to is known.
I have no strong attachment to this function, so it can go if you think 
it's superfluous.

> 
>> +
>> +/**
>> + * Add an already allocated stream to a stream group.
>> + *
>> + * When demuxing, it may be called by the demuxer in read_header(). If the
>> + * flag AVFMTCTX_NOHEADER is set in s.ctx_flags, then it may also
>> + * be called in read_packet().
>> + *
>> + * When muxing, may be called by the user before avformat_write_header() after
>> + * having allocated a new group with avformat_new_stream_group() and stream with
>> + * avformat_new_stream().
>> + *
>> + * User is required to call avformat_free_context() to clean up the allocation
>> + * by av_stream_group_add_stream().
>> + *
>> + * @param stg stream group belonging to a media file.
>> + * @param st  stream in the media file to add to the group.
>> + *
>> + * @return 0 on success, or a negative AVERROR otherwise.
>> + * @see avformat_new_stream, av_stream_group_new_stream, avformat_new_stream_group.
>> + */
>> +int av_stream_group_add_stream(AVStreamGroup *stg, const AVStream *st);
> 
> It'd be nice to have the streamgroup-related functions consistenly
> namespaced.
> 
> E.g.
> * avformat_stream_group_add()
> * avformat_stream_group_add_stream()
> * ff_stream_group_free()
> etc.
> 
> alternatively for the first two:
> * avformat_stream_group_create()
> * avformat_stream_group_extend()

I named avformat_new_stream_group() essentially the same as 
avformat_new_stream(), then namespaced the functions that take a 
AVStreamGroup as input.
I don't particularly like _extend(), but i guess i could do something like

AVStreamGroup *avformat_stream_group_create(AVFormatContext *s)
int avformat_stream_group_add_stream(AVStreamGroup *stg,
                                      const AVStream *st);

James Almer Oct. 2, 2023, 7:48 p.m. UTC | #11

On 10/2/2023 6:25 AM, Anton Khirnov wrote:
> Quoting Tomas Härdin (2023-09-28 13:27:53)
>> Yes, a typed union like this should work nicely. This way we keep
>> things related to each type of stream group separate.
> 
> I agree that this seems like a better solution than repurposing
> AVCodecParameters, but the union members probably need to be pointers to
> keep both the stream group and the type-specific structs extensible.

Good idea, will do that and re-send.

Anton Khirnov Oct. 3, 2023, 3:43 p.m. UTC | #12

Quoting James Almer (2023-10-02 14:10:11)
> 
> I figured it would be nice to have it for the sake of convenience. For 
> formats like HEIF, new streams should in theory be created once the 
> group they will belong to is known.
> I have no strong attachment to this function, so it can go if you think 
> it's superfluous.

I expect the use cases for it to be limited and the advantage not that
big, so yeah, I'd prefer it dropped.

> > 
> >> +
> >> +/**
> >> + * Add an already allocated stream to a stream group.
> >> + *
> >> + * When demuxing, it may be called by the demuxer in read_header(). If the
> >> + * flag AVFMTCTX_NOHEADER is set in s.ctx_flags, then it may also
> >> + * be called in read_packet().
> >> + *
> >> + * When muxing, may be called by the user before avformat_write_header() after
> >> + * having allocated a new group with avformat_new_stream_group() and stream with
> >> + * avformat_new_stream().
> >> + *
> >> + * User is required to call avformat_free_context() to clean up the allocation
> >> + * by av_stream_group_add_stream().
> >> + *
> >> + * @param stg stream group belonging to a media file.
> >> + * @param st  stream in the media file to add to the group.
> >> + *
> >> + * @return 0 on success, or a negative AVERROR otherwise.
> >> + * @see avformat_new_stream, av_stream_group_new_stream, avformat_new_stream_group.
> >> + */
> >> +int av_stream_group_add_stream(AVStreamGroup *stg, const AVStream *st);
> > 
> > It'd be nice to have the streamgroup-related functions consistenly
> > namespaced.
> > 
> > E.g.
> > * avformat_stream_group_add()
> > * avformat_stream_group_add_stream()
> > * ff_stream_group_free()
> > etc.
> > 
> > alternatively for the first two:
> > * avformat_stream_group_create()
> > * avformat_stream_group_extend()
> 
> I named avformat_new_stream_group() essentially the same as 
> avformat_new_stream(), then namespaced the functions that take a 
> AVStreamGroup as input.
> I don't particularly like _extend(), but i guess i could do something like
> 
> AVStreamGroup *avformat_stream_group_create(AVFormatContext *s)
> int avformat_stream_group_add_stream(AVStreamGroup *stg,
>                                       const AVStream *st);

Fine with me.

[FFmpeg-devel,RFC] avformat: introduce AVStreamGroup

Commit Message

Comments

Patch