Feat: Introduce query layer for launch-event search across NDJSON, gzip, and CLP inputs.

**Background:**

Right now, kernel and launch-event querying lives under the info subcommand of the TritonParse CLI, implemented in [tritonparse/info](https://github.com/meta-pytorch/tritonparse/tree/ef7726147721fd51f26998137ce479b63ffa78ac/tritonparse/info).
- [cli.py](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tritonparse/info/cli.py) accepts a kernel name and a comma-separated equality query via [`--args-list`](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tritonparse/info/cli.py#L36). It also contains the parsing and execution logic for that query through [`_parse_args_list`](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tritonparse/info/cli.py#L194) and [`_launch_matches_filter`](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tritonparse/info/cli.py#L296).
- [kernel_query.py](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tritonparse/info/kernel_query.py) provides launch-event lookup helpers, supporting simple filtering by kernel name and launch index.

In the current search flow, launch events are first narrowed by kernel name, then iterated over to check whether each event matches the conditions from the `--args-list` query. For each matching launch, the output reports its per-kernel launch ID, its event index within the full trace, and its recorded launch grid:
```
  ...
  id=1054  line  1058  grid=[1]
  id=1055  line  1059  grid=[1]
  id=1056  line  1060  grid=[1]
  id=1057  line  1061  grid=[1]
  ...
```

**Areas for potential improvements:**
- Gzip archives must first be decompressed before they can be searched.
- The current query format only supports equality conditions joined by `AND`, so it is less expressive than a SQL-style or KQL-style query.
- Test coverage is uneven: the `info` subcommand is covered by [test_info_cli.py](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tests/cpu/test_info_cli.py) and [test_kernel_query.py](https://github.com/meta-pytorch/tritonparse/blob/ef7726147721fd51f26998137ce479b63ffa78ac/tests/cpu/test_kernel_query.py), but there is currently no test coverage for `--args-list` filtering.
- The current search output is not schema-aware: it supports filtering launches, but does not project or display event fields from the matched launch records, especially those referenced in the query.

**Proposed next steps (PR scope):** 
- Introduce a dedicated Python query class, such as [`KqlQuery`](https://github.com/y-scope/clp/blob/7f90b54987b280007ef5ada444b33698344bbce2/python-wheels/yscope-clp-core/yscope_clp_core/_config.py#L37) or `LaunchQuery`, that can be shared between TritonParse and clp-ffi.
- Add a temporary adapter that translates `--args-list` into this query object. CLP archives could then be searched through the query object, while NDJSON and gzip inputs can continue using the existing filtering path.
- Alternatively, the CLI could also accept KQL-style query input for CLP archives. This should be mutually exclusive with `--args-list`.
- Tests should be added to ensure both paths produce identical search results when the query is strictly translatable from `--args-list`.

**Longer-term considerations:**
Both query parsing and execution could be consolidated into a shared query layer rather than being split across `cli.py` and `kernel_query.py`.
Eventually, the advantages of using CLP for storage would be:
- Unlike gzip, archive search would not require decompression.
- Query parsing and execution are already abstracted by the CLP engine.
- More expressive queries are supported beyond simple equality, including timestamp ranges, logical operators, pattern matching, and projection.
So TritonParse could focus on defining the launch-event schema and higher-level query semantics.

Finally, one important question is whether there are any plans to support querying directly within the TritonParse UI?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feat: Introduce query layer for launch-event search across NDJSON, gzip, and CLP inputs. #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Feat: Introduce query layer for launch-event search across NDJSON, gzip, and CLP inputs. #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions