Develop - test run for deployment by KutnerBroadie · Pull Request #249 · CancerDataAggregator/cda-service

KutnerBroadie · 2024-03-15T19:09:07Z

Testing a deployment

…ion table

…system CD-644 fix unique terms duplicate results

* update pr_push with sherlock * change name * update current action to notify sherlock and include push to develop, test and main

ah action to deploy dev

* update sql and tests to use alias and integer_id_alias

* fix tests after modifying the rest of mapping tables to use integer aliases * add schema with data source tables

* separate column defintions by entity vs mapping table * change conditional for mapping table and filter out alias ids * add lots of checks for associated project to look like it is on entity tables

* fix system param in unique terms call * fix test

* Implements parsing based includeCount query optimizer. Currently there is a bug that doesn't allow filters on the same table as the endpoint. * Initial Commit Implements parsing based includeCount query optimizer. Currently there is a bug that doesn't allow filters on the same table as the endpoint. * Fixed Entity Filter Bug Added the ability to handle filters involving the endpoint entity table. * Update QueryService Broke out parenthesis substring code into a function. Cleaned up a few things and added distinction between includeCountQuery and countEndpointQuery. * Moved Filter class to own file. * Added new test file. * Updated queryapi controller test. * Wrapped Filter class in try-catch blocks. * Fixed misspelling. * Adding Count Endpoint Code Broke out Filter into it's own file class. Adding code to enable count endpoint optimization. * Successful Merge of Branches Merged two branches to include new count endpoint optimization code with exception handling + test code * Consolidated initialization constructor Worked out method to add a single initialization function to Filter class. * Added new fn and accompanying unit test to trim extraneous parens from filter queries that are compound. * Changed hard-coded use of and, or and where to use swagger generated values. * Disabled half-written test so whole file can be run. * Provided optimized version of parenthesisSubstring to build string all at once. * Finished Optimized Count Endpoint Queries Added full functionality for producing optimized count endpoint query code * Column Name fix Fixed misspelling of integer_id_alias * Resolved Query Issue Resolved construction of preselect query and added json casting * Updated mutation count column names see description * Fixup for parenthesisSubString in cases where WHERE clause doesn't have parens. * Doing runtime check of entity getprimarykeys to ensure we aren't doing an out-of-bounds access on an empty list. * Ensuring that our entity table has a PK for filter usage. * Allowing coalesce statement to start without a paren. * Added sample coalesce statement. * isRoot Argument Removed and Added Count Tests Created ChildFilter class solely to eliminated need for isRoot argument. Added several tests for each entity/count endpoint. * Added Optimized Query to Results Added the code required to pass back the optimized query for count endpoint queries * Refined Filter Class Rewrote filter initialization code to eliminate need for isRoot and id arguments while maintaining the inability to produce a root filter with isRoot variable set to false. * Filtering Streamed Nulls Filtering out nulls in streams for totalFields to count and groupedCountFields * remove logic to handle subquery * Moved paren string processing to a FilterUtils class, updated tests to work with it. * Fixed Count Optimization for Simple Filters Fixed template to properly handle constructing optimized query when only a simple filter is applied * Refactored Count Optimization to Utilize Wildcard Refactored code to use count(*) where appropriate. * Relationship field as backup to Primary Key Adding ability to join on relationship fields if table has no primary key * use count(*) in count queries * split somatic_mutation columns into internal and external fields * Refactored Count Endpoint Query Creation Added method to add distinct counts when not querying somatic_mutations endpoint otherwise use count(*) * Cleanup Cleaning up commented code * Somatic Mutation Work Arounds And Common Alias For Mapping Added several work arounds to handle somatic_mutation table count queries. Also added utilizing the commonAlias in place of mappingEntityKeys which were generated from join paths. * Adding TODOs Added TODOs for MVP * Updated Schema and Added File Count Coverage Updated the schema to reflect database inclusion of the "subject_alias" column in the somatic mutations table. Also added coverage for getting a file count on count endpoint queries when the path between the entity table to the files table is equal to 1. * Assumption for Common Alias and Cleaned Up cda_subject_alias References Added code to assume the commonAlias variable. Also cleaned up cda_subject_alias references to now use the "subject_alias" column. * Added Checks to Mapping and Join Paths for Common Alias Presence Added checks when building Join paths or just a simple mapping table to ensure the commonAlias exists in those tables * Schema Modification and More somatic_mutation Handling Removed foreign keys for cda_subject_* columns as they were sometimes getting chosen over "subject_alias". Also added handling on join statements involving somatic mutation. * Replaced mappingEntityKey with commonAlias See subject * Updated groupedFieldsToCount for Mutations Updated groupedFieldsToCount to utilize better columns. * Fixed SQL Syntax Error Added parenthesis around UNIONINTESECT for count endpoint creation which lacked coverage for simple filters that had a mapping table --------- Co-authored-by: tanner-coon-bh <tanner.coon@bluehalo.com> Co-authored-by: Andrea Haessly <ahaessly@broadinstitute.org>

fixing two minor slightly fragile syntax issues in build.gradle

* Fixed integer_id_alias Assumption Fixed issues that where affecting filters on *_data_source tables * Added Fixes for *_data_source Tables Added edge case handling for filters from *_data_source tables

Added paged query preselect optimization for files table. Note: currently hard coded for subjects table due to temporary issues with files table

Updated paged query optimizer to utilize file table instead of subject. Also added the ability to pass back the optimized query with the results.

* Added Optimization For File Paged Query Preselects Added paged query preselect optimization for files table. Note: currently hard coded for subjects table due to temporary issues with files table * Updated To Utilize Files table Updated paged query optimizer to utilize file table instead of subject. Also added the ability to pass back the optimized query with the results. * Fixed *_associated_project Table Filters & Simplified file Table Joins Treated *_associated_project tables like *_data_source tables as there is no mapping table between them and their respective entity tables. Also, added code to update the join statement and file preselect filter to only use the mapping table for paged queries * Updated File Join Optimization for All Entity Tables see description

see description

Fixed column counted to be the common alias, not the filter table key.

Fixed bug with query generation for simple queries that have a mapping table with the common alias

bug fix

* Removed bulk-data endpoint * Disabled boolean-query endpoint * Update to set no caching in header response from all of our endpoints. Adjusted to accommodate pen testing results. * Missed the status check

* Removed bulk-data endpoint * Disabled boolean-query endpoint

Fixed bug not properly utilizing AND operations with a lone file filter in the rightFilter

* Fixed Incorrect Paged Queries Fixed incorrect paged queries by utilizing preselect building from the Filter class. * Fixed Join Regex Added proper whitespace check to regex split for getting joins * Fixed Regex Made sure to add '+' for whitespace to regex on join split

Rebuilt logic for dropping unnecessary joins. I now build a join path to get all required tables to join on any tables found in the select clause. Then I only remove any join statements that don't include any of these tables.

* Don't deploy to dev in PRs * release branch is develop

* Updated For New Mutation Table Updated Schemas to reflect changes to data model. Updated MutationSqlGenerator* files to reflect changes to mutation table. Removed work-around code for quirks with somatic_mutation from Filter code. * Updated Mutation Count Endpoint Added "one_consequence" column to the summarized mutation count endpoint. * Updated Mutation Default Order By Changed mutation default order by from case_barcode to integer_id_alias * Updated expected query for CountSqlGeneratorTest Updated expected test result to match the new optimized count query * Bypassing CountSqlGeneratorTest Auto passing the test because it currently isn't written to test against the optimized query

Simple fix to resolve performance issues caused by ordering by non-primarykey column

* BigInt hotfix Fixed issue with bigint column filters processed as text which lead to PostgreSQL errors. * Update tests.yml Updating jacaco due to deprecated issues

* Fixed bug with readable query Found bug in readable query code that doesn't replace the parameters correctly when there are more than 10 present. * Updating SnakeYAML Updating SnakeYAML to non-vulnerable version 2.0. * Upgrading github action upload-artifact

ahaessly and others added 30 commits November 17, 2023 16:20

fix unique_terms querys when filtering on system

70b4c1c

update to deploy rdbms server

b70248d

add a comment to indicate how we handle fields from the somatic_mutat…

ab7769b

…ion table

update starting/testing section

f2559b5

Merge pull request #226 from CancerDataAggregator/ah-cd-644-dups-for-…

e0dee17

…system CD-644 fix unique terms duplicate results

add new workflow (#227)

acf0e71

* update pr_push with sherlock * change name * update current action to notify sherlock and include push to develop, test and main

remove hikari property

f6aa8c1

add HikariCP

dc10794

remove redundant workflow

165813e

delete old workflow

d781470

add push to dev

510e62a

remove sonarqube

f6cde13

fix spacing

dcdf9e8

change field names when specifying fields in the mapping tables

a0efc7d

don't return id alias fields

5732458

Merge pull request #231 from CancerDataAggregator/ah_add_sherlock_dev

6ba747b

ah action to deploy dev

CD-675 AH fix integer alias (#232)

f04d22c

* update sql and tests to use alias and integer_id_alias

upgrade versions to match vulnerability fixes in main (#234)

c79560e

CD-772 finish migrating to integer aliases (#235)

c145698

* fix tests after modifying the rest of mapping tables to use integer aliases * add schema with data source tables

add boolean types for parameters (#236)

76ff268

Ah CD-778 internal cols exposed (#237)

5d0f90f

* separate column defintions by entity vs mapping table * change conditional for mapping table and filter out alias ids * add lots of checks for associated project to look like it is on entity tables

treat somatic_mutation as both entity and mapping table (#238)

84d3447

AH fix unique terms system param (#239)

dd4296a

* fix system param in unique terms call * fix test

fix relationships with data_source tables

2332523

Update build.gradle

b766693

fixing two minor slightly fragile syntax issues in build.gradle

Mvp bug fix (#245)

ca278f5

* Fixed integer_id_alias Assumption Fixed issues that where affecting filters on *_data_source tables * Added Fixes for *_data_source Tables Added edge case handling for filters from *_data_source tables

Added Optimization For File Paged Query Preselects

367b867

Added paged query preselect optimization for files table. Note: currently hard coded for subjects table due to temporary issues with files table

Updated To Utilize Files table

2165d39

Updated paged query optimizer to utilize file table instead of subject. Also added the ability to pass back the optimized query with the results.

upgrade logback version to 1.2.13 (#241)

46a44bc

ahaessly and others added 28 commits March 12, 2024 13:07

test connection to database for status check (#246)

b3ad039

Updated entitycountsqlgenerator tests to match optimized queries.

d5ba894

Merge branch 'file_preselect_optimization' into develop

bc9e355

Disabled incorrect unit tests.

381cc5a

Disabled unit tests that need to be updated in the future.

8de6da3

Fixing FileSqlGeneratorTest

c8876be

Catching Non-Entity*SQLGenerator types

25248e2

see description

Fixed Parenthesis Around Filters with File Preselect

b0bb647

Added case where file columns included in select clause

9cec0bc

Bug Fix for simple queries with mapping table

322a7ff

Fixed column counted to be the common alias, not the filter table key.

Fixed another bug

2085653

Fixed bug with query generation for simple queries that have a mapping table with the common alias

set qa env for stable data

e5dffe9

Merge branch 'test' into develop

485b42a

Innocuous change to trigger deploy

cda132d

Fixed bugs with simple queries

a5beafa

bug fix

Remove caching from response header (#251)

7223f18

* Removed bulk-data endpoint * Disabled boolean-query endpoint * Update to set no caching in header response from all of our endpoints. Adjusted to accommodate pen testing results. * Missed the status check

Removed bulk-data and boolean-query endpoint (#250)

a3053b6

* Removed bulk-data endpoint * Disabled boolean-query endpoint

Updated the hardcoded dataset-description information (#252)

a29357e

Added Endpoint to Test Java Memory Settings (#253)

c3c2f29

Fixed bug (#254)

eca564b

Fixed bug not properly utilizing AND operations with a lone file filter in the rightFilter

Fixed Join Keep in Paged Query (#256)

545cd35

Rebuilt logic for dropping unnecessary joins. I now build a join path to get all required tables to join on any tables found in the select clause. Then I only remove any join statements that don't include any of these tables.

Don't deploy to dev in PRs (#258)

8e3eb62

* Don't deploy to dev in PRs * release branch is develop

Fixed Mutation Default Order By (#260)

fb26efc

Simple fix to resolve performance issues caused by ordering by non-primarykey column

BigInt hotfix (#261)

28f7ff3

* BigInt hotfix Fixed issue with bigint column filters processed as text which lead to PostgreSQL errors. * Update tests.yml Updating jacaco due to deprecated issues

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Develop - test run for deployment#249

Develop - test run for deployment#249
KutnerBroadie wants to merge 58 commits into
testfrom
develop

KutnerBroadie commented Mar 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

KutnerBroadie commented Mar 15, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants