pg_query

by lfittl

lfittl /pg_query

Ruby extension to parse, deparse and normalize SQL queries using the PostgreSQL query parser

470 Stars 44 Forks Last release: Not found BSD 3-Clause "New" or "Revised" License 415 Commits 56 Releases

Available items

No Items, yet!

The developer of this repository has not created any items for sale yet. Need a bug fixed? Help with integration? A different license? Create a request here:

pgquery [ ](https://rubygems.org/gems/pgquery)

This Ruby extension uses the actual PostgreSQL server source to parse SQL queries and return the internal PostgreSQL parsetree.

In addition the extension allows you to normalize queries (replacing constant values with ?) and parse these normalized queries into a parsetree again.

When you build this extension, it builds parts of the PostgreSQL server source (see libpg_query), and then statically links it into this extension.

This is slightly crazy, but is the only reliable way of parsing all valid PostgreSQL queries.

You can find further examples and a longer rationale here: https://pganalyze.com/blog/parse-postgresql-queries-in-ruby.html

Installation

gem install pg_query

Due to compiling parts of PostgreSQL, installation might take a while on slower systems. Expect up to 5 minutes.

Usage

Parsing a query

PgQuery.parse("SELECT 1")

=> #<0x007fe92b27ea18> {"targetList"=> [{"ResTarget"=> {"val"=>{"A_Const"=>{"val"=>{"Integer"=>{"ival"=>1}}, "location"=>7}}, "location"=>7}}], "op"=>0, }}], @query="SELECT 1", @warnings=[]> </0x007fe92b27ea18>

Modifying a parsed query and turning it into SQL again

parsed_query = PgQuery.parse("SELECT * FROM users")

=> #<0x007ff3e956c8b0> {"targetList"=> [{"ResTarget"=> {"val"=> {"ColumnRef"=> {"fields"=>[{"A_Star"=>{}}], "location"=>7}}, "location"=>7} }], "fromClause"=> [{"RangeVar"=> {"relname"=>"users", "inhOpt"=>2, "relpersistence"=>"p", "location"=>14}}], }}], @query="SELECT * FROM users", @warnings=[]>

Modify the parse tree in some way

parsed_query.tree[0]['SelectStmt']['fromClause'][0]['RangeVar']['relname'] = 'other_users'

Turn it into SQL again

parsed_query.deparse => "SELECT * FROM "other_users"" </0x007ff3e956c8b0>

Note: The deparsing feature is experimental and does not support outputting all SQL yet.

Parsing a normalized query

# Normalizing a query (like pg_stat_statements in Postgres 10+)
PgQuery.normalize("SELECT 1 FROM x WHERE y = 'foo'")

=> "SELECT $1 FROM x WHERE y = $2"

Parsing a normalized query (pre-Postgres 10 style)

PgQuery.parse("SELECT ? FROM x WHERE y = ?")

=> #<0x007fb99455a438> {"targetList"=> [{"ResTarget"=> {"val"=>{"ParamRef"=>{"location"=>7}}, "location"=>7}}], "fromClause"=> [{"RangeVar"=> {"relname"=>"x", "inhOpt"=>2, "relpersistence"=>"p", "location"=>14}}], "whereClause"=> {"A_Expr"=> {"kind"=>0, "name"=>[{"String"=>{"str"=>"="}}], "lexpr"=>{"ColumnRef"=>{"fields"=>[{"String"=>{"str"=>"y"}}], "location"=>22}}, "rexpr"=>{"ParamRef"=>{"location"=>26}}, "location"=>24}}, }}], @query="SELECT ? FROM x WHERE y = ?", @warnings=[]> </0x007fb99455a438>

Extracting tables from a query

PgQuery.parse("SELECT ? FROM x JOIN y USING (id) WHERE z = ?").tables

=> ["x", "y"]

Extracting columns from a query

PgQuery.parse("SELECT ? FROM x WHERE x.y = ? AND z = ?").filter_columns

=> [["x", "y"], [nil, "z"]]

Fingerprinting a query

PgQuery.parse("SELECT 1").fingerprint

=> "8e1acac181c6d28f4a923392cf1c4eda49ee4cd2"

PgQuery.parse("SELECT 2; --- comment").fingerprint

=> "8e1acac181c6d28f4a923392cf1c4eda49ee4cd2"

Faster fingerprint method that is implemented inside the native library

PgQuery.fingerprint("SELECT ?")

=> "8e1acac181c6d28f4a923392cf1c4eda49ee4cd2"

Differences from Upstream PostgreSQL

This gem is based on libpg_query, which uses the latest stable PostgreSQL version, but with a patch applied to support parsing normalized queries containing

?
replacement characters.

Supported Ruby Versions

Currently tested and officially supported Ruby versions:

  • MRI 2.1
  • MRI 2.2
  • MRI 2.3
  • MRI 2.4

Resources

See libpg_query for pgquery in other languages, as well as products/tools built on pgquery.

Original Author

Special Thanks to

License

Copyright (c) 2015, pganalyze Team [email protected]
pg_query is licensed under the 3-clause BSD license, see LICENSE file for details.

Query normalization code:
Copyright (c) 2008-2015, PostgreSQL Global Development Group

We use cookies. If you continue to browse the site, you agree to the use of cookies. For more information on our use of cookies please see our Privacy Policy.