RE: [personal kdb+] unique index

david_demner · January 2, 2016, 3:03am

User-Agent: Workspace Webmail 5.16.0Message-Id: <20160101200302.85f80dae80d1d2f2e266ec6278e6cbe8.4a08ca37d3.wbe@email07.europe.secureserver.net>From: “David Demner (AquaQ)” <david.demner>To: personal-kdbplus@googlegroups.comSubject: RE: [personal kdb+] unique indexDate: Fri, 01 Jan 2016 20:03:02 -0700Mime-Version: 1.0

Selecting that many records basically returns the whole table so you don’t get a gain from g# but you do pay the overhead of having it. Using only 10 cust, g# is faster:

q)n:1300000 q)a:([]cust:n?`8; v1:n?100; v2:n?100; v3:n?100) q)b:10#distinct a`cust q)\ts select from a where cust in b 19 18874704 q)update `g#cust from `a `a q)\ts select from a where cust in b 0 1008

vs using 100000 cust as below is way slower:

q)a:([]cust:n?`8; v1:n?100; v2:n?100; v3:n?100) q)b:100000#distinct a`cust q)\ts select from a where cust in b 23 18874704 q)update `g#cust from `a `a q)\ts select from a where cust in b 160 4719024

`u# won't help because it's not the table that you're applying the index to but the list; and it wouldn't help anyway for 100k cust (same problem as above)

Sam11 · January 2, 2016, 5:21am

So I guess the query searches table ‘a’ for each value in ‘b’, so indexing on ‘b’ doesn’t help.

The following query runs much faster but unfortunately only finds the first occurrence, whereas I need all occurrences. I wonder if there is way to tweak this without a big hit on performance

\t a (a`cust)?b
0

Topic		Views
unique index Community Support kdb-and-q	0	January 2, 2016
RE: [personal kdb+] kdb+tick with schemaless events Community Support imported , kdb-and-q	2	April 29, 2015
kdb+ intro question Community Support kdb-and-q	2	June 28, 2014
slow performance of win32 version of KDB Community Support kdb-and-q	7	September 15, 2015
Question about the performance difference of two queries on HDB Community Support kdb-and-q	7	January 28, 2016

RE: [personal kdb+] unique index

Related topics