Several questions about KDB+ and Q

terryyao · December 4, 2021, 12:00am

https://learninghub.kx.com/forums/topic/several-question-of-kdb-and-q

Hi all,

How do I run a SQL-style query and use the result afterwards? For example:

select f1 from table1

and then use f1 as a variable, like a SQL procedure that does select f1 into var from table1 and uses var afterwards.

How do I calculate the cointegration of two columns?

How do I drop tables? For example:

q)tables[]
`aggregate_quote`aggregate_quote1`trades

drop aggregate_quote aggregate_quote1 trades

Is there a q library like TA-Lib for financial data?

Thanks

terryyao · January 13, 2022, 12:00am

Thanks Laura.

For cointegration, here is the Python example code:

def coint(
    y0,
    y1,
    trend="c",
    method="aeg",
    maxlag=None,
    autolag="aic",
    return_results=None,
):
    """
    Test for no-cointegration of a univariate equation.

    The null hypothesis is no cointegration. Variables in y0 and y1 are
    assumed to be integrated of order 1, I(1).

    This uses the augmented Engle-Granger two-step cointegration test.
    Constant or trend is included in 1st stage regression, i.e. in
    cointegrating equation.

    **Warning:** The autolag default has changed compared to statsmodels 0.8.
    In 0.8 autolag was always None, now the keyword is used and defaults to
    "aic". Use `autolag=None` to avoid the lag search.

    Parameters
    ----------
    y0 : array_like
        The first element in cointegrated system. Must be 1-d.
    y1 : array_like
        The remaining elements in cointegrated system.
    trend : str {"c", "ct"}
        The trend term included in regression for cointegrating equation.

        * "c" : constant.
        * "ct" : constant and linear trend.
        * also available quadratic trend "ctt", and no constant "nc".

    method : {"aeg"}
        Only "aeg" (augmented Engle-Granger) is available.
    maxlag : None or int
        Argument for `adfuller`, largest or given number of lags.
    autolag : str
        Argument for `adfuller`, lag selection criterion.

        * If None, then maxlag lags are used without lag search.
        * If "AIC" (default) or "BIC", then the number of lags is chosen
          to minimize the corresponding information criterion.
        * "t-stat" based choice of maxlag.  Starts with maxlag and drops a
          lag until the t-statistic on the last lag length is significant
          using a 5%-sized test.
    return_results : bool
        For future compatibility, currently only tuple available.
        If True, then a results instance is returned. Otherwise, a tuple
        with the test outcome is returned. Set `return_results=False` to
        avoid future changes in return.

    Returns
    -------
    coint_t : float
        The t-statistic of unit-root test on residuals.
    pvalue : float
        MacKinnon's approximate, asymptotic p-value based on MacKinnon (1994).
    crit_value : dict
        Critical values for the test statistic at the 1 %, 5 %, and 10 %
        levels based on regression curve. This depends on the number of
        observations.

    Notes
    -----
    The Null hypothesis is that there is no cointegration, the alternative
    hypothesis is that there is cointegrating relationship. If the pvalue is
    small, below a critical size, then we can reject the hypothesis that there
    is no cointegrating relationship.

    P-values and critical values are obtained through regression surface
    approximation from MacKinnon 1994 and 2010.

    If the two series are almost perfectly collinear, then computing the
    test is numerically unstable. However, the two series will be cointegrated
    under the maintained assumption that they are integrated. In this case
    the t-statistic will be set to -inf and the pvalue to zero.

    TODO: We could handle gaps in data by dropping rows with nans in the
    Auxiliary regressions. Not implemented yet, currently assumes no nans
    and no gaps in time series.

    References
    ----------
    .. [1] MacKinnon, J.G. 1994  "Approximate Asymptotic Distribution Functions
       for Unit-Root and Cointegration Tests." Journal of Business & Economic
       Statistics, 12.2, 167-76.
    .. [2] MacKinnon, J.G. 2010.  "Critical Values for Cointegration Tests."
       Queen's University, Dept of Economics Working Papers 1227.
       http://ideas.repec.org/p/qed/wpaper/1227.html
    """
    y0 = array_like(y0, "y0")
    y1 = array_like(y1, "y1", ndim=2)
    trend = string_like(trend, "trend", options=("c", "nc", "ct", "ctt"))
    method = string_like(method, "method", options=("aeg",))
    maxlag = int_like(maxlag, "maxlag", optional=True)
    autolag = string_like(
        autolag, "autolag", optional=True, options=("aic", "bic", "t-stat")
    )
    return_results = bool_like(return_results, "return_results", optional=True)

    nobs, k_vars = y1.shape
    k_vars += 1  # add 1 for y0

    if trend == "nc":
        xx = y1
    else:
        xx = add_trend(y1, trend=trend, prepend=False)

    res_co = OLS(y0, xx).fit()

    if res_co.rsquared < 1 - 100 * SQRTEPS:
        res_adf = adfuller(
            res_co.resid, maxlag=maxlag, autolag=autolag, regression="nc"
        )
    else:
        warnings.warn(
            "y0 and y1 are (almost) perfectly colinear. "
            "Cointegration test is not reliable in this case.",
            CollinearityWarning,
        )
        # Edge case where series are too similar
        res_adf = (-np.inf,)

    # no constant or trend, see egranger in Stata and MacKinnon
    if trend == "nc":
        crit = [np.nan] * 3  # 2010 critical values not available
    else:
        crit = mackinnoncrit(N=k_vars, regression=trend, nobs=nobs - 1)
        # nobs - 1, the -1 is to match egranger in Stata, I do not know why.
        # TODO: check nobs or df = nobs - k

    pval_asy = mackinnonp(res_adf[0], regression=trend, N=k_vars)
    return res_adf[0], pval_asy, crit
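
As a rough illustration of the two steps described in the docstring above (a cointegrating regression with a constant, then a unit-root test on its residuals), here is a minimal pure-Python sketch. It is not the statsmodels implementation: it handles a single y1 series only, uses a plain Dickey-Fuller regression with no lag augmentation instead of `adfuller`, and does not compute MacKinnon p-values or critical values. The function names (`ols_with_constant`, `dickey_fuller_t`, `engle_granger_t`) and the synthetic data are made up for the example.

```python
import math
import random


def ols_with_constant(y, x):
    """Step 1: fit y = a + b*x by least squares; return (a, b, residuals)."""
    n = len(y)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    b = sxy / sxx
    a = mean_y - b * mean_x
    resid = [yi - (a + b * xi) for xi, yi in zip(x, y)]
    return a, b, resid


def dickey_fuller_t(resid):
    """Step 2: t-statistic of rho in  diff(e)_t = rho * e_{t-1} + u_t.

    No constant and no lagged differences (a simplification of the
    augmented test used by statsmodels).
    """
    lagged = resid[:-1]
    diff = [resid[t] - resid[t - 1] for t in range(1, len(resid))]
    s_ll = sum(e * e for e in lagged)
    rho = sum(e * d for e, d in zip(lagged, diff)) / s_ll
    ssr = sum((d - rho * e) ** 2 for e, d in zip(lagged, diff))
    sigma2 = ssr / (len(diff) - 1)  # one estimated parameter (rho)
    return rho / math.sqrt(sigma2 / s_ll)


def engle_granger_t(y0, y1):
    """Two-step statistic: regress y0 on y1, then test the residuals."""
    _, _, resid = ols_with_constant(y0, y1)
    return dickey_fuller_t(resid)


# Synthetic example (assumed data): y1 is a random walk and y0 follows it
# with stationary noise, so the pair is cointegrated by construction.
rng = random.Random(0)
y1 = [0.0]
for _ in range(499):
    y1.append(y1[-1] + rng.gauss(0, 1))
y0 = [2.0 + 0.5 * v + rng.gauss(0, 1) for v in y1]

t_stat = engle_granger_t(y0, y1)
print("Engle-Granger t-statistic:", t_stat)
```

A strongly negative statistic points towards cointegration, but it has to be compared against Engle-Granger (MacKinnon) critical values rather than ordinary t-tables, which this sketch leaves out.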

lkerr2 · January 13, 2022, 12:00am

Hi,

I see you’ve started a new thread about cointegration, so we can continue our conversation over there: Johansen cointegration test kdb+ implement - KX Learning Hub

Thanks,

Laura

lkerr2 · December 6, 2021, 12:00am

Thanks for your questions!

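If f1 in your example is a column, you can simply assign the result of your select statement to a variable, e.g.

f1var: select f1 from table1

f1var is then available for you to use as a variable. Note that select returns a one-column table; use exec f1 from table1 instead if you want the column values as a list.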

I’ll need a little more information to answer this question fully for you. By cointegration, do you mean combining two columns into one? If so, this will depend on the datatypes of the columns.

You can see an example of combining two columns into a single column here: https://code.kx.com/q/basics/qsql/#computed-columns

The drop command in SQL is similar to the delete keyword in q. You can delete a table from memory by running something like

q)tables[]
`aggregate_quote`aggregate_quote1`trades
q)delete aggregate_quote1 from `.
`.
q)tables[]
`aggregate_quote`trades

Take care when using this. The delete keyword will only work for removing variables from a namespace and will not delete from a partitioned database, for example. More info here https://code.kx.com/q/ref/delete/

I’m not familiar with ta-lib myself, but from a quick google I can see that it is a python library for technical analysis of financial data. We don’t have libraries as such for this kind of analysis - financial analysis is really q’s bread and butter, with or without use of one of our KX products. But we also have the option to integrate with python in a few different ways: https://code.kx.com/pykdb/comparisons.html

Hope this helps!

Laura
