Using python inside kdb+; anyone would like to participate for code review, writing documentation and test cases?

kuentang · April 12, 2016, 8:41am

<o:p> </o:p>

Hi all,<o:p></o:p>

<o:p> </o:p>

My tool to execute python code inside kdb+ like q/rserver is ready. Unfortunately documentation and test cases are missing. If anyone would like to participate/volunteer please drop me an email and I can give you access to my github repository then.<o:p></o:p>

<o:p> </o:p>

And a talk is scheduled for June in London during python meetup to showcase this tool.<o:p></o:p>

<o:p> </o:p>

Thanks,<o:p></o:p>

<o:p> </o:p>

Kim<o:p></o:p>

<o:p> </o:p>

–
Kim Tang<o:p></o:p>

<o:p> </o:p>

Alexander_Belop · June 13, 2016, 8:40pm

On Tuesday, April 12, 2016 at 3:41:37 AM UTC-4, kuentang wrote:

My tool to execute python code inside kdb+ like q/rserver is ready.

Why are you reinventing the wheel? Doesn’t PyQ do what you need already?

For example,

q)p)print “Hello word!”

Hello word!

See pyq.enlnt.com for details.

kuentang · June 15, 2016, 9:08am

<o:p> </o:p>

Hi Sasha,<o:p></o:p>

<o:p> </o:p>

>>> Why are you reinventing the wheel? Doesn’t PyQ do what you need already?<o:p></o:p>

<o:p> </o:p>

I do miss that PyQ is able to execute python inside q. <o:p></o:p>

<o:p> </o:p>

cat p.k<o:p></o:p>

PYVER:“2.7”<o:p></o:p>

SOEXT:“.so\000”<o:p></o:p>

PSO:`p<o:p></o:p>

lib:“libpython”,PYVER,SOEXT<o:p></o:p>

PYTHON:“python.py”<o:p></o:p>

<o:p> </o:p>

`QVER setenv$_.Q.k<o:p></o:p>

\d .p<o:p></o:p>

@[.[PSO]2:(i;1);. `lib]<o:p></o:p>

e:{e0 x,“\000”;}<o:p></o:p>

\d .<o:p></o:p>

p)import sys, os<o:p></o:p>

p)from pyq import q<o:p></o:p>

p)sys.executable = str(q.PYTHON)<o:p></o:p>

p)script = str(q(“.z.f”))<o:p></o:p>

p)sys.argv = [script] + [str(a) for a in q(“.z.x”)]<o:p></o:p>

p)sys.modules[‘__main__’].__file__ = script<o:p></o:p>

p)sys.path.insert(0, os.path.dirname(script))<o:p></o:p>

p)del sys, os, script<o:p></o:p>

<o:p> </o:p>

It looks like that pyq is just using one function to execute python inside q and unfortunately I was missing get and set functions like in r/q-server to pass data seamlessly between python and q. As you might agree to be able to execute python inside q is one thing but you also need to be able to get the results back from your python calculation. <o:p></o:p>

<o:p> </o:p>

How is it possible to get results from a python calculation back in q using pyq?<o:p></o:p>

I can see that you can use sd0/sd1 from the _k module and does it help?<o:p></o:p>

<o:p> </o:p>

cat test_sd.q<o:p></o:p>

p)import os<o:p></o:p>

p)import sys<o:p></o:p>

p)from pyq import _k<o:p></o:p>

p)def f(d):<o:p></o:p>

x = os.read(d, 1)<o:p></o:p>

print(x.decode())<o:p></o:p>

sys.stdout.flush()<o:p></o:p>

_k.sd0(d)<o:p></o:p>

p)r,w = os.pipe()<o:p></o:p>

p)_k.sd1(r, None)<o:p></o:p>

p)os.write(w, b’X’)<o:p></o:p>

<o:p> </o:p>

Nevertheless the module that I have invented is able to execute the following q code:<o:p></o:p>

<o:p> </o:p>

cat plot_cv_predict.q <o:p></o:p>

n) “Plotting Cross-Validated Predictions”<o:p></o:p>

<o:p> </o:p>

/ loading required libraries from python<o:p></o:p>

n)from sklearn import datasets<o:p></o:p>

from sklearn.cross_validation import cross_val_predict<o:p></o:p>

from sklearn import linear_model<o:p></o:p>

import matplotlib.pyplot as plt<o:p></o:p>

<o:p> </o:p>

/ execute the function in python<o:p></o:p>

<o:p> </o:p>

n) lr = linear_model.LinearRegression()<o:p></o:p>

boston = datasets.load_boston()<o:p></o:p>

<o:p> </o:p>

/ get result back from python<o:p></o:p>

y:neval"boston.target"<o:p></o:p>

<o:p> </o:p>

/ # cross_val_predict returns an array of the same size as y where each entry<o:p></o:p>

/ # is a prediction obtained by cross validated:<o:p></o:p>

<o:p> </o:p>

/ pass y back to python using backtick<o:p></o:p>

n) predicted = cross_val_predict(lr, boston.data, `y, cv=10)<o:p></o:p>

<o:p> </o:p>

n) fig, ax = plt.subplots()<o:p></o:p>

n) ax.scatter(y, predicted)<o:p></o:p>

n) ax.plot([y.min(), y.max()], [y.min(), y.max()], ‘k–’, lw=4)<o:p></o:p>

n) ax.set_xlabel(‘Measured’)<o:p></o:p>

n) ax.set_ylabel(‘Predicted’)<o:p></o:p>

n) plt.show()<o:p></o:p>

<o:p> </o:p>

qpredicted:neval “predicted”<o:p></o:p>

<o:p> </o:p>

(::)t:(a:y;b:qpredicted)<o:p></o:p>

<o:p> </o:p>

n) fig = plt.figure()<o:p></o:p>

n) `t.plot(x=‘a’,y=‘b’)<o:p></o:p>

n) plt.show()<o:p></o:p>

<o:p> </o:p>

/ a table in q can be passed as panda in python<o:p></o:p>

neval"str(type(t))"<o:p></o:p>

/ “<class ‘pandas.core.frame.DataFrame’>”<o:p></o:p>

<o:p> </o:p>

/ pandas can be retrieved as table from python<o:p></o:p>

t~neval"t"<o:p></o:p>

/ 1b<o:p></o:p>

<o:p> </o:p>

Kim<o:p></o:p>

<o:p> </o:p>

Von: personal-kdbplus@googlegroups.com [mailto:personal-kdbplus@googlegroups.com] Im Auftrag von Sasha
Gesendet: Montag, 13. Juni 2016 21:41
An: Kdb+ Personal Developers
Betreff: [personal kdb+] Re: Using python inside kdb+; anyone would like to participate for code review, writing documentation and test cases?<o:p></o:p>

<o:p> </o:p>

Alexander_Belop · June 26, 2016, 7:58pm

On Wednesday, June 15, 2016 at 4:08:48 AM UTC-4, kuentang wrote:

How is it possible to get results from a python calculation back in q using pyq?

It is often not a good idea to call python from q. I consider Python to be a higher level language where you have classes, modules and other features useful for organizing the high level of your application, whereas q is best at the data access level. The only valid reason to call Python from q is in a top level callback. It is mostly because of this philosophy that I have never perfected passing Python results to q. Nevertheless, there are several ways to work around this limitation.

You can assign the results of python computation to a q global inside Python:

**q)**p)q.x = 20 + 22

**q)**x

42

If you define a Python function that takes instances of class K and returns an instance of class K, you can expose it to q by simply assigning it to a q global:

**q)**p)def f(x, y): return x + y

**q)**p)q.f = f

**q)**f(20;22)

42

(Note that the exposed function is monadic taking a list of arguments.) Such function can be used freely in q callbacks.

kuentang · June 26, 2016, 11:21pm

Thanks for sharing.<o:p></o:p>

<o:p> </o:p>

>>> I consider Python to be a higher level language where you have classes, modules and other features useful for organizing the high level of your application, whereas q is best at the data access level.<o:p></o:p>

<o:p> </o:p>

In my opinion q is good both for data access and implementing an application (see tick+ for example).<o:p></o:p>

<o:p> </o:p>

Kim<o:p></o:p>

<o:p> </o:p>

Von: personal-kdbplus@googlegroups.com [mailto:personal-kdbplus@googlegroups.com] Im Auftrag von Sasha
Gesendet: Sonntag, 26. Juni 2016 20:59
An: Kdb+ Personal Developers
Betreff: Re: [personal kdb+] Re: Using python inside kdb+; anyone would like to participate for code review, writing documentation and test cases?<o:p></o:p>

<o:p> </o:p>

Alexander_Belop · June 27, 2016, 2:22am

On Sunday, June 26, 2016 at 6:21:16 PM UTC-4, kuentang wrote:

>>> I consider Python to be a higher level language where you have classes, modules and other features useful for organizing the high level of your application, whereas q is best at the data access level.

In my opinion q is good both for data access and implementing an application (see tick+ for example).

I agree, but if your app is written in q, why do you need Python? :-)

Seriously, though the main idea in PyQ is to expose kdb+ data objects as first class Python objects.

Consider this example (in IPython):

In [5]: t = %q (n:til 10;r:10?1f)

In [6]: t

Out[6]:

n r

0 0.3927524

1 0.5170911

2 0.5159796

3 0.4066642

4 0.1780839

5 0.3017723

6 0.785033

7 0.5347096

8 0.7111716

9 0.411597

In [7]: np.log10(t.r, out=np.asarray(t.r))

Out[7]:

array([-0.40588117, -0.28643292, -0.28736743, -0.39076407, -0.74937544,

-0.52032069, -0.10511208, -0.27188201, -0.14802557, -0.38552782])

In [8]: t

Out[8]:

n r

0 -0.4058812

1 -0.2864329

2 -0.2873674

3 -0.3907641

4 -0.7493754

5 -0.5203207

6 -0.1051121

7 -0.271882

8 -0.1480256

9 -0.3855278

In [9]: t.reverse[-3:]

Out[9]:

n r

2 -0.2873674

1 -0.2864329

0 -0.4058812

Here, the table t is created in q, but is manipulated entirely in Python. PyQ code can be easily understood by a Python programmer with a minimal knowledge of kdb+ concepts.

Note that calling a numpy function on q data (see In[7] above) could be arranged from q as well

q)t:(n:til 10;r:10?1f)

q)t

n r

0 0.4931835

1 0.5785203

2 0.08388858

3 0.1959907

4 0.375638

5 0.6137452

6 0.5294808

7 0.6916099

8 0.2296615

9 0.6919531

q)p)import numpy as np

q)p)np.log10(q.t.r, out=np.asarray(q.t.r))

q)t

n r

0 -0.3069915

1 -0.2376814

2 -1.076297

3 -0.7077645

4 -0.4252305

5 -0.2120119

6 -0.2761498

7 -0.1601388

8 -0.6389117

9 -0.1599234

In this case, however there is no need to return a Python object to q because the result is written directly to a kdb+ table.

On the other hand, if we implement an automatic Python to q translation, something like “p)np.log10(q.t.r, out=np.asarray(q.t.r))”

would be wasteful because an extra copy of t.r will be created just to be returned to q and discarded there.

kuentang · June 27, 2016, 7:49am

<o:p> </o:p>

>>>I agree, but if your app is written in q, why do you need Python? :-)<o:p></o:p>

<o:p> </o:p>

Unfortunately there are not that many scientific libraries compare to python (scikit), r or matlab in q available. And you dont want to spend 1 2 weeks to implement say svm. In this case you just trigger a function in python.<o:p></o:p>

>>> On the other hand, if we implement an automatic Python to q translation, something like “p)np.log10(q.t.r, out=np.asarray(q.t.r))” would be wasteful because an extra copy of t.r will be created just to be returned to q and discarded there.<o:p></o:p>

Yes, it is wasteful and there is unfortunately no workaround about it. L I think this is the price we need to pay.<o:p></o:p>

<o:p> </o:p>

Kim<o:p></o:p>

<o:p> </o:p>

Von: personal-kdbplus@googlegroups.com [mailto:personal-kdbplus@googlegroups.com] Im Auftrag von Sasha
Gesendet: Montag, 27. Juni 2016 03:22
An: Kdb+ Personal Developers
Betreff: Re: [personal kdb+] Re: Using python inside kdb+; anyone would like to participate for code review, writing documentation and test cases?<o:p></o:p>

<o:p> </o:p>

Alexander_Belop · June 27, 2016, 10:04pm

On Monday, June 27, 2016 at 2:50:55 AM UTC-4, kuentang wrote:

>>> On the other hand, if we implement an automatic Python to q translation, something like “p)np.log10(q.t.r, out=np.asarray(q.t.r))” would be wasteful because an extra copy of t.r will be created just to be returned to q and discarded there.

Yes, it is wasteful and there is unfortunately no workaround about it.

Maybe a trailing ; can signify that return value should be suppressed.

carfield11 · June 30, 2016, 4:04pm

On Monday, June 27, 2016 at 6:21:16 AM UTC+8, kuentang wrote:

Thanks for sharing.

>>> I consider Python to be a higher level language where you have classes, modules and other features useful for organizing the high level of your application, whereas q is best at the data access level.

In my opinion q is good both for data access and implementing an application (see tick+ for example).

Kim

Yes, opinion is difference by difference people, and each language have their edge, q is simple and concise, but in the meantime it miss some features, say for quite a other language, if there is error, there is exception come with stacktrace. However, in q usually there is just an error word

kuentang · June 30, 2016, 8:43pm

<o:p> </o:p>

>>> Yes, opinion is difference by difference people, and each language have their edge, q is simple and concise,<o:p></o:p>

True true and opinion will change or advance in time. At the beginning I thought that kdb+ is like excel but in console. J But after say 3 months my opinion has changed. And after reading this article https://kx.com/_media-coverage/0809-AutoTrader-sm.pdf my opinion has changed again.<o:p></o:p>

<o:p> </o:p>

>>> but in the meantime it miss some features, say for quite a other language, if there is error, there is exception come with stacktrace. However, in q usually there is just an error word<o:p></o:p>

<o:p> </o:p>

Check Aaron Daviess presentation about dequeing bugs http://www.q-ist.com/2013/03/my-kdb-user-meeting-presentation.html . It will show how to debug an exception efficiently. <o:p></o:p>

<o:p> </o:p>

Funny enough is that I wanted something similar that is able to trace back an exception in runtime. And instead asking around I come up with my own library which is called bt, short for behaviour tag engine: https://github.com/kimtang/btlib-bt . The pattern is stolen from Chris Grangers blog: http://www.chris-granger.com/2013/01/24/the-ide-as-data/ <o:p></o:p>

<o:p> </o:p>

Btw I will be in hkg next month. If you have time for a coffee or tea just let me know.<o:p></o:p>

<o:p> </o:p>

Kim<o:p></o:p>

<o:p> </o:p>

Von: personal-kdbplus@googlegroups.com [mailto:personal-kdbplus@googlegroups.com] Im Auftrag von Carfield Yim
Gesendet: Donnerstag, 30. Juni 2016 17:05
An: Kdb+ Personal Developers
Betreff: Re: [personal kdb+] Re: Using python inside kdb+; anyone would like to participate for code review, writing documentation and test cases?<o:p></o:p>

<o:p> </o:p>

On Monday, June 27, 2016 at 6:21:16 AM UTC+8, kuentang wrote:<o:p></o:p>

Thanks for sharing.<o:p></o:p>

<o:p></o:p>

>>> I consider Python to be a higher level language where you have classes, modules and other features useful for organizing the high level of your application, whereas q is best at the data access level.<o:p></o:p>

<o:p></o:p>

In my opinion q is good both for data access and implementing an application (see tick+ for example).<o:p></o:p>

<o:p></o:p>

Kim<o:p></o:p>

<o:p></o:p>

Yes, opinion is difference by difference people, and each language have their edge, q is simple and concise, but in the meantime it miss some features, say for quite a other language, if there is error, there is exception come with stacktrace. However, in q usually there is just an error word<o:p></o:p>

–
Submitted via Google Groups

Jiahe_Xi · February 3, 2017, 2:49pm

Hi Kim,

I would like to particiate in the test. Can you drop me the link to jiahe.xi@gmail.com please? It’s exactly what i was looking for - i want to call the python’s statsmodel module in q for some stats functions.

best,

robert

Topic		Views
PyQ 4.0 (Python for kdb+) release announcement Community Support kdb-and-q	12	June 8, 2017
For anyone who's interested, I've made a toolkit for q/kdb+ that Community Support kdb-and-q	2	May 13, 2014
[personal kdb+] PyQ Community Support kdb-and-q	0	May 7, 2010
kdb+ and Python Community Support kdb-and-q	5	November 22, 2017
Accessing Python inside q Community Support kdb-and-q	10	March 11, 2011

Using python inside kdb+; anyone would like to participate for code review, writing documentation and test cases?

Related topics