there is a code which is tested worked in small data, looking for someone to optimize the code to process large data from database using any option i.e. multithreading, multiprocessing, splitting in chank or any other method you know.
What the code do?
there is two database (sqllite db)
it takes two table from first db the column IP and look on second db table if it can find the same IP there.
What are the hardware constraints? What are the software requirements (i.e., are virtualenvs required, or are system packages able to be installed? Linux or Windows?)?
I've got nearly 7 years Python development experience on both Windows and Linux. I can deliver and improved algorithm in a day.
I am a Phd Student in Big Data and deep learning, i will optimize your code by ussing big data technologies if you have a cluster or by multithreading, i am interested on this job, just count on me you wouldn't be disappointed.