Data Scientist and Big Data Engineer @ The project involves developing big data analytics to detect cyber security attacks at a nation level (midspace) in real time as well as in batches. This involves heavy use of Spark (Java) to analyse the incoming data and create alerts. The alerts are then analysed using IPython Notebooks.
I additionally work on developing the platform, which involves several Big Data technologies.
★ Data Science Technologies
* Spark, Spark Streaming, Java 8, Accumulo, Kafka
* Python, IPython Notebook, Pandas, NumPy SciPy, Matplotlib
★ Big Data Technologies
* HDFS
* Yarn
* Hue
* Cloudera Manager
* Oozie From June 2015 to Present (7 months) Guildford, United KingdomSoftware Developer @ Worked on a software project building a product that supports law enforcement agencies and the secret services with identifying and intercepting terrorists and other criminals.
The product is written in Java and is deployed on powerful GNU/Linux virtual machines. I have been writing performance optimised, multithreaded Java code as well as assisting in the deployment of the virtual machines (ESXi & vSphere).
★Technologies
* Java
* Maven
* Spring
* Multithreading
* GNU/Linux (Centos - CLI only)
* Perforce
* Git From January 2015 to June 2015 (6 months) Guildford, United KingdomPh.D. Student - Computational Mass Spectrometry @ The Ph.D. has allowed investigation and experience into many software tools and technologies, through the search for finding the best solutions to scientific problems. Below outlined are two key projects of the thesis, which were presented to Nobel Prize winner Venki Ramakrishnan in 2014.
* Amphitrite - Ion mobility mass spectrometry (IM-MS) software package (github.com/gnsiva/amphitrite)
Developed an API to interact and process IM-MS data, including novel methods for data representation and a deconvolution algorithm. Additionally a set of GUI applications were built for ease of use for non-programmers and are currently in prototype stage. The work is the basis for a £750,000 grant application between my Ph.D. supervisor and the University of Oxford.
* Challenger - GPU accelerated genetic algorithm (GA)
Developed a set of tools for processing extracting information across multiple IM-MS data files simultaneously, this included GUI and command line interface components.
The actual GA was computationally expensive, and the use case required the algorithm to complete within 30 minutes. This was not possible using pure Python and C implementations, and so Cuda C was used.
* Analysis of B-cell epitope discontinuity
Queried a large database of protein structures (pdb.org) and downloaded the data using urllib2. Analysing the data showed that a particular biotechnology product which costs thousands of pounds would not work. From September 2010 to September 2014 (4 years 1 month) London, United KingdomMSc Student in Bioinformatics @ Developed new software to analyse isothermal titration calorimetry (ITC) data. The data is frequently very noisy and often shows baseline drift. The aim of the project was to improve the accuracy, precision and reproducibility of the data obtained from ITC instruments.
Several filtering methods were used in analysing these data including the Hamming window filter for smoothing, inverse fast fourier transforms to investigate the periodicity of the noise and curve fitting for correcting baseline drift. The work was carried out using Python and the PyLab package. From September 2009 to September 2010 (1 year 1 month) London, United KingdomTechnical Correspondent @ As a technical correspondent, was responsible for dealing directly with clients and answering technical questions. While there, developed an MS Excel macro which saved 10 minutes per day for each of the 4 technical correspondents and this macro was used by the company for at least 4 years. From September 2008 to September 2009 (1 year 1 month) Oxford, United Kingdom
Doctor of Philosophy (Ph.D.), Computational data analysis advances in ion mobility mass spectrometry @ University College London, U. of London From 2010 to 2014 Master's Degree, Bioinformatics, Distinction @ Birkbeck, University of London From 2009 to 2010 Bachelor's Degree, Biological Sciences, 2:1 @ University of Brighton From 2005 to 2008 Ganesh Sivalingam is skilled in: Python, Data Analysis, Software Development, Java, Data Science, Big Data, GNU/Linux, C, Git, MySQL, CUDA, R, Statistics, Scientific Writing, Bioinformatics
Websites:
http://mscalculator.com