Learn By Examples - Introduction to Data and Text Mining using DSTK3

Front Cover
SVBook - 107 pages
0 Reviews

 DSTK - Data Science Toolkit 3 is a set of data and text mining softwares, following the CRISP DM model. DSTK offers data understanding using statistical and text analysis, data preparation using normalization and text processing, modeling and evaluation for machine learning and statistical learning algorithms. 

DSTK 3 contains DSTK Engine as interpreter, DSTK ScriptWriter as a simple IDE, DSTK Studio providing a SPSS Statistics like easy to use interface with DSTK Engine, and DSTK Text Explorer provides a GUI for Text Mining. DSTK Studio and DSTK Text Explorer, however, need a small payment of 59 usd to support our development. DSTK Engine and DSTK ScriptWriter are free.

This book is going to be an easy guide to get started using DSTK 3 softwares for data and text mining.


Introduction

Getting Started

DSTK ScriptWriter Essentials

DSTK Studio Essentials

DSTK Text Explorer Essentials

Conclusion


This book has been taught at Udemy and EMHAcademy.com.

Use the following Link to get the Udemy Course for FREE:

https://www.udemy.com/introduction-to-data-and-text-mining-using-dstk-3/learn/v4/

 

What people are saying - Write a review

We haven't found any reviews in the usual places.

Selected pages

Common terms and phrases

About the author

 Eric Goh is a data scientist, software engineer, adjunct faculty and entrepreneur with years of experiences in multiple industries. His varied career includes data science, data and text mining, natural language processing, machine learning, intelligent system development, and engineering product design. He founded SVBook and extended it with DSTK.Tech (http://dstk.tech) and EMHAcademy.com. DSTK.Tech is where Eric develops his own DSTK data science softwares. Eric also publishes 5 books at LeanPub and SVBook, and teaches the content at Udemy and EMHAcademy.com. During his free time, Eric is also an adjunct faculty at University of the People.

Eric Goh has been leading his teams for various industrial projects, including the advanced product code classification system project which automates Singapore Custom’s trade facilitation process, and Nanyang Technological University's data science projects where he develop his own DSTK data science software. He has years of experience in C#, Java, C/C++, SPSS Statistics and Modeller, SAS Enterprise Miner, R, Python, Excel, Excel VBA and etc. He won Tan Kah Kee Young Inventors' Merit Award and Shortlisted Entry for TelR Data Mining Challenge.

He holds a Masters of Technology degree from the National University of Singapore, an Executive MBA degree from U21Global (currently GlobalNxt) and IGNOU, a Graduate Diploma in Mechatronics from A*STAR SIMTech (a national research institute located in Nanyang Technological University), and Coursera Specialization Certificate in Business Statistics and Analysis from Rice University. He possessed a Bachelor of Science degree in Computing from the University of Portsmouth after National Service. He is also a AIIM Certified Business Process Management Master (BPMM), GSTF certified Big Data Science Analyst (CBDSA), and IES Certified Lecturer.

Specialties: Data Science, Text Mining, Social Network Analysis, Natural Language Processing, Machine Learning, Software Engineering, Mechatronics, Business. 

Bibliographic information