TIBCO Statistica Data Scientist

A powerful package for processing big data projects using ETL, AI and machine learning tools.
Processes and models you create can be deployed independently into third-party applications.

The Data Scientist version of Statistica is used primarily for analyzing large (big data) sets, creating advanced predictive models, and other complex data projects. The package includes a range of machine learning, AI and ETL tools.

The Data Scientist version is intended primarily for data scientists and analysts who predict and model the behaviour of variables under different conditions and deploy the resulting models and processes into third-party applications.

The software is available in desktop, network and server versions.

Import of data

Data Scientist is fully compatible with xlsx (including xls) and csv files and with fixed-width data (e.g. in text files). It allows you to:

  • retrieve data from SQL, NoSQL and other databases,
  • retrieve data from the OSIsoft PI System (a popular solution for operational data management) via the integrated PI connector,
  • import Spotfire SBDF data files,
  • integrate two or more data sets into one graphical environment and a series of outputs.

Data preparation

Data Scientist offers automated cleansing (or recoding) of duplicate, inconsistent and outlying values using the Data Health Check (DHC) function.
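
To make the idea of cleansing duplicate and outlying values concrete, here is a minimal Python sketch. It is a generic illustration only and does not reproduce how the Data Health Check works internally; the function names `drop_duplicates` and `iqr_outliers` and the 1.5 × IQR threshold are assumptions chosen for the example.

```python
import statistics


def drop_duplicates(rows):
    """Return rows with exact duplicates removed, keeping first occurrences."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)  # rows must contain hashable values
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


if __name__ == "__main__":
    rows = [(1, "a"), (1, "a"), (2, "b")]
    print(drop_duplicates(rows))
    print(iqr_outliers([10, 11, 12, 13, 14, 15, 100]))
```

Flagged values can then be removed or recoded, depending on what the analysis requires.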

For advanced data transformation, the Rules Builder tool lets you process data from different sources according to complex rules (including conditional expressions).

For easier processing, bring your data closer to a normal distribution by using the built-in Box-Cox transformation.
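
For readers unfamiliar with the Box-Cox transformation, the following Python sketch shows the standard formula and a simple grid search for the parameter lambda by maximum likelihood. It is an illustration under stated assumptions, not the implementation used by Statistica; the function names and the grid of candidate lambdas are hypothetical.

```python
import math


def box_cox(values, lam):
    """Apply the Box-Cox transform to strictly positive values."""
    if any(v <= 0 for v in values):
        raise ValueError("Box-Cox requires strictly positive values")
    if lam == 0:
        return [math.log(v) for v in values]
    return [(v ** lam - 1.0) / lam for v in values]


def log_likelihood(values, lam):
    """Profile log-likelihood of lam, assuming normal transformed data."""
    n = len(values)
    y = box_cox(values, lam)
    mean = sum(y) / n
    var = sum((t - mean) ** 2 for t in y) / n  # requires non-constant data
    return -0.5 * n * math.log(var) + (lam - 1.0) * sum(math.log(v) for v in values)


def best_lambda(values, grid=None):
    """Pick the lambda from the grid with the highest log-likelihood."""
    if grid is None:
        grid = [i / 10 for i in range(-20, 21)]  # -2.0 to 2.0 in steps of 0.1
    return max(grid, key=lambda lam: log_likelihood(values, lam))


if __name__ == "__main__":
    # Right-skewed sample whose logarithms are symmetric around zero.
    sample = [math.exp(x) for x in (-2, -1, 0, 1, 2)]
    lam = best_lambda(sample)
    print(lam, box_cox(sample, lam))
```

A lambda near 0 corresponds to a log transform, a lambda of 1 leaves the shape of the data unchanged, and the transform is defined only for strictly positive values.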

Data evaluation

In Data Scientist, you can evaluate measured data (including big data files) using, among other things:

  • classical methods of descriptive, parametric and non-parametric statistics,
  • exploratory analysis and visualization,
  • multivariate statistical methods for data organization and classification,
  • advanced linear and non-linear models,
  • estimation of multiple variance components and precision in data sets (Variance Estimation and Precision).

Predictive modelling

Use data mining, text mining and neural network tools to create models of the behaviour of the observed variables in different situations.

Models can be generated in C, C++, C#, Java, PMML, SAS and SQL and further modified as required.

Data Scientist also offers, for example, decision trees and random forests, as well as predictor optimization.

Other features

This version of Statistica also lets you program custom scripts in R, Python or C#. The Data Scientist package can also be used, for example, for:

  • understanding the key parameters affecting critical quality attributes (process analysis, quality control and multivariate statistical process control functions),
  • design of experiments and their virtual execution (Design of Experiments, Power Analysis and Interval Estimation functions),
  • deployment of developed processes and models into third-party applications (with autonomous functionality independent of TIBCO Statistica).

Visualisation and outputs

In the Data Scientist version, you can view the distribution of the data and the results using, among others, histograms, line, box, point, scatter and quantile plots and other frequently used 2D and 3D visualization methods.

The results can be exported, for example, as:

  • simple and advanced reports,
  • entries in different types of databases,
  • MS Word (docx), MS Excel (xlsx), text (csv) or PDF files.

Special products are available for universities as part of the University Programme.