DataWarrior Features

General

  • Interactive data visualization and analysis
  • Built-in chemical intelligence
  • Real-time data filtering on alphanumerical and chemical criteria
  • Prediction of molecular properties from the chemical structure
  • Dedicated cheminformatics modules support drug discovery
  • Installation contains user manual and many example files
  • Runs on Linux, Macintosh (with retina support) and Windows
  • Computationally demanding algorithms use all processor cores

Files

  • Reads and writes its own native text-based file formats
  • Imports TAB delimited txt, csv, sdf (version 2 & 3), interprets SMILES codes
  • Imports from clipboard content
  • Exports TAB delimited txt, sdf (version 2 & 3)
  • Flexible file merge and append options

Views

  • Table view with columns containing alphanumerical or chemical information
  • Versatile graphical 2D-view for scatter plots, bar & pie charts, box plots, ...
  • Graphical freely rotatable 3D-view for scatter plots & bar charts
  • Dedicated chemical structure view with optional alphanumerical data
  • Form based view with form designer and form based data editing
  • Multiple views are shown side by side or are stacked on top of each other
  • Views can be highly customized to reveal multiple dimensions of the data

Filter Types

  • Text filters with support for regular expressions
  • Data range sliders for numerical and date columns
  • Category filters with individually selectable categories
  • Category browser to manually or automatically switch categories
  • Substructure filter with flexible query features and real-time filtering
  • Filtering on various shades of compound similarity
  • Special filters screen against lists of compounds or substructures
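
As a rough illustration of how the text and range filters above behave, here is a minimal Python sketch. It is not DataWarrior's implementation: the function names, the row layout, and the sample values are hypothetical, and it assumes that active filters combine with a logical AND.

```python
import re

def text_filter(rows, column, pattern):
    """Keep rows whose value in `column` matches the regular expression."""
    regex = re.compile(pattern)
    return [row for row in rows if regex.search(str(row[column]))]

def range_filter(rows, column, low, high):
    """Keep rows whose numerical value lies in the closed range [low, high]."""
    return [row for row in rows if low <= row[column] <= high]

# Hypothetical example rows: a name column and a molecular weight column.
rows = [
    {"name": "aspirin", "mw": 180.16},
    {"name": "caffeine", "mw": 194.19},
    {"name": "ibuprofen", "mw": 206.28},
]

# Assumption: a row stays visible only if it passes every active filter.
by_name = text_filter(rows, "name", r"^(a|i)")
visible = range_filter(by_name, "mw", 200.0, 250.0)
print([row["name"] for row in visible])
```

Chaining the functions mirrors stacking several filters: each one narrows the set of rows that the next one sees.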

Data Analysis

  • Data pivoting and reverse pivoting
  • Calculation of new columns from custom expressions
  • Principal Component Analysis
  • Self Organizing Maps
  • Calculation and display of statistical parameters
  • Creation and manipulation of persistent row lists for many purposes
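
To make the pivoting item concrete, the following Python sketch turns a "long" table (one row per measurement) into a "wide" one (one entry per compound) and back again. It only illustrates the general idea; the function names, column names, and data are hypothetical and do not reflect how DataWarrior stores or transforms tables.

```python
from collections import defaultdict

def pivot(rows, key, category, value):
    """Long to wide: group rows by `key`, one field per `category` value."""
    table = defaultdict(dict)
    for row in rows:
        table[row[key]][row[category]] = row[value]
    return dict(table)

def reverse_pivot(table, key, category, value):
    """Wide to long: emit one row per (key, category) pair."""
    return [
        {key: k, category: c, value: v}
        for k, columns in table.items()
        for c, v in columns.items()
    ]

# Hypothetical measurements: compound id, assay name, result.
long_rows = [
    {"id": "A", "assay": "IC50", "result": 1.2},
    {"id": "A", "assay": "EC50", "result": 3.4},
    {"id": "B", "assay": "IC50", "result": 5.6},
]

wide = pivot(long_rows, "id", "assay", "result")
round_trip = reverse_pivot(wide, "id", "assay", "result")
print(wide)
```

Compound B has no EC50 entry in the wide table, which is how missing measurements show up after pivoting in this sketch.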

Cheminformatics

  • Fast substructure & compound similarity filtering (see descriptors)
  • Calculation of physico-chemical properties like MW, logP, logS, tPSA
  • Calculation of druglikeness, flexibility, complexity, atom/ring counts, etc.
  • Detection of toxicity risk factors for four toxicity categories
  • Enumeration of combinatorial libraries retaining scaffold atom coordinates
  • Creation of evolutionary libraries with flexible fitness criteria
  • Principal component analysis and self organizing maps on chemical descriptors
  • 2-dimensional scaling algorithm using chemical and pharmacophore similarities
  • Automatic and semi-automatic creation of structure-activity-relationship tables
  • Scaffold analysis (ring systems or Murcko scaffolds)
  • Search & Replace functionality on chemical structure columns
  • Comparison of two structure files to reveal overlap of similar structures
  • 2D atom coordinate generation with unified scaffold orientation
  • Activity cliff analysis
  • Diverse subset selection and compound clustering
  • Similarity analysis comparing two large compound sets
  • Consistently uses MDL's concept of Enhanced Stereo Recognition
  • Generation of conformers with MMFF94 energy minimization
  • Conformation explorer with raytracer for photo-realistic molecule images

Descriptors

  • FragFp: fragment dictionary based binary fingerprint (analogous to MDL keys)
  • PathFp: linear atom strands normalized, hashed, binary (analogous to Daylight fingerprints)
  • SphereFp: canonical circular fragments, hashed, binary
  • SkelSpheres: canonical circular fragments & skeletons, stereo perception, hashed, counts
  • OrgFunctions: synthetically accessible organic functionality in similarity tree
  • Flexophore: pharmacophore similarity considering diverse conformers and PDB statistics
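
The "hashed, binary" descriptors above follow a common pattern: each fragment is hashed to a bit position, and two molecules are compared by the overlap of their bit sets. The Python sketch below shows that pattern in its simplest form. It is an assumption-laden illustration, not DataWarrior's algorithm: the fragment strings, the CRC32 hash, the 512-bit length, and the choice of the Tanimoto coefficient as the similarity measure are all picked here for demonstration.

```python
import zlib

def hashed_fingerprint(fragments, n_bits=512):
    """Set one bit per fragment; the bit index is a hash of the fragment text."""
    bits = 0
    for fragment in fragments:
        index = zlib.crc32(fragment.encode("utf-8")) % n_bits
        bits |= 1 << index
    return bits

def tanimoto(fp_a, fp_b):
    """Shared bits divided by bits set in either fingerprint."""
    common = bin(fp_a & fp_b).count("1")
    total = bin(fp_a | fp_b).count("1")
    # Two empty fingerprints are treated as identical (a convention chosen here).
    return common / total if total else 1.0

# Hypothetical fragment strings standing in for real substructure fragments.
mol_a = hashed_fingerprint(["C-C", "C-O", "C=O"])
mol_b = hashed_fingerprint(["C-C", "C-N", "C=O"])
print(tanimoto(mol_a, mol_b))
```

Because different fragments can hash to the same bit, a hashed fingerprint is lossy; a dictionary-based fingerprint such as FragFp avoids collisions by assigning each predefined fragment its own bit.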

Databases

  • All chemical structures in Wikipedia can be downloaded and searched locally.
  • Fast and simple structure and target search in ChEMBL database with result retrieval.

Macros

  • (Almost) any sequence of tasks can be recorded as macro.
  • Macros can be created or edited interactively without scripting.
  • Macros make it possible to share or repeat complex tasks on updated or different data.