MarkLogic Server Features Hex

Feature-Rich Database to Integrate Data Better and Faster

Power your most important applications with a true multi-model database. Create value from complex data—search and query, build new apps, and enrich analytics and machine learning—with MarkLogic Server.

Key Features

Ingest data into MarkLogic as is, without worrying about predefined schemas and complex ETL. MarkLogic’s flexible, multi-model approach lets you bring in data from anywhere—relational databases, mainframes, fileservers, Hadoop—or any other source. It’s that easy.

Included Features

of Included Features

Multi-Model Database

Integrate all your data and metadata as a single resource in a scalable, high-performance operational database. With native storage for documents, text, graph, relational, geospatial and binary formats, you can choose the right mix of data models for your use-case without sacrificing data consistency.

Handle the growing app and data complexity with a truly multi-modal database that will comfortably meet your shifting requirements.

The MarkLogic modern multi-model database integrates data of any type and from any source while ensuring data consistency in a time- and cost-effective manner.

Effectively Consolidate Heterogeneous Data

Integrate data as is from diverse formats, applications, architectures and workflows into a single, unified backend with native support for multiple data models to fit your unique application needs.

Manage and Retrieve Reliable Information

Flexibly store and access data for all the supported data models with industry-standard programming languages, while ensuring data consistency between the different models.


Journaling of updates before they are committed to ensure transactions can be replayed in the face of system failures.

Implement Changes When You Need

Add data sources and make schema changes to support new use cases without impacting existing applications or re-ingesting the source data. Independently scale different components within the architecture as your needs change.

Included Features

of Included Features

Built-In Search

Get faster access to ingested data, better query performance and real-time alerting with a universal index. With a search engine built into MarkLogic’s core, you spend less time and effort building and configuring standard queries.

Power data integration and data access with the MarkLogic built-in search engine. Immediately search and discover any new data loaded into MarkLogic, while keeping track of your data as you harmonize it.

Leverage MarkLogic’s powerful search capabilities to build Google-like search features into apps or to improve the performance of transactional and analytical apps that require complex queries to be run efficiently.

API-First Approach

Speed up application development and delivery with a comprehensive toolkit of modern interfaces and APIs to load and access data in MarkLogic.

Sophisticated Indexing

With its “Ask Anything” Universal Index, data is indexed upon load so you can immediately begin asking questions —and gaining insights— across all your data.

Exceptional Performance

Get lightning-fast, sub-second results with native search capabilities. MarkLogic’s zero-latency indexing enables real-time results through a transactional repository.

Included Features

of Included Features

Certified Security

Protect critical information against data breaches and comply with regulators with market-leading security. The MarkLogic data platform provides robust, enterprise-grade security controls that are proven in mission-critical environments.

MarkLogic is the most secure database, providing your organization with ultimate protection, secure data sharing, back-up and compliance features.

Element- and Role-Based Access Control

Control the information users can see and edit. Assign user roles and set document permissions, determining user privileges and what they can do. MarkLogic secures data at the collection level, document level and even element/property level.

Advanced Encryption

Cutting-edge data encryption protects against unauthorized access of the database by a SysAdmin or Storage Admin. It allows data, configuration and logs to be encrypted while the files are resting on disk using AES-256 encryption, conforming to FIPS 140 criteria.

Built-In Auditing

MarkLogic closely monitors database activity and makes it possible to audit document access and updates, configuration changes, administrative actions, code execution and changes to access control.

Included Features

of Included Features

ACID Transactions

Don’t settle for a BASE-ic database. Use ACID transactions to ensure you don’t run the risk of encountering data corruption, stale reads, and inconsistent data.

If you’re running an operational and transactional system, then you need a database with all of the ACID properties (atomicity, consistency, isolation, and durability). These properties ensure that your enterprise-grade system never encounters issues like data corruption, stale reads, and inconsistent data. When it comes to mission-critical applications, ACID properties are not a “nice-to-have,” they are an absolute requirement for transactional consistency.

ACID compliance is a standard in relational databases, but MarkLogic is unique among almost all NoSQL databases because we support transactions that are 100% ACID compliant, whereas others have relaxed or even non-existent standards. MarkLogic’s ACID properties also apply to multi-document, multi-statement, and XA transactions (transactions between clusters), providing the unique reliability to run large-scale, operational systems for mission-critical use cases.

Document Locks

Document locks to protect data during updates and keep transactions from conflicting with one another.


Timestamps on documents that ensure a query only sees copies of documents that are valid at the time the query is run (also known as Multi-Version Concurrency Control).


Journaling of updates before they are committed to ensure transactions can be replayed in the face of system failures.

Commit Process

A commit process to ensure changed data is changed all at once or not at all, even across multiple hosts.

Included Features

of Included Features

Advanced Security

Out-of-the-box enterprise-grade security and elemental-grade security for use cases that require even more advanced security features for data sharing and further separation of duties using an external KMS, redaction and compartment security.

Define security models that enforce the exact level of security control over your data, while empowering employees to access just the data they need.

MarkLogic offers fine-grained access controls, proper separation of duties and the ability to anonymize data so that your data is both secure and shareable.

External Key Management

Employ additional separation of concerns and ease of management for storing encryption keys. This option is helpful when you want to leverage an external Key Management System (KMS) that is already in use.


For when pieces of your data need to be removed or obscured when exporting data for sharing. This feature is beneficial in meeting compliance guidelines like HIPAA, SEC17a-4, FINRA and GDPR, to name just a few.

Compartment Security

Further restrict data access by requiring users to have more than one role to view data, not just one of the right roles. It’s often employed to protect classified material in government systems.

Included Features

of Included Features

Smart Mastering

Master data quickly and automatically in a MarkLogic data hub, without buying a separate MDM tool. It leverages fuzzy logic and AI to match and merge data to build your 360-degree view in the context of all your data.

MarkLogic’s Smart Mastering leverages the flexibility of our data modeling approach to quickly build your 360 view in the context of everything it knows. It looks at the multi-layered relationships across all the data and uses fuzzy logic and AI (relevance scoring, database intelligence, and probabilistic algorithms to be specific) to match related records and provide confidence scores. Based on those scores, related records are merged automatically.

Unmerge records when your requirements dictate it. MarkLogic’s Smart Mastering is done in the context of the Operational Data Hub, where you can easily access both the raw data and the harmonized, mastered versions and look back at all your changes as it keeps the lineage and provenance of your data.

Secure, Governed

Your data is safe and well governed in MarkLogic as Smart Mastering is designed to maintain the full provenance and lineage of data—as it is mastered. It is essential to your business when dealing with regulations like GDPR, but it’s also helpful in providing great customer service.

Fast, Lightweight

Data integration projects go 5x faster with MarkLogic. You can expect the same agility with Smart Mastering. And, you don’t have to buy legacy MDM product that’s prone to failure. Smart Mastering provides out-of-the box rule configurations, a set of APIs and a visual interface to get you started quickly.

Smart, Automatic

MarkLogic’s Smart Mastering takes the full context of your data in mind as it matches and merges, automatically. It uses a trust-based approach to address duplicate, incomplete and partial entities. It also allows un-merging when mistakes are made (and they always are with MDM).

Included Features

of Included Features


Get the combined benefits of a document store and an RDF Triple Store. Semantics is ideal for storing relationships, improving data integration, managing metadata and building applications with highly connected data.

MarkLogic combines the benefits of a document store and an RDF Triple Store. JSON and XML documents provide incredible flexibility for modeling entities, while RDF triples—the data format for semantic graph data—are ideal for storing relationships.

With MarkLogic Semantics, you can store metadata, improve data integration and build applications using that integrated, highly connected data. Leverage MarkLogic Semantics to build advanced search apps, recommendation engines, fraud detection, knowledge graphs, drug discovery and network security.

Store in RDF. Query with SPARQL.

Add contextual meaning and create relationships within your data entities, connecting hundreds of billions of facts to power advanced search and algorithms. RDF triples allow you to create real-time connections between data, such as organization structures and relationships between documents and data.

Ontology-Driven Entity Extraction

MarkLogic’s ontology-drive entity extraction automatically identifies entities (people, places, and things) in free-flowing text and then return a list of those entities (extraction) or mark them up in the document (enrichment) for improved search and classification.

Included Features

of Included Features

Apache NiFi

Use point-and-click data flows to load data into MarkLogic. NiFi’s process automation and drag-and-drop interface help you load data from relational databases, Hadoop and various other sources.

Load your data into MarkLogic in less than five minutes using Apache NiFi. MarkLogic’s native integration with Apache NiFi and support for its processors enables you to automate the flow of data from any source system and get it into your target system through a drag-and-drop, configurable user interface.

With Apache NiFi, you can also use out-of-the-box processors to create data flows from relational databases such as MySQL or Postgres, Apache Kafka data streams and other sources in the Hadoop ecosystem, and many other data sources. If a processor doesn’t exist, you can build your own or you can create templates for common data flow patterns.

Developed by the NSA

10 years of development, with a focus on data security, provenance and ease of use within teams.

Any Data Source

Ingest data into MarkLogic from relational, Hadoop, S3 and more. Over 50 data flow processors and it’s easily configurable and extensible.

Drag & Drop UI

Friendly drag and drop UI makes it easier than ever to design the data flows to load data to MarkLogic and monitor the process.

Included Features

of Included Features

Tiered Storage

Store and manage data in different tiers based on cost and performance trade-offs. Easily moving data between tiers without any ETL or expensive infrastructure changes allows you to easily balance performance and capacity through the lifecycle of your data.

The MarkLogic Tiered Storage add-on feature lets you store and manage data in different tiers based on cost and performance trade-offs—whether it’s flash storage, traditional local or shared disk storage, HDFS or Amazon cloud storage.

With Tiered Storage, your data is easily migrated between tiers without any ETL, additional software or expensive infrastructure changes. This allows you to balance performance and capacity through the lifecycle of your data—hitting performance SLAs, making data governance easier and meeting compliance requirements.

Optimize Data Availability

Move data across tiers without having to take the data offline, perform any ETL, or even re-index it, allowing you to move data without concerns, knowing that it’s always available when and where you need it.

Manage Changes Efficiently

Easily migrate and resize data partitions. For example, if you have 18 forests across 3 hosts, you can change it to 20 forests across 4 hosts and have Tiered Storage manage the change.

Create Policy-Based Partitions

Partition data to different storage tiers using a set policy in database administration. For example, a policy can be created to automatically archive data if it is older than a certain date.

Reduce the Cost of Storage

Storage costs can vary widely from around $1 to $25 per gigabyte. With our Tiered Storage, you can avoid over-provisioning expensive storage for data that can be easily stored on a cheaper tier.

Leverage the Economics of Amazon S3 or HDFS

Use Amazon S3 or HDFS as distributed file systems for cheaply storing large volumes of archival data, without losing the ability to bring that data back into an active, operational storage tier quickly, and without any ETL or re-indexing.

Included Features

of Included Features

XA Transactions

Guarantee ACID properties for distributed transactions across clusters. Keep your data in sync by guaranteeing ACID transaction properties are maintained for distributed transactions between MarkLogic clusters or with another database system.

MarkLogic’s XA transactions feature lets you execute one unified ACID transaction between MarkLogic and another database system.

In compliance with the “eXtended Architecture” standard by The Open Group, MarkLogic’s XA transactions guarantee that MarkLogic’s ACID properties are maintained for transactional updates or are fully rolled back to the prior state.

Preserve ACID Properties

XA Transactions allows multiple resources (such as databases, applications etc.) to be accessed within the same transaction, preserving the ACID properties across all resources.

Manage the Transactions Lifecycle

MarkLogic has an XA transaction resource manager as required by the specification (note that you still need a Global Transaction Manager).

The Only NoSQL Database with XA Transactions

If you need XA Transactions and you want to use a NoSQL database, then MarkLogic is the only option.

Included Features

of Included Features

Flexible Deployment

Build your app once and run it wherever you want. Starting on premises and want to migrate to the cloud later? No problem. Need to change cloud vendors? No worries because MarkLogic is cloud neutral and there’s no cloud lock-in.

Build your application once and run it anywhere—on-premises, virtualized or in the cloud. With MarkLogic, your business dictates where your data lives.

Most databases are purpose-built for each particular environment—you run different versions of that vendor’s software depending on the environment. Unlike other databases that require you to choose up front where to deploy, with MarkLogic you get one enterprise product that runs in the environment of your choice. If you start on-premises and migrate to the cloud later on, that is okay. MarkLogic has been successfully running in production cloud environments for over a decade.

Built for the Cloud. Flexible to Change

Proven to Be Cloud Neutral

No vendor lock-in to proprietary cloud technologies. MarkLogic runs on AWS, Azure and Google Cloud.

Proven in Traditional Environments

MarkLogic is proven in private and hybrid cloud, as well as on-premises.

Flexibility to Change

If you start on-premises and migrate to the cloud later, that is okay.

Supports Multi-Cloud Architecture

Independence to run your primary system with one cloud vendor, disaster recovery with another and run development on a third cloud vendor.

Independent Access Control

Simplify the process of switching system integrators, cloud vendors or subcontractors using advanced security features.

Included Features

of Included Features


Deploy faster and more reliably by packaging applications in containers. This ensures that applications will run uninterrupted as they’re moved from one environment to another while reducing costs.

MarkLogic supports containers so you can deploy with greater ease and speed. You can use MarkLogic with Docker or Kubernetes to pack, ship and run applications within containers, developers simply write an application, place it in a container and deploy.

Containers can run on top of virtual machines or bare metal infrastructure. And because MarkLogic is cloud-neutral and platform-agnostic, running MarkLogic in containers offers you increased flexibility: You can easily write an application once and deploy it anywhere in the cloud or on-premises.


Easily deploy applications built on MarkLogic while maintaining security at the application layer across all deployed apps.


Create and run MarkLogic on containers in seconds and spin up additional query capacity for varying needs in near real time.


Enable rapid software development and smooth CI/CD (continuous integration and continuous delivery) with consistency.

MarkLogic is available at Docker Hub, the world’s largest library and community for containers.

Included Features

of Included Features

Embedded Machine Learning

Ready-to-use machine learning provides deeper levels of accuracy of your data right inside MarkLogic so you can better curate and model it. Create unique predictive models and systems that get continually smarter.

MarkLogic’s Embedded Machine Learning is a full deep-learning toolkit that operates as a run-time library installed right at the core of MarkLogic, in the database kernel. It exposes its functions as built-ins from JavaScript and XQuery, which means these functions run close to the data and are completely integrated.

Embedded machine learning was designed for peak performance not only for CPUs but also for GPUs, and it scales to multi-machine-multi-GPU systems. Additionally, it is designed using a compression technique that dramatically reduces communication costs, reducing inter-node communications and enabling highly scalable parallel training across multiple machines.

Improve Database Operations

Run queries more efficiently and scale autonomously based on workload patterns. With autonomous elasticity, MarkLogic can use models of infrastructure workload patterns to automatically adjust the rules that govern data and index rebalancing.

Improve Data Curation

Reduce complexity and increase automation of various steps in the data curation process. Pared with the Smart Mastering, for example, machine learning will augment the rules-based mastering process so that records are mastered with more accuracy, and models continue to improve as more data is processed.

Improving Data Science Workflows

For data scientists, it’s now simpler to just do the work of training and executing models right inside MarkLogic, where we can handle almost every part of the architecture and process.

Included Features

of Included Features


Answer the “where” question in the context of all your other data. Natively store, manage and search geospatial data—including points of interest, intersecting paths and regions of interest.

“Where?”—the basis for military operations, land use analysis, asset and utilities mapping and real-time tracking.

The MarkLogic database provides a single platform to natively store, manage and search geospatial data—including points of interest, intersecting paths and regions of interest. That data is all stored with context as MarkLogic also handles other data about entities (people, places and things), relationships (semantics, or “linked data”), imagery and video (large binaries), and time (temporal data). MarkLogic provides powerful geospatial search capabilities—for the most demanding geospatial applications.

Ideal for Unstructured Data

Built-in search allows queries of free text that may be tied directly to geospatial data. For example, there may be notes attached to an entity displayed on a map that now become searchable because MarkLogic indexes that content.

Extreme Precision

MarkLogic’s double precision enables more precise, high-resolution search queries and operations (e.g., when a geological survey agency needs to monitor objects that move in sub-meter increments).

Integration with Leading Geospatial Vendors

Leading geospatial mapping vendors including Esri ArcGIS, OpenGeo Suite, Berico Rivium, OGC-compliant GIS tools, and SPARQL-compliant semantic visualization tools are proven to integrate with MarkLogic, providing robust visualization capabilities and real-time access for users to put multi-dimensional geospatial data to work within a web interface.

Enterprise-Grade Database

The MarkLogic NoSQL database provides the enterprise features organizations require, including granular data security, high availability and disaster recovery and transactional consistency.

Geospatial Alerting

An easy-to-use API that leverages MarkLogic’s indexes to deliver real-time geospatial alerts at scale.

Region Search APIs

APIs that allow users to draw polygons on a map that contain or intersect areas of interest.

Included Features

of Included Features

High Availability and Disaster Recovery

Enterprise-class HA/DR so that you can have confidence your data is always available and scheduled downtime is minimized, reducing risk and avoiding interruptions.

The MarkLogic enterprise-grade high availability and disaster recovery (HA/DR) so that you can have confidence your data is always available and scheduled downtime is minimized, reducing risk and avoiding interruptions.

With MarkLogic, your data is always available on a hardened platform with uncompromising data resiliency. With 100% ACID-compliant transactions, you are guaranteed consistency after a failover or recovery event.

Database Replication for Disaster Recovery

Backup selected components or the entire database—using SSL out-of-the-box. Incremental backups, coupled with journal archiving, enable you to restore the database to a point-in-time that minimizes the recovery point objective (RPO) and uses less storage.

Geared for Disconnected, Intermittent and Latent Networks

Getting data when and where you need it can be challenging—and even risky or impossible. Global organizations often run into latency issues trying to replicate data from centralized data centers to remote areas. And, for military operations in particular, replicating data to the field is often problematic because of cybersecurity concerns and technical roadblocks created by adversaries armed with signal jamming equipment. MarkLogic’s Flexible Replication is the perfect solution for sharing data across disconnected, intermittent, and latent networks.

Automated Failover for High Availability

High availability requires continuity within a cluster, protecting against component failure with computers, networking equipment and power supplies.

Marklogic clusters are protected against failures by providing fault tolerance in both evaluator nodes (e-nodes) and data nodes (d-nodes). If either an e-node or d-node fails, other nodes automatically pick up the workload so that the data stored in forests is always available. High availability works with either local disk failover on DAS or SSD or shared disk storage, such as SAN, Amazon S3 or HDFS.

Included Features

of Included Features

Scalability and Elasticity

MarkLogic can scale horizontally in clusters on commodity hardware to hundreds of nodes, petabytes of data, and billions of documents—and still processes tens of thousands of transactions per second.

MarkLogic is a massively scalable enterprise-grade database that scales horizontally in clusters on commodity hardware to hundreds of nodes, petabytes of data and billions of documents—and still processes tens of thousands of transactions per second.

When demand dissipates, MarkLogic can scale back down without having to worry about complex sharding. With these features, organizations can handle incredible volumes of data and run large scale web applications—all without breaking the bank.


From three nodes to hundreds of nodes, or 10,000 documents to 1 billion documents—MarkLogic clusters scale horizontally as your data or access demand grows and shrinks.


Add or remove nodes in minutes and take advantage of automatic cluster rebalancing, helping you keep the database in line with performance needs without over-provisioning.

Run on Commodity Hardware

MarkLogic doesn’t need “big iron.” You can run MarkLogic on cost-effective commodity hardware in any environment—in the cloud, virtualized, on-premises, or a combination.

Shared Nothing Architecture

MarkLogic uses a shared-nothing architecture with no master-slave relationships, which means there is no risk of data loss if a node fails. If one node fails, another node automatically picks up the workload.

No Performance Degradation

MarkLogic was designed from the start to run large enterprise applications and does not reach a limit where there are large performance cliffs while scaling.

Fewer Nodes and Licenses

MarkLogic datasets and indexes do not have to fit in-memory, which means you can scale without the expense of dozens of boxes and licenses.

Included Features

of Included Features

Data Analytics and Business Intelligence

Derive insights using traditional SQL, enterprise search and semantics capabilities—all in one unified operational and analytical system. Modernize data analytics and business intelligence with MarkLogic.

Write more powerful queries, build better applications, and uncover new insights. Whether you are building custom analytics or using standard BI tools, MarkLogic accelerates your path to modernize data analytics and business intelligence.

MarkLogic’s patented multi-model querying capability makes it possible to use multiple lenses for analyzing multi-structured data. With different lenses, it is easy to make data fit-for-purpose for different downstream uses. No need to augment a traditional RDBMS warehouse with a newer data lake environment. Rather than making your tech stack more complex to govern and secure, MarkLogic simplifies your overall architecture.

Intelligent Data Discovery with Semantic Search

With Semantics, you can describe relationships and enrich data with domain knowledge to build rich knowledge graphs. MarkLogic’s built-in search auto indexes the data and the relationships so that you can combine full-text search with graph query for powerful semantic search.

SQL Analytics for Operational Reporting

You can create relational views on top of your multi-model data for SQL analytics without compromising flexibility and security. The underlying data format never changes, and MarkLogic’s Template Driven Extraction enables you to create a relational lens over your data so you can query it using standard SQL.

Real-Time Alerting for Real-Time Decisions

Using its sophisticated indexing, MarkLogic includes real-time alerting for timely responses to changes in data and instant decision making. With real-time alerts and advanced semantic and geospatial search capabilities you can utilize MarkLogic for faster communication and decision making.

Unified Interface for Multi-Model Querying

MarkLogic provides its Optic API as a unified query interface for multi-model data access. It allows you to search across documents, semantic graphs and relational views with a single, composable query. You can even use the Optic API to do relational operations like joins and aggregates over documents.

Connect to BI Applications or Build Your Own

Build bespoke analytics applications or funnel quality data into industry-leading tools like Tableau and Power BI. MarkLogic enables sophisticated search, map and graph visualizations as well as seamless integration with business intelligence (BI) tools in three common ways: ODBC, REST and file-based.

Complete List of Features

  • ACID Transactions
  • Bitemporal
  • Built-in Search
  • Certified Security
  • Compartment Security *
  • Compliance Archive
  • Configuration Management
  • Customizable Failover
  • Data Rollback
  • Encryption at Rest
  • Entity Services
  • External KMS *
  • Flexible Deployment
  • Flexible Replication *
  • Geospatial
  • HA/DR
  • Hadoop Integration
  • Java API
  • LDAP and Kerberos Support
  • MarkLogic Content Pump
  • Monitoring & Management
  • Node.js API
  • Optic API
  • Point-in-time Recovery
  • Real-Time Alerting
  • Redaction and Element Level Security
  • Rolling Upgrades
  • Scalability & Elasticity
  • Semantics *
  • Server-side JavaScript
  • Smart Mastering
  • SQL Support
  • Telemetry
  • Tiered Storage *
  • XA Transactions *

* Paid Options

MarkLogic In 4 Easy Steps

MarkLogic Prefooter Banner

Flexible Deployment & Licencing

Deploy MarkLogic on-premises or in the cloud. Try the free, full-featured version for development use.