AI & MACHINE LEARNING
BESPOKE DATA VISUALISATIONS
CUSTOM SOFTWARE DEVELOPMENT
CLOUD & OPERATIONS
DATA & ANALYTICS
EMBEDDED & ENGINEERING
IOT & CLOUD
Metadata is basically information that describes other data. In scientific research, it plays a crucial role by helping researchers understand the context, source, and structure of their datasets — collectively referred to as data provenance. Researchers need to know where data comes from, how they generated it, and under what conditions they collected it to ensure its quality, reproducibility, and trustworthiness.
However, despite its importance, researchers often store metadata across different formats like Excel sheets, XML files, RDF schemas, and web forms. This lack of standardization makes it hard to work with and even harder to trust. When information is inconsistent or poorly labeled, finding what you need can turn into a real struggle.
Rafał Klimek, a front-end developer at Holisticon Connect, knows this challenge well. With over ten years of experience building user-friendly interfaces, he focuses on creating tools that help researchers make sense of metadata chaos. His goal is simple: to bring clarity and order to complex information, making it accessible and easy to use. During Cambridge Tech Week, Rafał shared his insights on how better metadata management can turn disorganized data into clear, structured information that’s easy to work with.
In the world of scientific research, metadata plays a key role in turning raw data into something meaningful. It provides crucial information about how the data was collected, what it represents, and where it came from. Without clear and well-structured metadata, even the best datasets can be nearly impossible to understand or reuse effectively. The challenge is that metadata is often scattered across different formats – like Excel sheets, XML files, RDF schemas, or basic web forms. Each of these tools has its own layout and naming conventions, making it tough to bring everything together.
Rafał highlighted that this lack of standardization often leads to confusion and mistakes, especially in large projects where multiple teams are involved. “Good metadata is not just a nice-to-have – it’s essential for keeping things clear and organized,” he explained. When every team uses its own approach, simple tasks become complicated, and critical data can easily get lost in the shuffle.
Bad metadata can turn even the best datasets into useless piles of numbers. When data is poorly described or stored in inconsistent formats, it becomes nearly impossible to understand or use effectively. Rafał pointed out that this is a common problem in research projects where multiple teams work with different tools and naming conventions. Important details get lost, data is misinterpreted, and collaboration becomes frustratingly slow.
“Bad metadata makes good data useless,” he said during his talk, highlighting how a lack of proper structure can lead to costly mistakes and wasted time. In large projects, unclear metadata can lead researchers to repeat experiments or overlook valuable insights simply because they didn’t label or document the data properly.
Beyond just being inefficient, poor metadata is also a major roadblock to achieving FAIR data principles — ensuring that data is Findable, Accessible, Interoperable, and Reusable. Without a clear structure and consistent standards, datasets fail to meet these criteria, limiting their long-term value and making true data-driven research collaboration much harder.
Effective metadata management isn’t just about organization – it’s about avoiding errors, enabling reuse, and ensuring research data lives up to its full potential.
Bad metadata makes good data useless.
Bringing order to messy metadata starts with setting clear standards. Rafał’s team uses JSON Schema as the foundation for making data consistent and structured. JSON Schema defines how each piece of information should look, what fields are required, and how values should be formatted. This simple set of rules helps avoid confusion and makes it easy to share and understand data, even when it’s coming from different sources.
To make this process even smoother, the team uses lightweight formats like YAML and JSON. These text-based formats are easy to read and edit, which speeds up validation and reduces errors. Researchers can make updates in a simple text editor without worrying about complex code. As Rafał explained, “Good metadata is not just about keeping things tidy – it’s about making sure your data is ready to use, every time.”
Standardization doesn’t just make metadata cleaner; it makes it reliable. With a clear structure in place, researchers know exactly what to expect, no matter where the data comes from.
Creating clear standards for metadata is important, but it’s only the beginning. To really make metadata useful, you need good tools to manage it. Rafał’s team has focused on building user-friendly interfaces that make it easy to work with metadata, even for people who aren’t familiar with technical formats like JSON or YAML. One of their solutions is a WYSIWYG (What You See Is What You Get) Editor. This tool allows researchers to fill out metadata forms without having to understand complex code. They simply enter the information, and the system handles the formatting in the background.
For those who are more comfortable with code, the team also developed a browser-based text editor. It supports YAML and JSON, providing real-time validation and instant feedback on errors. “It’s like having an automatic spell-checker, but for data,” Rafał explained.
These tools don’t just make metadata entry easier – they also make it faster and more reliable. With features like autocomplete and error validation, researchers can input accurate metadata quickly, reducing mistakes and improving collaboration across projects.
Improving metadata isn’t just about making things look tidy – it’s about getting real, measurable results. Rafał’s team has seen the difference that well-structured metadata can make in research projects. By centralizing all metadata in one place and using consistent schemas, they’ve made it much easier for teams to find what they need without wasting time. When researchers organise everything clearly, they don’t have to dig through multiple sources or deal with conflicting formats. This level of clarity also makes it much easier to manage resources. For example, clear metadata helps teams see which lab equipment they’re actively using and which sits idle, enabling smarter decisions.
“You can’t manage what you can’t see,” Rafał said during his talk. Centralized and well-structured metadata doesn’t just help with finding information – it also makes projects easier to monitor and track. Tools like dashboards and real-time visualizations give decision-makers a clear view of ongoing research, helping them avoid surprises and make more informed choices.
You can’t manage what you can’t see.
Managing metadata well is more than just good practice – it’s the backbone of efficient research. When teams organise information clearly, they find what they need faster, collaborate more easily, and avoid costly mistakes. Rafał’s team has shown that with the right tools and smart planning, they can turn even the most complex datasets into something clear and accessible.
“Bad metadata makes good data useless,” Rafał often says. But the opposite is also true: good metadata unlocks the real value of data. It turns raw information into actionable insights, ready to drive research forward. In the end, smart metadata management isn’t just about keeping things tidy – it’s about unlocking the full potential of scientific discovery.
At Holisticon Connect, our core values of Passion and Execution drive us toward a Promising Future. We are a hands-on tech company that places people at the centre of everything we do. Specializing in Custom Software Development, Cloud and Operations, Bespoke Data Visualisations, Engineering & Embedded services, we build trust through our promise to deliver and a no-drama approach. We are committed to delivering reliable and effective solutions, ensuring our clients can count on us to meet their needs with integrity and excellence.
Let’s talk about your project needs. Send us a message and will get back to you as soon as possible.