McLean, USA – August 16 – Using its enhanced visual search technology, ZyLAB, a leading provider of eDiscovery and information management solutions, has located an additional set of more than 60 previously undetected items in the EDRM Enron PST Data Set that contain pornography, privacy, health and financial information. Responding to the invitation of EDRM, the leading standards organization for the eDiscovery and information governance market, to assist with an ongoing effort to cleanse the Enron data set, ZyLAB has shared this information with EDRM.
The Enron Data Set is an industry-standard collection of email data that was previously hosted by EDRM and in 2012 became an Amazon Web Services Public Data Set. The Enron Data Set has served for many years as an industry-standard collection of email data for electronic discovery training and is a valuable public resource for all sorts of researchers from all disciplines.
It has never been a secret that the data set that was originally made available by the Federal Energy Regulatory Commission (FERC) contained a high level of personally identifiable information (PII) about the company’s former employees.
Johannes Scholtes, Chief Strategy Officer of ZyLAB: “In May of this year a great cleansing effort has been made by the consultants from Nuix, identifying many items containing private, health and financial information. With our additional cleansing exercise ZyLAB responds to the invitation of the EDRM co-founders George Socha and Tom Gelbmann, to come forward and support this effort to protect the privacy of hundreds of individuals and help them locating any additional private data that may still exist in the data set.”
“We welcome ZyLAB’s efforts to further improve the EDRM Enron Data Set by identifying PII in the data set and thank them for bringing that information to our attention, “ said George Socha. “We invite others who are interested in helping to make these data sets available to the industry to follow ZyLAB’s example.”
By using the brand new ZyLAB Visual Classification technology in combination with the existing deep processing, content analytics and search capabilities, several hard to find items like documents containing social security and credit card numbers, protected health information, 1040 tax forms, and even pornography have been identified.
For over 30 years, ZyLAB’s unique search and content analytics technology has been developed to help customers find more relevant information than any other product on the market, regardless of spelling errors, OCR errors, deliberate hidden data, aliases, code words, digital format, location, or language and even regardless of the fact that the data contains explicit text in cases of images, video or audio recordings. The results of this effort, shows the power of the unique ZyLAB search technology.
ZyLAB’s industry-leading, modular eDiscovery and enterprise information management solutions enable organizations to manage boundless amounts of enterprise data in any format and language, to mitigate risk, reduce costs, investigate matters and elicit business productivity and intelligence.
The ZyLAB eDiscovery system is directly aligned with the Electronic Discovery Reference Model (EDRM) and the company’s products and services are used on an enterprise level by corporations, government agencies, courts, and law firms, as well as on specific projects for legal services, auditing, and accounting providers. ZyLAB systems are also available in a Software-as-a-Services (SaaS) model.
ZyLAB is positioned by Gartner, Inc. in the “Leaders” quadrant of the 2012 Magic Quadrant for eDiscovery Software and has received numerous other industry accolades. Headquartered in Amsterdam, the Netherlands and McLean, Virginia, ZyLAB also serves local markets from regional offices in New York, Barcelona, Frankfurt, London, Paris, and Singapore. To learn more about ZyLAB visit www.zylab.com