Alibaba Innovative Research (AIR) > Data Security and Privacy Protection
【CCF-AIR青年基金】数据防泄漏场景的屏幕翻拍翻录检测 Detection of screen recapture and its application to data loss prevention

Research Themes

Data Security and Privacy Protection

Background

Data security is bound to become a key research and development direction of all sectors of society. As far as concerned, the ability of data leakage prevention and traceability under the Internet sharing environment is still relatively weak, and data leakage has been happening all the time. "Insider" often has a variety of ways and means to disclose high-risk data, especially unstructured data leakage such as screen recapture, screenshots and screen snapping.  

2021年年初开始,国家网信办、工信部、公安部等多部门对数据安全、网络信息安全等涉及到国家安全的领域陆续密集出台相关的监管措施,数据安全势必将成为社会各界的重点研究和发展方向。就目前情况来看,互联网共享环境下的数据防泄露与溯源能力依旧较为薄弱,各地数据安全问题频发。数据泄露一直在发生,内鬼往往有各种各样泄露高风险数据的方式方法,特别是类似录屏、截图、偷拍屏幕这类非结构化数据泄露方式。

 

We define unstructured data as data with irregular or incomplete data structure and no predefined data model, and this kind of data is difficult to be represented by the two-dimensional logical table of the database. It includes all forms of office documents, text, pictures, XML, HTML, various reports, images and audio / video information. Among the risk points of different types of unstructured data mentioned above, the most common scenario is to disclose high-risk content through screen recapture and snapping. Since everyone has mobile-phone, the high fidelity format of high-risk and sensitive information data can be obtained in a very short time through screenshots or screen recapture. Although some digital watermarking technologies can be used to track and trace the source of information leakage, the scanning ability of the current security infrastructure for unstructured data content is still insufficient, which may lead to some high-risk information can be transmitted externally by using image, audio and other carriers to penetrate the current protection ability of data security. Especially for the leaker photographers with special disclosure purpose, the shooting action is usually relatively secret. After the leaker shooting, it is likely to carry out a series of attacks on the image and erase the watermark marks contained in the high-risk content. Or further, the candid photographer is likely to disguise the images taken and recorded on the screen by means of image fusion and information hiding, so as to realize the behavior of secret transmission. At present, many enterprises allow arbitrary transmission and copying of photos and images, and there are obvious "security vulnerabilities" in image unstructured data.  

非结构化数据是数据结构不规则或不完整,没有预定义的数据模型,不方便用数据库二维逻辑表来表现的数据。包括所有格式的办公文档、文本、图片、XML, HTML、各类报表、图像和音频/视频信息等等。尽管目前可以通过一些数字水印技术来实现对信息泄露的追踪和溯源,但是现在的安全基建对非结构化数据内容的扫描能力仍然存在不足,可能导致一些高风险信息可以利用图像、音频等载体进行对外传输,穿透目前数据安全的防护能力。特别是对于本身就带着特殊泄密目的的偷拍者,拍摄动作通常会进行得比较隐密,偷拍之后很可能会对图像进行一系列的攻击,对高风险内容包含的水印标记进行抹除。或者更进一步地,偷拍者很可能通过图像融合、信息隐藏的手段将拍屏、录屏的图像进行伪装,从而实现秘密外传的行为。很多企业内部对于照片类的图像,可以任意传输和拷贝,存在明显的非结构化图像数据方面的安全漏洞

 

Unstructured data leakage such as recapture and recording will have a very serious and bad negative impact on society, enterprises and individuals. Detection of mobile phone image/video recapture is an important topic related to data security. By studying efficient and accurate real-time detection of screen recapture data leakage, we can plug the data security loopholes in this aspect in the society to a great extent.

通过拍屏、录屏等非结构化数据泄漏行为将会对社会、企业和个人造成非常严重和恶劣的负面影响,因此针对拍屏、录屏的非结构化数据泄露检测是一项涉及数据安全的重要课题,通过研究高效准确的拍屏、录屏数据泄漏实时检测,可以有效拦截数据跨媒介传输泄漏。 

Target

This project proposes to conduct an in-depth research on two technical problems:

1)Efficient and accurate screen recapture detection technology. It can resist composite post-processing attacks including re-compression, cropping, scaling, multiple social media transmission and so on. 高效准确的屏幕翻拍、翻录检测技术。能够抵抗包含重压缩、裁剪、缩放、多次社交媒体传输等各种强度的复合后处理攻击。

 

2)Effectively combine the data content and recapture behavior to judge whether the recapture behavior is suspicious. When the recapture behavior is detected, the content of the recapture images and the behavior before and after the shooting are actively detected and analyzed through multi-modal machine learning technology and OCR technology, so as to effectively judge whether the recapture and the recapture behavior involve data leakage.

结合数据内容、拍摄行为,有效判断屏幕翻拍、翻录行为是否可疑的技术。在检测到屏幕翻拍发生的时候,通过多模态机器学习技术和OCR技术对翻拍内容和拍摄前后的操作行为进行主动检测和分析,有效判断翻拍、翻录行为是否涉及数据泄露。 

Related Research Topics

  • Efficient and accurate screen recapture detection  高效、准确的屏幕 翻拍、翻录检测手段
  • Analysis on the screen recapture behavior and leaked data content, effectively detect data leakage 结合数据内容、拍摄行为,有效检测数据泄漏的发生
  • To reveal the post-processing trace in the process of screen recapture 揭示屏幕翻拍翻录过程中的后处理痕迹 

Scan QR code
关注Ali TechnologyWechat Account