FECloud: A Trustworthy Forensics-Enabled Cloud Architecture

i

ii

Chapter 1

FECLOUD: A TRUSTWORTHY FORENSICS-ENABLED CLOUD ARCHITECTURE

Shams Zawoad and Ragib Hasan

Abstract The rapid migration from traditional computing and storage models tothe cloud model creates the necessity of supporting reliable forensics inthe cloud. However, current cloud computing architectures often lacksupport for forensic investigations because many of the assumptionsthat are taken for granted in traditional digital forensics do not applyto clouds. Moreover, trustworthiness of evidence can be questionablebecause of the possibility of collusion between dishonest cloud providers,malicious users, and investigators. Since reliability and accuracy ofevidence are very important factors while evaluating evidence during acriminal investigation and prosecution, we need to preserve the integrityof evidence before and after collecting from clouds.

In this paper, we first identify the required properties to supporttrustworthy forensics in clouds. Based on the requirements, we pro-pose a forensics-enabled cloud architecture (FECloud) to preserve andprovide required evidence while protecting the privacy and integrityof the evidence. FECloud is designed on top of Openstack – a popu-lar open source cloud computing platform. Incorporating architectureslike FECloud may impose significant business impacts on Cloud ServiceProviders (CSP) as well as customers. CSPs can attract more customerswith the assurance of providing proper forensics support. Likewise, cus-tomers do not require extreme investment on establishing their ownforensics friendly infrastructures.

Keywords: Cloud Forensics, Digital Forensics, Cloud Security, Cloud Architecture

1. Introduction

Cloud computing is one of the major forces behind many services.Today, consumers are enjoying the services provided by clouds whenthey access Gmail, Google Calendar, Dropbox, Microsoft Office Live, orrun hundreds of Amazon Elastic Compute Cloud (EC2) instances for

2

processing large-scale data. According to Gartner, consumers will storemore than one third of their digital content in the cloud by 2016 [1]. Arecent research by Market Research Media states that the global cloudcomputing market is expected to grow at a 30% Compound AnnualGrowth Rate (CAGR) reaching $270 billion in 2020 [2].

However, the highly-scalable computing and storage resources offeredby the cloud can be misused by malicious users to perform attacks frommachines inside the cloud [3, 4], or to keep some secret files (e.g., childpornography, stolen intellectual properties) in cloud storage [5, 6]. Toinvestigate such crimes involving clouds, investigators have to carry outdigital forensic investigations in the cloud environment. This particularbranch of forensics has become known as Cloud Forensics.

Unfortunately, many of the assumptions of traditional digital forensicsare not valid in the cloud computing model. One of the major hurdlesis that neither users nor investigators have physical access to the cloud.In clouds, each server contains files from many users. Hence, even witha subpoena, it is not feasible to seize servers from a data center with-out violating the privacy of many other users. The trustworthiness ofthe evidence is also questionable, because other than the cloud serviceprovider’s (CSP) word, there is no usual way to determine the integrityof the evidence. To provide on-demand services, cloud providers do notsupport persistent storage for terminated VMs. Hence, data resides inthe cloud VMs will be unavailable after terminating the VMs. This inturns makes it almost impossible to do forensics investigation if some il-legal activities have been occurred using such terminated VMs. Finally,cloud providers and investigators can collude with a malicious user tohide the trace of an illegal activity or to frame an honest user. For thesereasons, we need special cares to provide reliable forensics supports incurrent cloud infrastructures.

While there are several research works, which addressed the challengesof cloud forensics [7–10] and proposed solutions to overcome some ofthe problems [7, 11–13], we do not see any complete cloud computingarchitecture that can preserve and provide trustworthy evidence to lawenforcement agencies. Moreover, malicious cloud providers and collusionbetween different stakeholders have not been addressed in these works.

We argue that to support trustworthy forensics in clouds, we needto preserve logs, proof of data possession, provenance information, andtimestamp securely. The required evidence should also be easily avail-able to users, investigators, or the court authority. Based on the require-ments, we propose a forensics-friendly cloud computing architecture –FECloud, which is designed on top of the current OpenStack1 architec-ture. FECloud introduces five new components in the existing Open-

https://www.researchgate.net/publication/229042362_Technical_Issues_of_Forensic_Investigations_in_Cloud_Computing_Environments?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==


https://www.researchgate.net/publication/286192780_Understanding_Issues_in_cloud_forensics_Two_hypothetical_case_studies?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/260364240_Digital_Forensics_in_the_Cloud?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/260364243_Towards_Building_Proofs_of_Past_Data_Possession_in_Cloud_Forensics?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/302558183_Detecting_temporal_inconsistency_in_virtual_machine_activity_timelines?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/235712367_SecLaaS_Secure_Logging-as-a-Service_for_Cloud_Forensics?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/259164863_Design_and_implementation_of_FROST_Digital_forensic_tools_for_the_OpenStack_cloud_computing_platform?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

Zawoad & Hasan 3

Stack architecture namely: Logger (Themis), Data Possession (DP)Manager (Metis), Timestamp Manager (Chronos), Provenance Manager(Clio), and Proof Publisher (Brizo). We also add new modules withthe OpenStack Block Storage (Cinder) and Nova Compute to commu-nicate with the new components. OpenStack Dashboard (Horizon) andIdentity manager (Keystone) are augmented to provide user interface(UI) and authentication to the proposed components. Finally, we de-sign a forensics-enabled image for VMs to support the forensics relatedfeatures. By incorporating the proposed architecture, cloud providersmay attract more customers with the assurance of reliable forensics sup-port. Customers also do not need to establish a privately-owned, costlyforensics-enabled computing/storage infrastructures for critical businessapplications.

Contribution: The contributions of this work are as follows:

1 We systematically analyze the requirements and challenges to es-tablish trustworthy forensics supports in current cloud architec-tures, which can help researchers to focus on specific research sub-problems of the large cloud forensics problem domain.

2 We present the FECloud architecture, which provides the requiredproperties for reliable forensics investigations in clouds. To thebest of the authors’ knowledge, this is the first work that proposesa complete, trustworthy, and forensics-enabled cloud architectureon top of a popular open source cloud computing platform.

3 We show how various proposed components of the architectureacquire different types of evidence from the existing architecture,preserve them securely, and make them available to users and lawenforcement agencies. We also show how the trustworthiness ofvarious evidence can be verified by the court authority.

Organization: The rest of the paper is organized as follows: Sec-tion 1.2 presents the research motivations. Section 1.3 discusses therequired properties for a forensics-enabled cloud infrastructure. Section1.4 presents the challenges to establish a forensics-enabled cloud archi-tecture. We propose the FECloud architecture in Section 1.5. Section1.6 presents the related work and finally, we conclude in Section 1.7.

2. Research Motivations

The black-box nature of clouds can attract criminals to launch vari-ous kinds of malicious activities using clouds. In an ideal situation, weshould evaluate digital evidence based on the reliability of the system

4

and process that generated the evidence. As suggested by Strong etal., digital evidence are not the counterpart of statements provided byhumans, which should ideally be tested by cross-examination, whereasadmissibility of the digital evidence should be determined on the ba-sis of the reliability and accuracy of the process involved [14]. Hence,preservation and collection of trustworthy evidence is utterly importantin digital forensics and as well as in cloud forensics.

However, the very black-box nature of cloud computing also raisesquestions about the trustworthiness of evidence, when the evidence arecollected from the cloud. Because of the importance of reliable foren-sics in the cloud, the National Institute for Standards and Technology(NIST) recently established the NIST Cloud Computing Forensic Sci-ence Working Group (NCC-FSWG) to research cloud forensic sciencechallenges and to develop solutions, standards, and technologies to miti-gate the challenges that cannot be handled with current technologies andmethods [15, 16]. Many of the challenges of reliable forensics are new inthe cloud, while others are already faced by digital forensics examiners.According to Ruan et al., challenges to cloud forensics can broadly becategorized into technical, legal, and organizational challenges [17].

Currently, extensive research is ongoing to protect cloud form externalor internal attackers. However, in case of an attack, we need to investi-gate the incident. Besides protecting the cloud, it is important to focuson this issue. The goal of this work is to enable forensics investigationin clouds while ensuring the trustworthiness of evidence in the completelife cycle of cloud forensics, from evidence preservation to presentation.

3. Properties of a TrustworthyForensics-Enabled Cloud

Based on the unique characteristics of clouds and existing forensicsframework, we identify the following five crucial properties that a trust-worthy forensics-enabled cloud should ensure.

Trustworthy Log Management: Since the necessity of logs is indis-putable in forensic investigation, it is often the case that experiencedattackers first try to tamper with the logs to hide the traces of attacks[18]. An adversary can try to host a botnet server, spam email server, orphishing websites in cloud VMs and he can remove all the traces of thesemalicious activities later by tampering with the logs. For these attacks,we need appropriate logs to investigate how an attack occurred, when itoccurred, and by whom. Hence, a forensics enabled cloud should acquireall types of activity logs from VMs and store them in a persistent storagewhile ensuring the integrity and confidentiality of the logs.

https://www.researchgate.net/publication/229021339_Cloud_forensics_An_overview?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/2363097_Forward_Integrity_For_Secure_Audit_Logs?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

Zawoad & Hasan 5

Proof of Data Possession: Preserving proof of data possession isimportant for two reasons: to prove the presence of a particular file in agiven storage system at a particular time and to ensure the preservationof litigation hold.

To keep the personal computer clean, malicious users can keep theircontraband documents in clouds. Some incidents of storing contrabanddocuments in cloud storage have already been reported [5, 6]. In orderto prove that a suspect actually had some specific files in a given storageat a particular time, a forensics-enabled cloud should preserve the proofof data possession.

A litigation hold is a notice to an organization to preserve all theelectronically stored information (ESI) for a current lawsuit for a certaintime period. In current data storage models, this includes documentsstored in clouds. However, due to the fundamental natures of currentclouds, incorporating a trustworthy litigation hold management systemis very challenging. The need of litigation hold in clouds has appeared intwo recent cases [6, 19]. Hence, if a case involves cloud-based ESI, cus-tomers and CSPs share a mutual need and duty to ensure the litigationhold on cloud data. Whether a litigation was maintained or violated canbe ensured by preserving trustworthy proof of data possession.

Secure Timestamp: It has been seen in the past that the time associ-ated with digital evidence can be very crucial to discharge or condemna suspect [20]. A similar situation can happen with the cloud where acrime has occurred directly using cloud computing resources, or wherecloud activities can be used as evidence for any other crime. An attackercan change a VM’s system clock before launching any attack and laterreset it to the original time.

Tampering with the system clocks of the host and VM will provide aset a events which are temporally coherent but occurred in a differenttime domain than the actual. Any timeline of events generated with theassumption of trusted system clock of host and VM suffers from thisvulnerability. Hence, the timeline will not be admissible in a court oflaw. To provide a trustworthy timeline of events for a reliable forensicsinvestigation, we need to make sure that the system clocks of the hostand guest VMs have not been tampered with.

Secure Provenance: Provenance provides the history of an object.Hence, by implementing cloud provenance, investigators can reason aboutthe origins, collection or creation, evolution, and use of any evidence.Besides preserving the integrity of evidence, CSPs also need to providea proper chain of custody information, which can be accomplished bymaintaining proper provenance. However, as all the evidences and theaccess histories are under the control of CSPs, they can always tamper

https://www.researchgate.net/publication/220542555_Error_Uncertainty_and_Loss_in_Digital_Evidence?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

6

with the provenance record. Moreover, from the provenance data of theclouds, an attacker can learn confidential information about the datastored in the cloud. We need a secure provenance [21] scheme to protectprovenance information from these types of attack.

Availability of Evidence: The physical inaccessibility of the evidencemakes evidence acquisition a challenging task in the cloud. CSPs canplay a vital role in this step by providing a web based managementconsole or providing secure Application Programming Interface (API) tolaw enforcement agencies. Using a web console or API, customers andinvestigators can collect network, process, and database logs, as well asother digital evidence, and the provenance records of that evidence.

4. Challenges

Integrating the required properties in current cloud infrastructures ischallenging due to several characteristics of cloud computing.

Collusion Between Different Entities: Users or investigators havevery limited control over the evidence stored in clouds. With the state-of-the-art frameworks for collecting evidence from a cloud, investigatorsneed to believe the CSPs blindly, as they cannot verify whether the CSPsare providing valid evidence or not.

An employee of a CSP can collude with a malicious user to hide im-portant evidence or to inject invalid evidence to prove the malicioususer as innocent. Such a malicious CSP can provide incomplete logs,remove documents without keeping any trace, can maintain false times-tamp, and can tamper with various provenance information. Conversely,investigators can also be malicious and can alter any kind of evidencebefore presenting to court. In a traditional system, only the suspect andthe investigator can collude. The three-way collusion in clouds certainlyincreases the attack surface and makes cloud forensics more challenging.

Volatile Data: Data that reside in a VM are volatile, i.e., no datawill be preserved after terminating a VM. The volatile data can be doc-uments, network logs, operating system logs, and registry logs. If anadversary terminates VMs after doing a malicious activity that will leadto a complete loss of the crucial evidences. Though there is a way topreserve VM data by storing an image of the VM instance, an attackerwill not use it in order to remain clean.

CSPs can constantly monitor all the running VMs and store thevolatile data in a persistent storage so that they can provide logs orproofs of data possession when needed. However, simply preserving alldata of a terminated VM can overwhelm the storage of cloud providers.

https://www.researchgate.net/publication/32966408_Preventing_History_Forgery_with_Secure_Provenance?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

Zawoad & Hasan 7

Hence, we need to find an effective way to preserve the logs, data pos-session history or the provenance records.

Multi-tenancy: In clouds, multiple Virtual Machines (VM) can sharethe same physical infrastructure, i.e., data for multiple customers canbe co-located. An alleged user may claim that the evidence containsinformation of other users, not her. In this case, the investigator needsto prove it to the court that the evidence presented actually belongs tothe suspect. Conversely, in a traditional computing system, the owneris solely responsible for all the ESI located in her computing system.Moreover, in clouds, we need to preserve the privacy of other tenants.

5. FECloud Architecture

To preserve and provide cloud-based ESI to forensics investigatorssecurely, we propose FECloud, a forensics-enabled cloud architecture.FECloud introduces following five new components with the existingOpenStack architecture:

• Logger (Themis) collects logs from VMs, Cinder, and Nova computenodes and preserves them securely;• Data Possession (DP) Manager (Metis) collects proofs of data posses-sions from Cinder and stores the proofs securely;• Timestamp Manager (Chronos) handles timestamp verification cyclesbetween VMs, compute node, and itself and preserves the informationabout the verification phase securely;• Provenance Manager (Clio) collects various provenance records (e.g., data,application, state) from VMs, Nova compute, and Themis to securelycreate and preserve provenance chain;• Proof Publisher (Brizo) makes the proofs of different types of ESIpublicly available so that any alteration of evidence by users, CSPs, orinvestigators after-the-fact can be detected at the court.

Besides the aforementioned new components, we augment the Open-Stack Dashboard (Horizon) and Identity manager (Keystone) to supportUI and authentication to the proposed components. We also design aforensics-enabled image for VMs to provide the forensics related features.Figure 1 illustrates the proposed FECloud architecture and below we de-scribe the design in detail.

5.1 Logger (Themis)

The Logger (Themis) communicates with the OpenStack computenode (Nova), block storage (Cinder), and the running VMs to collect allpossible activity logs. To communicate with Themis, a new log provider

8

Figure 1. FECloud Architecture

module is added to the existing Nova and Cinder node of OpenStackand with the VM image.

The log provider module of Nova compute constantly monitors vari-ous activities (e.g., network activity, processor usage by VMs) of VMsrunning in the compute node and sends the logs to Themis. Some ofthe logs of VMs cannot be gathered from Nova compute, e.g., operat-ing system logs. Such logs are directly collected from VMs. The logprovider module of Cinder sends logs of block storage usage to Themis.Logs from different entities are sent to Themis by the Log API exposedby Themis.

Whenever Themis receives any logs via the Log API, it stores the logsin a persistent storage, so that we will not lose any log after terminat-ing the VMs. When a VM is in active state, Themis can track whichdata belongs to which VM. Hence, while preserving the data, Themiscan take care of segregating the data according to VM owners. In thisway, multiple VM owners’ data will not be co-mingled. We also pre-serve the confidentiality of the data from malicious cloud employee by

Zawoad & Hasan 9

using public-private key based encryption, so that only users and lawenforcement agencies can view the data.

To prevent the collusion between CSP, investigators, and cloud users,Themis creates cryptographic proofs of the logs using an accumulatordata structure, such as a One-Way accumulator [22] or Bloom filter [23].The proofs of the logs are stored in the proof of log database. By usingsuch proofs, the court authority can verify whether the logs presentedto the court are valid or not.

5.2 DP Manager (Metis)

The data possession manager (Metis) collects information about datapossession (DP) from Cinder and stores the proofs of data possessionin a proof of data possession database. Cinder is augmented with anew data possession module to communicate with this new component.Metis takes care of two issues: preserves proofs of past data possessionand preserves proofs of litigation hold maintenance.

A naive way to preserve the proof of DP is to store all the datain a persistent storage. However, this will increase the storage costsignificantly. An efficient way to preserve the proofs of data possessionis using accumulator data structures [22, 23]. Using accumulators, wecan store the proof of a thousand blocks of files in a single data structure,which requires significantly lower amount space compared to preserve theblocks. Moreover, using an accumulator, Metis can preserve the proofsof data possession without revealing the original data. When the courtauthority needs to verify whether some incriminating documents belongsto a suspect, first it collects the proof of DP of the suspect. Then usingthe membership checking method of the accumulator, they can verifywhether the documents in question exist in the proof or not.

The proof of DP can also be used in the court to determine the viola-tion of litigation holds. A defendant needs to present all the documentsthat are under a litigation hold to the court. The verification methodfirst creates DP information of the documents provided by the defen-dant. It then compares the generated DP information with the proofs ofDP collected from the cloud. Any deletion of documents by the defen-dant can be detected if the generated DP information does not matchwith the collected proofs.

5.3 Timestamp Manager (Chronos)

Since it is not possible to block a VM owner to change the systemtime of a VM, or a malicious system administrator to change the systemtime of a compute node, we propose a tamper-evident scheme using the

https://www.researchgate.net/publication/225132262_One-way_Accumulators_A_Decentralized_Alternative_to_Digital_Signatures_LNCS?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==


https://www.researchgate.net/publication/220424808_SpaceTime_Trade-Offs_in_Hash_Coding_With_Allowable_Errors?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==


10

Timestamp Manager (Chronos) to protect alterations of a VM’s or Novacompute node’s timestamp.

A secure timestamp verification protocol runs between three enti-ties: Nova compute node, running VMs, and the timestamp manager(Chronos). In this stage, each entity verifies the timestamp of othersso that timestamp alteration by any entity can be detected by oth-ers. Later, traces about the verification phase are stored securely usinghash-chain scheme in a proof of timestamp database. Before beginningthe verification cycle, VM and Chronos determine the Round Trip Time(RTT) with Nova compute. Validity of a requestor’s timestamp dependson the current timestamp of verifier and the RTT values. The timestampof one requestor is attested by two other entities and each attestationis later certified by an entity other than the verifier and requestor. Anew Time Keeper module is added to the Nova compute node to handlethe timestamp verification phase. Public key encryption and signaturegeneration take place in all the communications to preserve the integrityand confidentiality of the scheme.

With this new feature included with the cloud, investigators can nowpresent the trace of timestamp verification stage to the court besides theevidence collected from the cloud. As the timestamp verification infor-mation is preserved using hash-chain scheme, adversaries cannot changethe system time without breaking the chain of the verification informa-tion. However, alteration of the verification chain can be detected atthe court while verifying the integrity of the chain. Hence, integratingChronos will ensure that the timestamp associated with the evidence istrustworthy.

5.4 Provenance Manager (Clio)

The secure provenance manager (Clio) extracts provenance records ofdata, application, and VM state from the log database as well as fromthe provenance layer of Nova compute, and the running VMs. Since theLogger node (Themis) collects logs of data modification from the blockstorage, Clio collects necessary log records to build the data provenancefrom Themis through the Provenance API. Provenance records for thevirtual file system and applications running inside the VMs are directlycollected from VMs using the same API. Finally, provenance records tocreate system level provenance of Nova compute is collected from theprovenance layer of Nova.

After collecting various provenance records, Clio applies secure prove-nance chaining [24, 25] to preserve the integrity of the provenance records.The provenance chain database stores the secure provenance informa-

https://www.researchgate.net/publication/221353810_The_Case_of_the_Fake_Picasso_Preventing_History_Forgery_with_Secure_Provenance?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/221103815_Introducing_secure_provenance_Problems_and_challenges?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

Zawoad & Hasan 11

tion. To ensure that a malicious CSP cannot modify the chain after-the-fact, head of the provenance chain will be stored periodically to theproof of chain database after some certain epoch.

5.5 Proof Publisher (Brizo)

The proof publisher node (Brizo) periodically publishes the proof oflogs, data possession, timestamp verification, and provenance chain pub-licly on the web. When proofs are publicly available, CSP or investigatorcannot alter any ESI or provide fake evidence, since proofs of those fakeevidence will not exist in the published proof. Hence, regularly publish-ing the proofs ensures the forward integrity of the evidence, i.e. the ESIfor which the proof is already published cannot be altered by any of theentities.

Information published by Brizo can be available by RSS feed to protectit from manipulation by the CSP after publishing the proofs. We canalso build a trust model by engaging other CSPs in the proof publicationprocess. Whenever one CSP publishes a proof, that proof will also beshared among other CSPs. Therefore, we can get a valid proof as longas more than 50% of the CSPs are honest.

5.6 Access to Evidence Through Horizon

To mitigate the problem of the availability of cloud-based ESI, all theESI can be collected from the OpenStack dashboard (Horizon). Hence,investigators will not need the physical access to cloud infrastructuresto acquire logs, data possession, or provenance information. Four newmodules are added with Horizon to provide user interface (UI) for Metis,Chronos, Clio, and Themis, where one module is dedicated for one com-ponent. Using these modules, users, investigators, and the court author-ity can collect activity logs, provenance, proof of data possession, andproof of timestamp.

5.7 Forensics-enabled Image

From only Nova compute and Cinder, we cannot acquire all the evi-dence required for prosecuting various types of criminal incidents. Hence,without introducing new capabilities to VMs, it will not be possible todevelop a forensics-enabled cloud completely. We proposes a forensics-enabled image for VMs, which is illustrated in Figure 2. A VM, launchedusing such image can support the required forensics features. Some ofthe modules that we propose will be inside the kernel, while some othersare inside the application layer. Below we describe the functionalities ofthese modules:

12

Figure 2. A Forensics-enabled VM Image

• Virtual File System (VFS) Monitor: This module is placed inside thekernel to trace the operations on VFS, which are mostly important tobuild the data provenance of the VM.• System Call Tracer: This module is also placed inside the kernel totrack all the system call. System call information can reveal the activityof cloud users and are also important to build the application and stateprovenance.• Kernel Communicator: This module resides inside the applicationlayer and performs as a bridge between the kernel and the applicationlayers. This module is responsible to collect information from the virtualfile system monitor and the system call tracer module of the kernel andfeeds the information to the other modules of the application layer.• Chronos Handler: This application layer module participates in thetimestamp verification step to verify the timestamp of Nova computenode and Chronos node and also to get its own timestamp verified bythe other two entities.• Themis Communicator: This module first collects the system callinformation and VFS activities from the Kernel communicator and sendsthese logs to Themis using the log API.• Clio Communicator: This module sends provenance records for ap-plications, VFS, and VM state to Clio using the provenance API. Theprovenance records are collected from the VFS monitor and system calltracer via the kernel communicator module.• Nova Communicator: Nova communicator is required to communicatebetween the compute node and VMs. This communicator module isrequired in the timestamp verification phase where Nova compute nodeand a VM verifies each others timestamp.

5.8 Preliminary Results

Till now, we developed cryptographic frameworks for Metis [10] andThemis [12]. In [10], we designed a Bloom filter based data possessionscheme – PPDP for storage-as-a-service cloud. We determined that after




Zawoad & Hasan 13

integrating the proposed scheme, a user can face 3.73 to 0.13% overheadin terms of time to upload files based on the file size and security prop-erties. This overhead actually decreases with the increase in file size andbecomes almost constant when the file size crossed 6 MB. The storageoverhead on cloud provider’s side is also low. Regardless of the file size,PPDP requires approximately 1262 bytes to preserve the proof of 1000files.

In [12], we proposed a secure logging scheme for Themis. The requiredsecurity properties are ensured by the use of accumulator and hash-chainscheme. We used Bloom filter and RSA accumulator for this work.We found that while RSA accumulator provides better security thanBloom filter, it suffers from higher overhead in terms of log insertion,verification, and storage. Our design supports O(n) time and spacecomplexity for log insertion and storage. The verification algorithm takesconstant amount of time to verify logs using both of the accumulatorschemes.

Our ongoing work on securing the system time of Nova compute andVMs using the help of Chronos reveals that we can execute the times-tamp verification cycle between these three entities in every 60 secondswhile introducing less than 1% system overhead on each entities. Werun the verification protocol between 20 VMs and one Nova computeand one Chronos for 24 hours with a verification frequency of 60 secondsand found that the system is 99.98% stable.

6. Related Work

Since isolation helps to protect evidence from contamination, Delportet al. focused on isolating an instance to mitigate the multi-tenancy issue[26]. In [27], Hay et al. showed that if a VM instance is compromisedby installing some rootkit to hide the malicious events, it is still possi-ble to identify those malicious events by performing VMI. To make theactivity logs available to customers/investigators Bark et al. proposedto expose read-only APIs by CSPs [7]. Zawoad et al. proposed a SecureLogging-as-a-Service to store VM activities, which ensures integrity andconfidentiality of logs from a malicious CSP and investigators [12]. Todetect temporal inconsistencies in a VM’s timeline, Thorpe et al. devel-oped a log auditor for the cloud environment [11].

Recently, Dykstra et al. implemented FROST, a forensic data collec-tion tool for OpenStack [13]. Using FROST, cloud users/investigatorscan acquire an image of the virtual disks associated with any of the user’svirtual machines, and validate the integrity of those images with crypto-graphic checksums. It is also possible to collect logs of all API requests






https://www.researchgate.net/publication/220803350_Isolating_a_cloud_instance_for_a_digital_forensic_investigation?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/228956044_Forensics_examination_of_volatile_system_data_using_virtual_introspection?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

14

made to CSP and OpenStack firewall logs for VMs. Provenance forcloud computing is relatively new research area and was first proposedby Muniswamy-Reddy et al. [28]. The same research group proposed asolution for gathering provenance data from Xen Hypervisor [29]. Luet al. introduced the concept of secure provenance in cloud [30]. Theyproposed a Trusted Third Party (TTP) based scheme for secure cloudprovenance, which can ensure the confidentiality of the data, unforge-ability and full anonymity of the signature, and full traceability from asignature. In [31], authors proposed to build a legal hold framework inclouds. The framework receives legal hold information indicating a legalhold applicable to modification or deletion of a document. However, thispatent did not focus on trustworthy management of litigation holds toprotect a hold from a dishonest CSP, defendant, or plaintiff.

7. Conclusion and Future Work

Till now, investigators have to depend on the CSP to collect evidencefrom clouds. To make the situation even worse, there is no way toverify whether the CSP is providing the correct evidence to the inves-tigators, or the investigators are presenting valid evidence to the court.To enable trustworthy forensics support in current cloud architectures,we identified the required properties and the challenges associated withthese properties. We designed FECloud – a forensic enabled cloud ar-chitecture on top of OpenStack, which is a widely used open sourcecloud computing framework. Implementing our proposed architecturewill preserve the trustworthiness of evidence and will eliminate the de-pendency on CSP. However, the CSPs need to come forward to adaptsuch forensics-enabled architecture.

Currently, we are working to design an efficient secure cloud prove-nance scheme for Clio and integrate all the proposed components withOpenStack. After integrating all the components, we will evaluate theoverhead and stability of different components of OpenStack using stan-dard benchmark tools of OpenStack, such as Rally2, which will deter-mine the feasibility of using the proposed architecture in a real cloudscenario.

Acknowledgment

This research was supported by the National Science Foundation CA-REER Award CNS-1351038, a Google Faculty Research Award, and theDepartment of Homeland Security Grant FA8750- 12-2-0254.

https://www.researchgate.net/publication/221353837_Making_a_Cloud_Provenance-Aware?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/230707537_Collecting_Provenance_via_the_Xen_Hypervisor?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

https://www.researchgate.net/publication/221609240_Secure_provenance_The_essential_of_bread_and_butter_of_data_forensics_in_cloud_computing?el=1_x_8&enrichId=rgreq-a36cf1f81b869c21be7eed1b5bbd6339-XXX&enrichSource=Y292ZXJQYWdlOzI2ODY4ODg5MjtBUzoyMDU1MzI0OTg1MzQ0MDBAMTQyNjAxNDE3NDE3OA==

Zawoad & Hasan 15

Notes1. http://www.openstack.org/

2. https://wiki.openstack.org/wiki/Rally

References

[1] Gartner, Gartner says that consumers will store more than a third oftheir digital content in the cloud by 2016,( http://goo.gl/39a0y),June, 2012.

[2] Market Research Media, Global cloud computing market forecast2015-2020, (http://goo.gl/AR3FBD).

[3] The Register, Amazon cloud hosts nasty banking trojan ( http:

//goo.gl/xGNkNO), 2011.

[4] Infosecurity-magazine, DDoS-ers launch attacks from Amazon EC2(http://goo.gl/vrXrHE), July 2014.

[5] www.bbc.com, Lostprophets’ Ian Watkins: ‘Tech savvy’ web haul (http://goo.gl/C8FVnC), December 2013.

[6] Dist. Court, SD Texas, “Quantlab technologies ltd. v. godlevsky CivilAction No. 4: 09-cv-4039, 2014.

[7] D. Birk and C. Wegener, Technical issues of forensic investigationsin cloud computing environments, Systematic Approaches to DigitalForensic Engineering, 2011.

[8] J. Dykstra and A. Sherman, Understanding issues in cloud forensics:Two hypothetical case studies, Journal of Network Forensics, vol. b,no. 3, pp. 19–31, 2011.

[9] S. Zawoad and R. Hasan, Digital forensics in the cloud, The Journalof Defense Software Engineering), vol. 26, no. 5, 2013.

[10] S. Zawoad and R. Hasan, Towards building proofs of past datapossession in cloud forensics, ASE Science Journal, vol. 1, no. 4, pp.195–207, 2012.

[11] S. Thorpe and I. Ray, Detecting temporal inconsistency in virtualmachine activity timelines, Journal of Information Assurance & Se-curity, vol. 7, no. 1, 2012.

[12] S. Zawoad, A. K. Dutta, and R. Hasan, SecLaaS: Secure logging-as-a-service for cloud forensics, Proc. of the 8th ASIACCS, ACM,2013.

[13] J. Dykstra and A. T. Sherman, Design and implementation of frost:Digital forensic tools for the OpenStack cloud computing platform,Digital Investigation, vol. 10, pp. S87–S95, 2013.

[14] J. W. Strong and K. S. Broun, McCormick on evidence, West Pub-lishing Company, 1992.





















16

[15] P. Mell and T. Grance, Nist cloud computing forensic science chal-lenges, Draft NISTIR 8006, June 2014.

[16] NIST, Cloud forensic science ( http://collaborate.nist.

gov/twiki-cloud-computing/bin/view/CloudComputing/

CloudForensics).[17] K. Ruan, J. Carthy, T. Kechadi, and M. Crosbie, Cloud forensics:

An overview, Proc. of IFIP WG Digital Forensics, 2011.[18] M. Bellare and B. Yee, Forward integrity for secure audit logs, Tech-

nical report, Computer Science and Engineering Department, Uni-versity of California at San Diego, Tech. Rep., 1997.

[19] Dist. Court, SD Ohio, Brown v. tellermate holdings ltd. Case No.2: 11-cv-1122, 2014.

[20] E. Casey, Error, uncertainty, and loss in digital evidence, Interna-tional Journal of Digital Evidence, vol. 1, no. 2, pp. 1–45, 2002.

[21] R. Hasan, R. Sion, and M. Winslett, Preventing history forgery withsecure provenance, ACM TOS, vol. 5, no. 4, pp. 1–43, 2009.

[22] J. Benaloh and M. De Mare, One-way accumulators: A decentralizedalternative to digital signatures, Proc. of EUROCRYPT, Springer,pp. 274–285, 1994.

[23] B. Bloom, Space/time trade-offs in hash coding with allowable er-rors, Communications of the ACM, vol. 13, no. 7, pp. 422–426, 1970.

[24] R. Hasan, R. Sion, and M. Winslett, The case of the fake Picasso:Preventing history forgery with secure provenance, Proc. of FAST,USENIX, pp. 1–12, 2009.

[25] R. Hasan, R. Sion, and M. Winslett, Introducing secure provenance:problems and challenges, Proc. of StorageSS, ACM, pp. 13–18, 2007.

[26] M. K. Waldo Delport, Martin S. Olivier, Isolating a cloud instancefor a digital forensic investigation, Proc. of ISSA, 2011.

[27] B. Hay and K. Nance, Forensics examination of volatile system datausing virtual introspection, ACM SIGOPS Operating Systems Re-view, vol. 42, no. 3, pp. 74–82, 2008.

[28] K. Muniswamy-Reddy, P. Macko, and M. Seltzer, Making a cloudprovenance-aware, Proc. of TaPP, USENIX, 2009.

[29] M. Seltzer, P. Macko, and M. Chiarini, Collecting provenance viathe Xen hypervisor, Proc. of TaPP, USENIX, 2011.

[30] R. Lu, X. Lin, X. Liang, and X. Shen, Secure provenance: Theessential of bread and butter of data forensics in cloud computing,Proc. of the 5th ASIACCS, ACM, pp. 282–292, 2010.

[31] O. Schmidt, Managing a legal hold on cloud documents, U.S. PatentApp. 13/543,254, 2012.
































Documents

FECloud: A Trustworthy Forensics-Enabled Cloud Architecture