forked from psal/anonymouth
Anonymouth
-Document Anonymization Tool
-Version: 0.5
----------------------------------------------------
The Privacy, Security and Automation Lab (PSAL)
Drexel University, Philadelphia PA
http://psal.cs.drexel.edu/
----------------------------------------------------
P.I.
Dr. Rachel Greenstadt
Developed by:
Andrew W.E. McDonald
Marc Barrowclift
Jeff Ulman
Joe Muoio
====================================================
-------------------INTRODUCTION---------------------
====================================================
Anonymouth is a Java-based application that aims to
give users the tools and knowledge needed to begin
anonymizing documents they have written.
It does this by using the JStylo libraries (an
author detection application also developed by PSAL)
to detect stylometric patterns and determine
features (like word length, bigrams, trigrams, etc.)
that the user should remove/add to help obscure
their style and identity.
Though the Anonymouth team works hard to provide
you with tools to help remove your identity from
documents you have written, WE CAN IN NO WAY
GUARANTEE THAT YOUR DOCUMENT IS ANONYMOUS OR NOT
ANONYMOUS. Anonymouth always gives you its best
guess, its best idea of where your document stands,
but that should not at any time be taken as an
absolute (for example, you could have forgotten to
remove your name from the document, and Anonymouth
has no way to know that it is your name and should
be removed). What we can say is that Anonymouth is
only as good as you make it, and when used correctly
it can be helpful in guiding your document in the
right direction.
====================================================
-------------------INSTALLATION---------------------
====================================================
The quickest and easiest way to install Anonymouth
is to clone or download the zip of the Anonymouth
GitHub project here:
https://github.com/psal/anonymouth
Then import it as an existing project into Eclipse
(or clone and import directly within Eclipse if you
have the Eclipse EGit plugin).
This is currently the only ready way to compile and
run Anonymouth. We will be including an updated
build file soon so that you can build and run
Anonymouth easily from the command line, but it
hasn't been done yet, so this is the only option for
now.
====================================================
-------------------BUILD AND RUN--------------------
====================================================
Once Anonymouth is all set up in Eclipse, you need
only run ThePresident from the package
edu.drexel.psal.anonymouth.gooie to begin using it.
Please note that there are two main package
categories, JStylo and Anonymouth. The majority of
Anonymouth development should happen in the
Anonymouth packages, since Anonymouth simply uses
the JStylo libraries for parts of the initial
document processing. Beginners need only concern
themselves with the Anonymouth packages.
====================================================
-------------------DEPENDENCIES---------------------
====================================================
Java 7 is required to run Anonymouth. If you don't
yet have it, get it at Oracle's website here and
follow the installation instructions there:
http://java.com/en/download/index.jsp
If you are unsure whether or not you have it
installed, follow these steps to see:
OS X:
    -Open Terminal (Applications/Utilities).
    -Type "java -version" without the quotes.
    -If you see something like:
        java version "1.7.x_xx"
     then you're ready to go! If not, you most
     likely don't have Java 7 installed, in which
     case you should go to the download link above.
WINDOWS:
    -Follow the instructions here:
        http://www.java.com/en/download/help/version_manual.xml
     If you have version "1.7.x_xx", then you're
     good to go! If not, you most likely don't have
     Java 7 installed, in which case you should go
     to the download link above.
If you are using Eclipse, also make sure that Java
7 is your selected compiler (Preferences/Java/
Compiler) and is an included library in your Java
Build Path (Not sure how to do this? Google is your
friend).
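The version check above can also be done from Java
itself. The following is only a minimal sketch and
is not part of Anonymouth: the class name
JavaVersionCheck and its methods are made up for
illustration. It reads the standard "java.version"
system property and extracts the major version,
handling both the older "1.7.0_xx" form and the
newer "17.0.2" form. This README only confirms
Java 7, so the message printed for newer versions is
an assumption, not a tested claim.

```java
// Illustrative sketch; JavaVersionCheck is a hypothetical helper,
// not a class from the Anonymouth or JStylo packages.
class JavaVersionCheck {

    /**
     * Extracts the major version from a "java.version" string.
     * Legacy strings such as "1.7.0_80" carry the major version in
     * the second field; newer strings such as "17.0.2" carry it in
     * the first field.
     */
    static int majorVersion(String version) {
        String[] parts = version.split("[._-]");
        int first = Integer.parseInt(parts[0]);
        if (first == 1 && parts.length > 1) {
            return Integer.parseInt(parts[1]);
        }
        return first;
    }

    public static void main(String[] args) {
        String version = System.getProperty("java.version");
        int major = majorVersion(version);
        if (major == 7) {
            System.out.println("Java " + version + ": matches the required Java 7");
        } else if (major > 7) {
            // Assumption: the README does not say whether newer versions work.
            System.out.println("Java " + version + ": newer than Java 7 (not confirmed by the README)");
        } else {
            System.out.println("Java " + version + ": older than Java 7, please install Java 7");
        }
    }
}
```

To try it, save the class as JavaVersionCheck.java,
then run "javac JavaVersionCheck.java" followed by
"java JavaVersionCheck".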
----------------------------------------------------
Anonymouth requires the included jsan_resources
directory in its running directory (the main
Anonymouth directory containing lib, src, etc.). It
should be in the correct directory by default.
----------------------------------------------------
Anonymouth requires a corpus (basically a database
of other authors and documents they have written) to
run. It needs this so it can classify your documents
with respect to these other documents and their
styles, which lets Anonymouth give you an idea of
how anonymous it thinks your document is and what
features to remove/add to help you get there. Three
different corpora are included in the project
directory for you to choose from. They are located
at:
./anonymouth/jsan_resources/corpora/amt
./anonymouth/jsan_resources/corpora/drexel_1
./anonymouth/jsan_resources/enron_demo
Though we included corpora, you are more than
welcome to use any other corpus you may have. It is
recommended to try many different combinations of
authors so you can get the best possible picture of
how anonymous your document is with respect to
others.
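Before the first run it can help to confirm that
the bundled corpora are where the list above says
they are. The sketch below is only an illustration
and is not part of Anonymouth: the class CorpusCheck
and its methods are hypothetical. The three paths
are copied from the list above and are resolved
against a base directory (the directory that
contains the "anonymouth" checkout). The count of
sub-directories is printed on the assumption that a
corpus holds one folder per author, which this
README does not confirm.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch; CorpusCheck is a hypothetical helper,
// not a class from the Anonymouth or JStylo packages.
class CorpusCheck {

    // Corpus locations as listed in this README, relative to the
    // directory that contains the "anonymouth" project folder.
    static final String[] CORPORA = {
        "anonymouth/jsan_resources/corpora/amt",
        "anonymouth/jsan_resources/corpora/drexel_1",
        "anonymouth/jsan_resources/enron_demo"
    };

    /** Returns the listed corpus paths that are not directories under base. */
    static List<String> missingCorpora(File base) {
        List<String> missing = new ArrayList<String>();
        for (String relative : CORPORA) {
            if (!new File(base, relative).isDirectory()) {
                missing.add(relative);
            }
        }
        return missing;
    }

    /**
     * Counts the immediate sub-directories of dir.
     * Assumption: one sub-directory per author; the real layout may differ.
     */
    static int countSubdirectories(File dir) {
        File[] children = dir.listFiles();
        if (children == null) {
            return 0; // not a directory, or not readable
        }
        int count = 0;
        for (File child : children) {
            if (child.isDirectory()) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        File base = new File(args.length > 0 ? args[0] : ".");
        List<String> missing = missingCorpora(base);
        for (String relative : CORPORA) {
            if (missing.contains(relative)) {
                System.out.println("MISSING  " + relative);
            } else {
                int folders = countSubdirectories(new File(base, relative));
                System.out.println("found    " + relative + " (" + folders + " sub-directories)");
            }
        }
    }
}
```

Run it from the directory that contains the
anonymouth folder, or pass that directory as the
first argument.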
----------------------------------------------------
Anonymouth also needs the following jars in the lib
directory (everything should already be included):
weka 3.7.9
fasttag 2.0
Jama 1.0.3
jaws 1.3
jcommon 1.0
freechart 1.0.14
jgaap 5.4.0
microsoft translator 0.6.1
miglayout 4.0
tt4j 1.0.15
Stanford postagger -
ui -
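If you want to double check the lib directory
against the list above, a rough scan by file name
is usually enough. The sketch below is an
illustration only and is not part of Anonymouth: the
class LibCheck is hypothetical, and the name
fragments are guesses derived from the list above,
since this README does not give the actual jar file
names. The "ui" entry is left out because such a
short fragment would match unrelated jars. Versions
are not checked.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Illustrative sketch; LibCheck is a hypothetical helper,
// not a class from the Anonymouth or JStylo packages.
class LibCheck {

    // Assumed name fragments, derived from the dependency list in
    // this README. The real jar file names may differ.
    static final String[] FRAGMENTS = {
        "weka", "fasttag", "jama", "jaws", "jcommon", "freechart",
        "jgaap", "translator", "miglayout", "tt4j", "postagger"
    };

    /**
     * Returns the fragments for which no ".jar" file name in
     * fileNames contains the fragment (case-insensitive).
     */
    static List<String> missingFragments(String[] fileNames) {
        List<String> missing = new ArrayList<String>();
        for (String fragment : FRAGMENTS) {
            boolean found = false;
            for (String name : fileNames) {
                String lower = name.toLowerCase(Locale.ROOT);
                if (lower.endsWith(".jar") && lower.contains(fragment)) {
                    found = true;
                    break;
                }
            }
            if (!found) {
                missing.add(fragment);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        File lib = new File(args.length > 0 ? args[0] : "lib");
        String[] names = lib.list();
        if (names == null) {
            System.out.println(lib + " is not a readable directory");
            return;
        }
        List<String> missing = missingFragments(names);
        if (missing.isEmpty()) {
            System.out.println("Every listed dependency has a matching jar name");
        } else {
            System.out.println("No jar name matched: " + missing);
        }
    }
}
```

Run it from the main Anonymouth directory, or pass
the path of the lib directory as the first argument.
A reported miss only means that no file name
matched, so check the directory by hand before
assuming a jar is absent.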
====================================================
---------------------LICENSE------------------------
====================================================
License V2:
JStylo was released by the Privacy, Security and
Automation Lab at Drexel University in 2011 under
the AGPLv3 license. A copy of this license is
included with the repository/program. If for some
reason it is absent, it can be viewed here:
http://www.gnu.org/licenses/agpl.html