-
Notifications
You must be signed in to change notification settings - Fork 9
/
tools.html
202 lines (180 loc) · 11.6 KB
/
tools.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
<!doctype html>
<html lang="fr">
<html prefix="og: https://ogp.me/ns#">
<head>
<title>Our Tools | HTR-United</title>
<!-- block header -->
<meta charset="utf-8">
<meta name="viewport" content="width=device-width, initial-scale=1">
<meta name="author" content="Alix Chagué">
<meta name="author" content="Thibault Clérice">
<meta name="description" content="HTR-United is a catalog and an ecosystem for sharing and finding ground truth for optical character or handwritten text recognition (OCR/HTR)." />
<title>HTR-United</title>
<meta property="og:description"
content="HTR-United is a catalog and an ecosystem for sharing and finding ground truth for optical character or handwritten text recognition (OCR/HTR)." />
<meta property="og:locale" content="en_GB" />
<meta property="og:locale:alternate" content="fr_FR" />
<meta property="og:site_name" content="HTR-United" />
<meta property="og:image" content="https://htr-united.github.io/assets/images/logo_htr-united.png" />
<meta property="og:type" content="website" />
<meta property="og:title" content="HTR-United" />
<!-- Bootstrap core CSS -->
<link href="static/css/bootstrap.min.css" rel="stylesheet">
<!-- Favicons -->
<link rel="icon" href="assets/images/favicon.ico" />
<!-- Icon library -->
<link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/font-awesome/5.15.4/css/all.min.css">
<!-- Custom styles for this template -->
<link href="static/css/custom.css" rel="stylesheet">
<script type="text/javascript">
(function() {
document.head.querySelector("meta[property='og:title']").setAttribute("content", window.title);
})();
</script>
<!-- endblock header -->
</head>
<body id="top">
<!-- navbar -->
<nav class="navbar navbar-expand-lg navbar-light bg-light rounded fixed-top d-flex flex-wrap align-items-center justify-content-center justify-content-md-between border-bottom py-3 mb-4 px-5 shadow-lg">
<a class="navbar-brand" href="index.html">HTR-United</a>
<a class="navbar-brand" href="https://github.com/HTR-United"><i class="fab fa-github"></i></a>
<button class="navbar-toggler" type="button" data-toggle="collapse" data-target="#navbar_htr-united" aria-controls="navbar_htr-united" aria-expanded="false" aria-label="Toggle navigation">
<span class="navbar-toggler-icon"></span><!--<i class="far fa-minus-square"></i>-->
</button>
<div class="collapse navbar-collapse" id="navbar_htr-united">
<ul class="navbar-nav my-2 my-md-0 mr-md-3 col-sm-10">
<li class="nav-item">
<span class="mx-4"></span>
</li>
<li class="nav-item">
<a class="nav-link btn btn-nav me-2" href="catalog.html"><i class="fas fa-search"></i> <i18n vanilla-i18n="nav.catalog">Explorer le catalogue</i18n></a>
</li>
<li class="nav-item">
<a class="nav-link btn btn-nav me-2" href="document-your-data.html"><i class="fas fa-edit"></i> <i18n vanilla-i18n="nav.form">Enregistrer des données</i18n></a>
</li>
<li class="nav-item">
<a class="nav-link btn btn-nav me-2" href="tools.html"><i class="fas fa-hammer"></i> <i18n vanilla-i18n="nav.tools">Les Outils</i18n></a>
</li>
<li class="nav-item">
<a class="nav-link btn btn-nav me-2" href="actions.html"><i class="fas fa-cogs"></i> <i18n vanilla-i18n="nav.actions">Automatisation Github</i18n></a>
</li>
<li class="nav-item">
<a class="nav-link btn btn-nav me-2" href="team.html"><i class="fas fa-child"></i> <i18n vanilla-i18n="nav.team">L'Équipe</i18n></a>
</li>
</ul>
<div class="col-sm-2 text-end">
<select id="vanilla-i18n-toggler">
<option>Français</option>
<option>English</option>
</select>
</div>
</div>
</nav>
<!-- Main -->
<main>
<div class="px-4 py-5 my-5" id="main">
<div class="container">
<h1><i18n vanilla-i18n="tools.title">Tools</i18n></h1>
<p vanilla-i18n="tools.intro">In order to produce good data and to ensure an easy way to control or quantify the data produced, we offer a certain set of tools with tutorials on how to use them both in a standalone way or in a continuous integration way.</p>
<section class="row">
<div class="col-offset-1 col text-justify">
<h2>HUMGenerator <a href="https://github.com/HTR-United/htr-united-metadata-generator"><span class="fa fa-link"></span></a></h2>
<p><i18n vanilla-i18n="tools.humg">HUMGenerator (<em>H</em>tr <em>U</em>nited <em>M</em>etadata <em>G</em>enerator) generates metadata, specifically metrics for your corpora. It's specifically being used to generate the <code>volume</code> key of the catalog schema. It can generate a character count table, to help users choose new kind of ground truth to cover more characters.</i18n></p>
<p style="text-align:right;"><a href="./demo/humg.html" data-target="#demo-humg" class="toggle-demo btn btn-sm btn-primary"><span class="fa fa-desktop"></span> <span vanilla-i18n="tools.seeDemo">See demo</span></a></p>
</div>
<div class="col-md-2"><a href="https://github.com/HTR-United/htr-united-metadata-generator">
<img class="img-fluid rounded" alt="HUM Generator" src="./static/img/humgenerator.min.png"/>
</a>
</div>
<section id="demo-humg" class="d-none">
<iframe width="100%" height="400px"></iframe>
</section>
</section>
<hr />
<section class="row">
<div class="col-md-2">
<a href="https://github.com/HTR-United/htruc">
<img src="./static/img/HTRUC.min.png" class="img-fluid rounded" alt="HTRUC" />
</a>
</div>
<div class="col-offset-1 col text-justify">
<h2>HTRUC <a href="https://github.com/HTR-United/htruc"><span class="fa fa-link"></span></a></h2>
<p><i18n vanilla-i18n="tools.htruc">HTRUC allows for controlling that the htr-united catalog file is parsable and HTR-United compliant. HTRUC is also meant to offer the tools to parse, augment and compile statistics about catalog file(s). This allows the HTR-United central repository to build a general catalog for everyone's use, see <a href="https://github.com/HTR-United/htr-united/blob/master/htr-united.yml">the main catalog</a>. It can also be used with <em>HUMGenerator</em> to update a catalog file with new volumes. </i18n></p>
</div>
</section>
<hr />
<section class="row">
<div class="col-offset-1 col text-justify">
<h2>HTRVX <a href="https://github.com/HTR-United/htrvx"><span class="fa fa-link"></span></a></h2>
<p><i18n vanilla-i18n="tools.htrvx">HTRVX focuses on controlling the quality of the XML. It provides many options in terms of quality control: schema validation (checking that your ALTO or PAGE is okay), empty line or children-less region detection, Segmonto compatibility for segmentation (see <a href="https://segmonto.github.io/">the documentation on the Segmonto controlled vocabulary</a>)</i18n></p>
<p style="text-align:right;"><a href="./demo/htrvx.html" data-target="#demo-htrvx" class="toggle-demo btn btn-sm btn-primary"><span class="fa fa-desktop"></span> <span vanilla-i18n="tools.seeDemo">See demo</span></a></p>
</div>
<div class="col-md-2">
<a href="https://github.com/HTR-United/htrvx"><img src="./static/img/htrvx.min.png" class="img-fluid rounded" alt="HTRVX" /></a>
</div>
</section>
<section id="demo-htrvx" class="d-none">
<iframe width="100%" height="400px"></iframe>
</section>
<hr />
<section class="row">
<div class="col-md-2"><a href="https://github.com/ponteIneptique/choco-mufin"><img class="img-fluid rounded" src="./static/img/chocomufin.min.png" alt="ChocoMufin"></a>
</div>
<div class="col-offset-1 col text-justify">
<h2>ChocoMufin <a href="https://github.com/ponteIneptique/choco-mufin"><span class="fa fa-link"></span></a></h2>
<p><i18n vanilla-i18n="tools.chocomufin">ChocoMufin focuses on the characters that are used in the ground truth. It can either help you have an overview of the character used, can be used to control which one are used (to avoid having two characters for the same purpose, which can be common with historical documents) or, using a conversion table, it can be used to normalize a whole set of ground-truth.</i18n></p>
</div>
</section>
</div>
</div>
</main>
<!-- New Footer -->
<footer class="footer mt-auto border-top bg-light" id="footer">
<div class="row bg-light pb-5">
<div class="col-1 col-md">
</div>
<div class="col-10 col-md">
<h5 class="pb-2 py-4">HTR-United <small class="mb-1 text-muted">· 2020–2024</small></h5>
<ul class="list-unstyled text-small">
<li class="mb-1"><a class="link-secondary text-decoration-none" href="data-reuse-charter.html"><i18n vanilla-i18n="nav.charter">Charte de réutilisation des données</i18n></a></li>
<li class="mb-1"><a class="link-secondary text-decoration-none" href="changelog.html"><i18n vanilla-i18n="footer.changelog">Changelog du schéma</i18n></a></li>
</ul>
<h5><i18n vanilla-i18n="footer.about">À propos</i18n></h5>
<ul class="list-unstyled text-small">
<li class="mb-1"><a class="link-secondary text-decoration-none" href="team.html"><i18n vanilla-i18n="footer.team">Qui est derrière HTR-United ?</i18n></a></li>
</ul>
</div>
<div class="col-5 col-md">
</div>
</div>
</footer>
<!-- Bootstrap core JavaScript
================================================== -->
<!-- Placed at the end of the document so the pages load faster -->
<script src="https://code.jquery.com/jquery-3.3.1.slim.min.js" integrity="sha384-q8i/X+965DzO0rT7abK41JStQIAqVgRVzpbzo5smXKp4YfRvH+8abtTE1Pi6jizo" crossorigin="anonymous"></script>
<!--<script>window.jQuery || document.write('<script src="../../assets/js/vendor/jquery-slim.min.js"><\/script>')</script>-->
<script src="https://cdnjs.cloudflare.com/ajax/libs/popper.js/1.14.7/umd/popper.min.js" integrity="sha384-UO2eT0CpHqdSJQ6hJty5KVphtPhzWj9WO1clHTMGa3JDZwrnQq4sF86dIHNDz0W1" crossorigin="anonymous"></script>
<script src="https://stackpath.bootstrapcdn.com/bootstrap/4.3.1/js/bootstrap.min.js" integrity="sha384-JjSmVgyd0p3pXB1rRibZUAYoIIy6OrQ6VrjIEaFf/nJGzIxFDsf4x0xIM+B07jRM" crossorigin="anonymous"></script>
<!--<script src="static/js/bootstrap.bundle.min.js" integrity="sha384-p34f1UUtsS3wqzfto5wAAmdvj+osOnFyQFpp4Ua3gs/ZVWx6oOypYoCJhGGScy+8" crossorigin="anonymous"></script>-->
<!--<script src="/docs/5.0/dist/js/bootstrap.bundle.min.js" integrity="sha384-p34f1UUtsS3wqzfto5wAAmdvj+osOnFyQFpp4Ua3gs/ZVWx6oOypYoCJhGGScy+8" crossorigin="anonymous"></script>-->
<!-- i18n -->
<script type="text/javascript" src="static/js/i18n-advanced.js"></script>
<script type="text/javascript" src="static/js/i18n.js"></script>
<!-- end of i18n -->
<script type="text/javascript">
(function() {
document.querySelectorAll(".toggle-demo").forEach((el) => {
el.addEventListener("click", function(e) {
e.preventDefault();
console.log(el.getAttribute("data-target"));
const iFrameSrc = el.href,
iFrameContainer = document.querySelector(el.getAttribute("data-target")),
iFrame = iFrameContainer.querySelector("iframe");
iFrameContainer.classList.remove("d-none");
iFrame.src = iFrameSrc;
})
});
})();
</script>
</body>
</html>