Home
Browse
Proteins (PIs)
Domains (IRDs)
Reactive Loops (RCL)
Linkers
Target Protease
Taxonomy
Domain Architecture
Data Analysis
Proteins (PIs)
Domains (IRDs)
Reactive Loops (RCL)
Linkers
Target Protease
Search
Downloads
Contact Us
Proteins
Proteins (PIs)
Proteins details
PINIR_51
[
Q1WL39
] :
Trypsin proteinase inhibitor
397
Length
Nicotiana pauciflora
Specie
PI
Gene Name
43846
Mass
11
Domain Count
Overview
Function
Domains
Amino acid (aa) Composition
Visualization
Sequence :
FASTA Sequence copied to clipboard!
×
FASTA
MAVHRVSFLA
LLLLFGMSLL
VSNVEHADAM
ACPFNCDPRI
AYEVCPRTEE
KKNDRICTNC
CAGTKGCKYF
70
SDDGTFICEG
ESDPRNPKAC
PRNCDGRIAY
GICPLSEEKK
NDRICTNCCV
GTKGCKYFSD
DGTFICEGES
140
DPRNPKACPR
NCDGRIAYGI
CPRTEEKKND
RICTNCCAGT
KGCKYFSDDG
TFICEGESDP
RNPKACPRNC
210
DGRIAYGICP
RTEEKKNDRI
CTNCCAGTKG
CKYFSDDGTF
VCEGESDPRN
PKACPRNCDG
RIAYGICPRT
280
EEKKNDRICT
NCCAGTKGCK
YFSDDGTFVC
EGESDPRNPK
ACPRNCDERI
AYGICPRTEE
KKNNQICTNC
350
CAGTKGCNYF
SANGTFICEG
ESEYVSKVDE
YVHEVENDLQ
KSRVAVS
397
SIGNAL PEPTIDE
SEQUENCE
Position
MAVHRVSFLALLLLFGMSLLVSNVEHADA
1 ↔ 29
DISULPHIDE BOND
Disulfidebondtype
Position
Disulfide bond 1
3 ↔ 28
Disulfide bond 1
3 ↔ 40
Disulfide bond 2
6 ↔ 24
Disulfide bond 2
7 ↔ 32
Disulfide bond 3
7 ↔ 36
Disulfide bond 3
16 ↔ 38
Disulfide bond 4
13 ↔ 49
Disulfide bond 4
31 ↔ 49
DOMAINS ARCHITECTURE
View diagram
Domain
Position
Length
Architecture
IRD-169
287 - 338
52
IRD-193
55 - 106
52
IRD-194
113 - 164
52
IRD-195
171 - 222
52
IRD-196
229 - 280
52
IRD-619
320 - 370
51
IRD-629
88 - 138
51
IRD-630
146 - 196
51
IRD-631
204 - 254
51
IRD-631
262 - 312
51
IRD-667
30 - 80
51
ISO - ELECTRIC POINT
Iso Eelectric Point
charge at pH 7.4
charge at pH 5.5
charge at pH 8.0
5.754
2.3
-22.4
-39.8
CROSS REFERENCES
Uniprot
Interpro Id
Pfam Id
EMBL
Q1WL39
View PI in InterPro
IPR003465
View protein in Pfam
PF02428
DQ158193
REFERENCES
PubMed ID:
16534618
|
Application:
REACTIVE LOOPS
Entry
RCL Sequence
Target Protease
Position
RL-1
CPRNC
Trypsin
91 - 95
RL-1
CPRNC
Trypsin
91 - 95
RL-1
CPRNC
Trypsin
149 - 153
RL-1
CPRNC
Trypsin
149 - 153
RL-1
CPRNC
Trypsin
207 - 211
RL-1
CPRNC
Trypsin
207 - 211
RL-1
CPRNC
Trypsin
265 - 269
RL-1
CPRNC
Trypsin
265 - 269
RL-1
CPRNC
Trypsin
323 - 327
RL-1
CPRNC
Trypsin
323 - 327
RL-5
CPFNC
Chymotrypsin
33 - 37
GO TERMS
Molecular Function
Cellular Component
Biological Process
serine-type endopeptidase inhibitor activity (GO:0004867)
None
None
SPATIO TEMPORAL DISTRIBUTION
Tissue Distribution
Induction
Elicitor Molecules
Target Insect
Remarks
Reference
No Records Found
BIO-CHEMICAL PROPERTIES
Ki (uM)
IC50 (uM)
Target Protease
Cross Reactivity
Remarks
Reference
No Records Found
BIO-PHYSICAL PROPERTIES
Affinity
Binding Energy
Stability
Remarks
Reference
No Records Found
DOMAIN STRUCTURE FOLD
Beta Sheet Numbers
Alpha Helix Numbers
Coil Numbers
Disulphide Bonds Numbers
Remarks
Reference
No Records Found
Entry
Domain Position
RCL
RCL Position
Linker
Linker Position
Domain Type
IRD-169
287 - 338
CPRNC
322 - 326
DPRNP
315 - 319
H-L Type (Type-I)
IRD-193
55 - 106
CPRNC
90 - 94
DPRNP
83 - 87
H-L Type (Type-I)
IRD-194
113 - 164
CPRNC
148 - 152
DPRNP
141 - 145
H-L Type (Type-I)
IRD-195
171 - 222
CPRNC
206 - 210
DPRNP
199 - 203
H-L Type (Type-I)
IRD-196
229 - 280
CPRNC
264 - 268
DPRNP
257 - 261
H-L Type (Type-I)
IRD-619
320 - 370
CPRNC
322 - 326
EEKKN
339 - 343
L-H Type (Type-II)
IRD-629
88 - 138
CPRNC
90 - 94
EEKKN
107 - 111
L-H Type (Type-II)
IRD-630
146 - 196
CPRNC
148 - 152
EEKKN
165 - 169
L-H Type (Type-II)
IRD-631
204 - 254
CPRNC
206 - 210
EEKKN
223 - 227
L-H Type (Type-II)
IRD-631
262 - 312
CPRNC
264 - 268
EEKKN
281 - 285
L-H Type (Type-II)
IRD-667
30 - 80
CPFNC
32 - 36
EEKKN
49 - 53
L-H Type (Type-II)
Domain(
IRD-169
)
:
RICTNCCAGTKGCKYFSDDGTFVCEGES
DPRNP
KA
CPRNC
DERIAYGICPRT
Reactive Loop(RCL)
:
CPRNC
Linker
:
DPRNP
Domain Type
:
H-L Type (Type-I)
Disulphide Bond
:
3,40 ; 6,24 ; 7,36 ; 13,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
E
R
I
A
Y
G
I
C
P
R
T
Domain(
IRD-193
)
:
RICTNCCAGTKGCKYFSDDGTFICEGES
DPRNP
KA
CPRNC
DGRIAYGICPLS
Reactive Loop(RCL)
:
CPRNC
Linker
:
DPRNP
Domain Type
:
H-L Type (Type-I)
Disulphide Bond
:
3,40 ; 6,24 ; 7,36 ; 13,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
L
S
Domain(
IRD-194
)
:
RICTNCCVGTKGCKYFSDDGTFICEGES
DPRNP
KA
CPRNC
DGRIAYGICPRT
Reactive Loop(RCL)
:
CPRNC
Linker
:
DPRNP
Domain Type
:
H-L Type (Type-I)
Disulphide Bond
:
3,40 ; 6,24 ; 7,36 ; 13,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
R
I
C
T
N
C
C
V
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
Domain(
IRD-195
)
:
RICTNCCAGTKGCKYFSDDGTFICEGES
DPRNP
KA
CPRNC
DGRIAYGICPRT
Reactive Loop(RCL)
:
CPRNC
Linker
:
DPRNP
Domain Type
:
H-L Type (Type-I)
Disulphide Bond
:
3,40 ; 6,24 ; 7,36 ; 13,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
Domain(
IRD-196
)
:
RICTNCCAGTKGCKYFSDDGTFVCEGES
DPRNP
KA
CPRNC
DGRIAYGICPRT
Reactive Loop(RCL)
:
CPRNC
Linker
:
DPRNP
Domain Type
:
H-L Type (Type-I)
Disulphide Bond
:
3,40 ; 6,24 ; 7,36 ; 13,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
Domain(
IRD-619
)
:
KA
CPRNC
DERIAYGICPRT
EEKKN
NQICTNCCAGTKGCNYFSANGTFICEG
Reactive Loop(RCL)
:
CPRNC
Linker
:
EEKKN
Domain Type
:
L-H Type (Type-II)
Disulphide Bond
:
3,28 ; 7,32 ; 16,38 ; 31,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
K
A
C
P
R
N
C
D
E
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
N
Q
I
C
T
N
C
C
A
G
T
K
G
C
N
Y
F
S
A
N
G
T
F
I
C
E
G
Domain(
IRD-629
)
:
KA
CPRNC
DGRIAYGICPLS
EEKKN
DRICTNCCVGTKGCKYFSDDGTFICEG
Reactive Loop(RCL)
:
CPRNC
Linker
:
EEKKN
Domain Type
:
L-H Type (Type-II)
Disulphide Bond
:
3,28 ; 7,32 ; 16,38 ; 31,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
L
S
E
E
K
K
N
D
R
I
C
T
N
C
C
V
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
Domain(
IRD-630
)
:
KA
CPRNC
DGRIAYGICPRT
EEKKN
DRICTNCCAGTKGCKYFSDDGTFICEG
Reactive Loop(RCL)
:
CPRNC
Linker
:
EEKKN
Domain Type
:
L-H Type (Type-II)
Disulphide Bond
:
3,28 ; 7,32 ; 16,38 ; 31,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
Domain(
IRD-631
)
:
KA
CPRNC
DGRIAYGICPRT
EEKKN
DRICTNCCAGTKGCKYFSDDGTFVCEG
Reactive Loop(RCL)
:
CPRNC
Linker
:
EEKKN
Domain Type
:
L-H Type (Type-II)
Disulphide Bond
:
3,28 ; 7,32 ; 16,38 ; 31,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
Domain(
IRD-631
)
:
KA
CPRNC
DGRIAYGICPRT
EEKKN
DRICTNCCAGTKGCKYFSDDGTFVCEG
Reactive Loop(RCL)
:
CPRNC
Linker
:
EEKKN
Domain Type
:
L-H Type (Type-II)
Disulphide Bond
:
3,28 ; 7,32 ; 16,38 ; 31,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
Domain(
IRD-667
)
:
MA
CPFNC
DPRIAYEVCPRT
EEKKN
DRICTNCCAGTKGCKYFSDDGTFICEG
Reactive Loop(RCL)
:
CPFNC
Linker
:
EEKKN
Domain Type
:
L-H Type (Type-II)
Disulphide Bond
:
3,28 ; 7,32 ; 16,38 ; 31,49
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
M
A
C
P
F
N
C
D
P
R
I
A
Y
E
V
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
DOMAINS ARCHITECTURE
Domain
Position
Length
Architecture
IRD-169
287 - 338
52
IRD-193
55 - 106
52
IRD-194
113 - 164
52
IRD-195
171 - 222
52
IRD-196
229 - 280
52
IRD-619
320 - 370
51
IRD-629
88 - 138
51
IRD-630
146 - 196
51
IRD-631
204 - 254
51
IRD-631
262 - 312
51
IRD-667
30 - 80
51
Amino Acid
Count
% by Frequency
Alanine
23
5.79
Cysteine
48
12.09
Aspartic acid
29
7.3
Glutamic acid
31
7.81
Phenylalanine
15
3.78
Glycine
34
8.56
Histidine
3
0.76
Isoleucine
21
5.29
Lysine
30
7.56
Leucine
9
2.27
Methionine
3
0.76
Asparagine
28
7.05
Proline
23
5.79
Glutamine
2
0.5
Arginine
28
7.05
Serine
19
4.79
Threonine
23
5.79
Valine
14
3.53
Tryptophan
0
0
Tyrosine
14
3.53
Architecture
×
Signal Peptide
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
M
A
V
H
R
V
S
F
L
A
L
L
L
L
F
G
M
S
L
L
V
S
N
V
E
H
A
D
A
M
A
C
P
F
N
C
D
P
R
I
A
Y
E
V
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
L
S
E
E
K
K
N
D
R
I
C
T
N
C
C
V
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
E
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
N
Q
I
C
T
N
C
C
A
G
T
K
G
C
N
Y
F
S
A
N
G
T
F
I
C
E
G
E
S
E
Y
V
S
K
V
D
E
Y
V
H
E
V
E
N
D
L
Q
K
S
R
V
A
V
S
A
V
H
R
V
S
F
L
A
L
L
L
L
F
G
M
S
L
L
V
S
N
V
E
H
A
D
A
M
Domains
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
M
A
V
H
R
V
S
F
L
A
L
L
L
L
F
G
M
S
L
L
V
S
N
V
E
H
A
D
A
M
A
C
P
F
N
C
D
P
R
I
A
Y
E
V
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
L
S
E
E
K
K
N
D
R
I
C
T
N
C
C
V
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
I
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
G
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
D
R
I
C
T
N
C
C
A
G
T
K
G
C
K
Y
F
S
D
D
G
T
F
V
C
E
G
E
S
D
P
R
N
P
K
A
C
P
R
N
C
D
E
R
I
A
Y
G
I
C
P
R
T
E
E
K
K
N
N
Q
I
C
T
N
C
C
A
G
T
K
G
C
N
Y
F
S
A
N
G
T
F
I
C
E
G
E
S
E
Y
V
S
K
V
D
E
Y
V
H
E
V
E
N
D
L
Q
K
S
R
V
A
V
S