mercury

mirror of https://github.com/Mercury-Language/mercury.git synced 2026-04-22 04:43:53 +00:00

Author	SHA1	Message	Date
Zoltan Somogyi	59b11f84ce	Update the debugger test directory. Replace __ with . as the module qualifier symbol. Replace references to io.state with just io. Replace DCGs with state variables. Replace (C->T;E) syntax for if-then-elses with (if C then T else E) syntax. Replace if-then-elses with switches when possible and where this does not affect what is being tested. Replace separate pred and mode declarations with predmode declarations. Put predicate and function declarations just before the definition of the predicate or function. Delete unneeded module qualifications on predicate and function declarations and definitions. Update .exp files (and if needed, .inp files) for the line number changes that result from the above. For tests that have more than one .exp file and where one of those files is affected by the above, add a section to the source file header that says which .exp file is for what grade, with XXXs for the (as yet) unknown parts.	2018-08-28 21:20:59 +10:00
Zoltan Somogyi	33eb3028f5	Clean up the tests in half the test directories. tests/accumulator/*.m: tests/analysis_*/*.m: tests/benchmarks/*.m: tests/debugger/*.{m,exp,inp}: tests/declarative_debugger/*.{m,exp,inp}: tests/dppd/*.m: tests/exceptions/*.m: tests/general/*.m: tests/grade_subdirs/*.m: tests/hard_coded/*.m: Make these tests use four-space indentation, and ensure that each module is imported on its own line. (I intend to use the latter to figure out which subdirectories' tests can be executed in parallel.) These changes usually move code to different lines. For the debugger tests, specify the new line numbers in .inp files and expect them in .exp files.	2015-02-14 20:14:03 +11:00
Peter Wang	3788a9d6fb	Improve Unicode support. Branches: main Improve Unicode support. Declare that we use the Unicode character set, and UTF-8 or UTF-16 for the internal string representation (depending on the backend). User code may be written to those assumptions. Other external encodings can be supported in the future by translating to/from Unicode internally. The `char' type now represents a Unicode code point. NOTE: questions about how to handle unpaired surrogate code points, etc. have been left for later. library/char.m: Define a `char' to be a Unicode code point and extend ranges appropriately. Add predicates: to_utf8, to_utf16, is_surrogate, is_noncharacter. Update some documentation. library/io.m: Declare I/O predicates on text streams to read/write code points, not ambiguous "characters". Text files are expected to use UTF-8 encoding. Supporting other encodings is for future work. Update the C and Erlang implementations to understand UTF-8 encoding. Update Java and C# implementations to read/write code points (Mercury char) instead of UTF-16 code units. Add `may_not_duplicate' attributes to some foreign_procs. Improve Erlang implementations of seeking and getting the stream size. library/string.m: Declare the string representations, as described earlier. Distinguish between code units and code points everywhere. Existing functions and predicates which take offset and length arguments continue to take them in terms of code units. Add procedures: count_code_units, count_codepoints, codepoint_offset, to_code_unit_list, from_code_unit_list, index_next, unsafe_index_next, unsafe_prev_index, unsafe_index_code_unit, split_by_codepoint, left_by_codepoint, right_by_codepoint, substring_by_codepoint. Make index, index_det call error/1 if an illegal sequence is detected, as they already do for invalid offsets. Clarify that is_all_alpha, is_all_alnum_or_underscore, is_alnum_or_underscore only succeed for the ASCII characters under each of those categories. 
Clarify that whitespace stripping functions only strip whitespace characters in the ASCII range. Add comments about the future treatment of surrogate code points (not yet implemented). Use Mercury format implementation when necessary instead of `sprintf'. The %c specifier does not work for code points which require multi-byte representation. The field width modifier for %s only works if the string contains only single-byte code points. library/lexer.m: Conform to string encoding changes. Simplify code dealing with \uNNNN escapes now that encoding/decoding is handled by the string module. library/term_io.m: Allow code points above 126 directly in Mercury source. NOTE: \x and \o codes are treated as code points by this change. runtime/mercury_types.h: Redefine `MR_Char' to be `int' to hold a Unicode code point. `MR_String' has to be defined as a pointer to `char' instead of a pointer to `MR_Char'. Some C foreign code will be affected by this change. runtime/mercury_string.c: runtime/mercury_string.h: Add UTF-8 helper routines and macros. Make hash routines conform to type changes. compiler/c_util.m: Fix output_quoted_string_lang so that it correctly outputs non-ASCII characters for each of the target languages. Fix quote_char for non-ASCII characters. compiler/elds_to_erlang.m: Write out code points above 126 normally instead of using escape syntax. Conform to string encoding changes. compiler/mlds_to_cs.m: Change Mercury `char' to be represented by C# `int'. compiler/mlds_to_java.m: Change Mercury `char' to be represented by Java `int'. doc/reference_manual.texi: Uncomment description of \u and \U escapes in string literals. Update description of C# and Java representations for Mercury `char' which are now `int'. tests/debugger/tailrec1.m: Conform to renaming. tests/general/string_replace.exp: tests/general/string_replace.m: Test non-ASCII characters to string.replace. 
tests/general/string_test.exp: tests/general/string_test.m: Test non-ASCII characters to string.duplicate_char, string.pad_right, string.pad_left, string.format_table. tests/hard_coded/char_unicode.exp: tests/hard_coded/char_unicode.m: Add test for new procedures in `char' module. tests/hard_coded/contains_char_2.m: Test non-ASCII characters to string.contains_char. tests/hard_coded/nonascii.exp: tests/hard_coded/nonascii.m: tests/hard_coded/nonascii_gen.c: Add code points above 255 to this test case. Change test data encoding to UTF-8. tests/hard_coded/string_class.exp: tests/hard_coded/string_class.m: Add test case for string.is_alpha, etc. tests/hard_coded/string_codepoint.exp: tests/hard_coded/string_codepoint.exp2: tests/hard_coded/string_codepoint.m: Add test case for new string procedures dealing with code points. tests/hard_coded/string_first_char.exp: tests/hard_coded/string_first_char.m: Add test case for all modes of string.first_char. tests/hard_coded/string_hash.m: Don't use buggy random.random/5 predicate which can overflow on a large range (such as the range of code points). tests/hard_coded/string_presuffix.exp: tests/hard_coded/string_presuffix.m: Add test case for string.prefix, string.suffix, etc. tests/hard_coded/string_set_char.m: Test non-ASCII characters to string.set_char. tests/hard_coded/string_strip.exp: tests/hard_coded/string_strip.m: Test non-ASCII characters to string stripping procedures. tests/hard_coded/string_sub_string_search.m: Test non-ASCII characters to string.sub_string_search. tests/hard_coded/unicode_test.exp: Update expected output due to change of behaviour of `string.to_char_list'. tests/hard_coded/unicode_test.m: Test non-ASCII character in separator string argument to string.join_list. tests/hard_coded/utf8_io.exp: tests/hard_coded/utf8_io.m: Add tests for UTF-8 I/O. tests/hard_coded/words_separator.exp: tests/hard_coded/words_separator.m: Add test case for `string.words_separator'. 
tests/hard_coded/Mmakefile: Add new test cases. Make special_char test case run on all backends. tests/hard_coded/special_char.exp: tests/valid/mercury_java_parser_follow_code_bug.m: Reencode these files in UTF-8. NEWS: Add a news entry.	2011-04-04 07:10:42 +00:00
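The distinction between code units and code points that the commit above introduces can be sketched as follows. This is a minimal illustration in Python, not the Mercury library: the function names only echo the Mercury procedure names from the log, and their signatures and behaviour here are assumptions made for the example.

```python
# Illustrative Python analogues; these are NOT the Mercury library procedures.

def count_codepoints(s: str) -> int:
    """Number of Unicode code points (a Python str is a code point sequence)."""
    return len(s)


def count_code_units_utf8(s: str) -> int:
    """Number of 8-bit code units when the string is encoded as UTF-8."""
    return len(s.encode("utf-8"))


def count_code_units_utf16(s: str) -> int:
    """Number of 16-bit code units when the string is encoded as UTF-16."""
    return len(s.encode("utf-16-le")) // 2


def codepoint_offset_utf8(s: str, n: int) -> int:
    """UTF-8 code unit offset at which the n-th (0-based) code point starts."""
    if not 0 <= n < len(s):
        raise IndexError("code point index out of range")
    return len(s[:n].encode("utf-8"))


# One code point each of 1, 2, 3 and 4 UTF-8 code units.
sample = "a\u00e9\u20ac\U0001F600"

print(count_codepoints(sample))          # code points
print(count_code_units_utf8(sample))     # UTF-8 code units
print(count_code_units_utf16(sample))    # UTF-16 code units
print(codepoint_offset_utf8(sample, 3))  # where the last code point starts
```

The same string has different lengths depending on the unit counted, which is why the log notes that offset and length arguments remain in code units while the new `_by_codepoint` procedures work in code points.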
Zoltan Somogyi	53286dd4bf	Implement a new compiler option, --exec-trace-tail-rec, that preserves direct Estimated hours taken: 30 Branches: main Implement a new compiler option, --exec-trace-tail-rec, that preserves direct tail recursion in det and semidet procedures even when debugging is enabled. This should allow the debugging of programs that previously ran out of stack. The problem arose because even a directly tail-recursive call had some code after it: the code for the EXIT event, like this:

    p:
        incr_sp
        fill in the usual debug slots
        CALL EVENT
        ...
        /* tail call */
        move arguments to registers as usual
        call p, return to p_ret
    p_ret:
        /* code to move output arguments to right registers is empty */
        EXIT EVENT
        decr_sp
        return

If the new option is enabled, the compiler will now generate code like this:

    p:
        incr_sp
        fill in the usual debug slots
        fill in new "stack frame reuse count" slot with 0
        CALL EVENT
    p_1:
        ...
        /* tail call */
        move arguments to registers as usual
        update the usual debug slots
        increment the "stack frame reuse count" slot
        TAILCALL EVENT
        goto p_1

The new TAIL event takes place in the caller's stack frame, so that the local variables of the caller are available. This includes the arguments of the recursive call (though if they are unnamed variables, the debugger will not show them). The TAIL event serves as a replacement for the CALL event of the recursive invocation. compiler/options.m: Add the new option. compiler/handle_options.m: Handle an implication of the new option: the declarative debugger does not (yet) understand TAIL events. compiler/mark_tail_calls.m: New module to mark directly tail recursive calls and the procedures containing them as such. compiler/hlds.m: compiler/notes/compiler_design.html: Mention the new module. compiler/mercury_compile.m: Invoke the new module when the new option asks us to. compiler/hlds_goal.m: Add the feature used to mark tail recursive calls for the debugger.
Rename an existing feature with a similar but not identical purpose to avoid possible confusion. compiler/hlds_pred.m: Add a field to proc_infos that says whether the procedure contains tail recursive calls. Minor style improvements. compiler/passes_aux.m: Minor change to accommodate the needs of the new module. compiler/code_info.m: Transmit the information from mark_tail_calls to the code generator. compiler/call_gen.m: Implement the new option. compiler/trace_gen.m: Reserve the extra slot needed for the new option. Switch to state variable notation in the code that does the slot allocation, since this is less error-prone than the previous approach. compiler/layout.m: compiler/layout_out.m: compiler/stack_layout.m: Remember what stack slot holds the stack frame reuse counter, for transmission to the runtime system. compiler/proc_gen.m: Add the new label needed for tail recursion. Put the arguments of some procedures into a more logical order. compiler/deep_profiling.m: compiler/deforest.m: compiler/saved_vars.m: compiler/table_gen.m: Conform to the changes above. compiler/trace_params.m: mdbcomp/prim_data.m: runtime/mercury_trace_base.[ch]: Add the new event type. Convert mercury_trace_base.h to four-space indentation. runtime/mercury_stack_layout.h: Add a field to the execution trace information we have for each procedure that gives the number of the stack slot (if any) that holds the stack frame reuse counter. Add a macro to get the value in the counter. Convert this header file to four-space indentation. runtime/mercury_stack_trace.[ch]: When walking the stack, we now have to be prepared to encounter stack frames that have been reused. Modify the algorithms in this module accordingly, and modify the interfaces of the exported functions to allow the functions' callers to behave accordingly as well. Group the information we gather about stack frame for printing into one structure, and document it. Convert the header to four-space indentation. 
library/exception.m: mdbcomp/trace_counts.m: Conform to the changes above. In trace_counts.m, fix an apparent cut-and-paste error (that hasn't caused any test case failures yet). trace/mercury_trace.c: Modify the implementation of the "next" and "finish" commands to accommodate the possibility that the procedure at the selected depth may have had its stack frame reused. In such cases tests/debugger/tailrec1.{m,inp,exp,data}: A new test case to check the handling of tail recursive procedures.	2008-11-25 07:46:57 +00:00
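The frame-reuse scheme described in the commit above can be modelled roughly as a loop that keeps one frame, counts how often it has been reused, and records a TAIL event in place of each recursive CALL. The sketch below is a hypothetical Python model of that idea only; `sum_list_traced` and the event list are inventions for illustration and do not correspond to code generated by the Mercury compiler.

```python
def sum_list_traced(items):
    """Sum a list the way a directly tail-recursive procedure would,
    reusing a single "frame" and recording trace events as it goes."""
    events = ["CALL"]   # one CALL event for the initial invocation
    reuse_count = 0     # models the "stack frame reuse count" slot, set to 0
    total = 0           # accumulator argument of the recursive procedure
    rest = list(items)

    while rest:         # plays the role of the p_1 label
        # "Move arguments": compute the arguments of the tail call.
        total += rest[0]
        rest = rest[1:]
        # Reuse the frame instead of pushing a new one.
        reuse_count += 1
        events.append("TAIL")  # TAIL stands in for the recursive CALL

    return total, reuse_count, events


total, reuse_count, events = sum_list_traced([1, 2, 3])
print(total, reuse_count, events)
```

The reuse counter matters because a stack walker sees only one frame for the whole recursion; the count is what lets it report how many logical invocations that frame stands for.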

4 Commits